Job Description
We are seeking highly motivated and technically skilled students or recent graduates for our Site Reliability Engineer (Internship) program.
Your responsibilities will include:
- Understands basic cloud infrastructure and monitoring tools such as Prometheus, Grafana, or Cloud Monitoring.
- Assists in routine operational tasks, such as system checks and basic incident response under supervision.
- Learns to follow established procedures for system maintenance, patching, and updates.
- Gains foundational knowledge of Service Level Objectives (SLOs) and error budgets.
- Supports documentation of incidents and postmortems.
Requirements:
- Familiarity with at least one cloud provider (e.g., Huawei Cloud, AWS, Azure).
- Exposure to scripting languages (e.g., Python, Bash).
- Basic understanding of Linux/Unix systems and networking fundamentals.
- Knowledge of CI/CD pipelines and version control (Git).
- Strong willingness to learn cloud operations, observability, and automation.
- Excellent communication skills.
- Pragmatic and problem-solving attitude.
- A naturally curious and proactive approach to learning and problem-solving
- Good skills at writing / editing documentation.
- Bsc's degree Computer Science, Software Engineering, a related field, or equivalent practical experience.
Additional requirement:
- Familiarity with OpenStack deployment or operations
- Familiarity with public cloud deployment or operations