Site Reliability Engineer

Vista Samaneh Asa Tehran

Posted 2 years ago

Job Description

● Identify operational problems by observing and studying system functioning and performance results; investigating complaints and suggestions; completing troubleshooting procedures. ● Develop operational solutions by defining, studying, estimating, and screening alternative solutions. ● Improve operational quality results by studying, evaluating, and recommending process re-design; implementing changes. ● Provide operational management information by collecting, analyzing, and summarizing operating and engineering data and trends. ● Ensure IT services' health and resiliency through monitoring the applications and defining effective monitoring KPIs for operational activities.

Requirements

● Strong understanding of Linux system administration. ● Practical experience with containers and the tools such as Docker and Kubernetes. ● Experience in logging, metrics, monitoring, and alerting tools such as ELK or Prometheus and Alert manager. ● Self-motivated and technically curious.

Employment Type

  • Full Time

Details

To see more jobs that fit your career