Job Description

At Okala, we are looking to hire a skilled and proactive Site Reliability Engineer (SRE) who will play a critical role in maintaining system reliability, improving automation, and supporting scalable production environments.
With a hands-on mindset, you will help drive operational excellence and ensure high availability of our services.

Your story belongs here.

What You'll Be Doing:

Deploy updates, patches, and fixes while providing advanced technical support

Design and build automation tools to reduce errors and enhance system stability and customer experience

Perform root cause analysis on production incidents and resolve complex technical issues

Develop scripts to automate repetitive operational tasks

Design and maintain procedures for system troubleshooting and preventive maintenance

Rapidly build effective working relationships with teams and drive issues toward resolution through clear communication

Configure and maintain monitoring and alerting systems to ensure service reliability

Participate in on-call rotations to support 24/7 operational environments

What You Bring:

Bachelor’s degree in Computer Science, Engineering, or a related field (or equivalent practical experience)

At least 1 year of hands-on experience in Site Reliability Engineering, DevOps, or a closely related role

Strong expertise in Linux system administration and troubleshooting in production environments

Excellent scripting and automation skills using Bash, Python, or similar languages

Solid understanding of databases, storage systems, and SQL

Deep knowledge of containerization and orchestration, especially Kubernetes

Strong experience designing and maintaining CI/CD pipelines (GitLab CI/CD preferred)

Experience implementing automated deployment, testing, and rollback strategies

Hands-on experience with monitoring, logging, and alerting (observability) tools

Familiarity with configuration management and infrastructure automation tools such as Ansible

Strong understanding of networking concepts (DNS, routing, firewalls, load balancing)

Experience with distributed systems and event streaming platforms (e.g., Kafka)

Strong problem-solving skills including incident response, root cause analysis, and reliability improvement

Proactive, self-driven, and solution-oriented mindset

Why Okala?

Because at Okala, you’re not just maintaining systems — you’re helping power one of Iran’s leading online retail platforms.
With services across 200+ cities and millions of daily users, we move fast, learn continuously, and are always looking for smarter, more scalable solutions.

What You’ll Enjoy:

A dynamic and friendly work environment

Weekly gatherings: Mafia Night, Cinema Night & more

Access to practical and high-quality training programs

Free breakfast, commuting & lunch subsidies

Supplementary health insurance, in-house doctor, and parking

Birthday gifts, seasonal bonuses, and exclusive Okala discount codes

Ready to build, automate, and scale with us?
Let’s write the next chapter of success together.
