We are looking for an experienced engineer with strong Linux and system-level expertise who can operate autonomously in complex production environments. You must be able to independently troubleshoot incidents, lead and support post-incident service recovery, and drive improvements to overall system stability, performance, and observability. We are looking for a hands-on Site Reliability Engineer (SRE) with a strong background in Linux infrastructure and third-party system operations.

This role focuses on managing and optimizing large-scale environments (5,000+ hosts) running technologies like Kafka, Redis, and Kubernetes.

The position does not involve application development but requires deep operational expertise and solid troubleshooting skills.

Senior Site Reliability Engineer

Related Jobs

Staff Engineer, Systems & DevOps

Senior Backend Engineer - Pharmacy

Senior Software Engineer, Streaming Platform

Senior Software Engineer - Developer Experience (open to remote across ANZ)

AI Engineer, Data Replication

Senior Threat Detection Engineer - Tooling and Automation (ANZ remote)

Staff Backend Engineer, Forward Deployed

Staff Frontend Engineer, Forward Deployed

Sr. Manager, Developer Experience

Senior Frontend Engineer - Pharmacy

Browse Similar Jobs