CloudLinux is transforming the Linux infrastructure market by ensuring security and stability for over 500,000 servers worldwide. Our products - CloudLinux OS, TuxCare, and Imunify360 - are the de facto standard in the hosting industry and Enterprise segment.
We are seeking a visionary engineer to lead the evolution of our data platform. In 2025, we are shifting from classic database administration to an Internal Database-as-a-Service (DBaaS) model. We need a specialist who doesn’t just "configure backups," but designs resilient distributed systems, writes code to automate infrastructure, and transforms databases into a reliable service for product teams.
If you are tired of endless tickets and want to build platforms capable of processing petabytes of data, this role is for you.
Your Challenges & Responsibilities:
- DBaaS Architecture: Design and implement a self-service platform based on Terraform and Ansible, enabling the deployment of HA clusters (PostgreSQL and ClickHouse, MongoDB, Redis) in a heterogeneous environment (Bare Metal + OpenNebula + Kubernetes + Public Clouds). You will turn infrastructure into a product.
- Scaling ClickHouse: Manage exponentially growing analytics clusters (12+ clusters, tens of terabytes of data). You will tackle sharding, table engine optimization (ReplicatedMergeTree), and building reliable S3 backup pipelines under high load.
- Data Platform & Analytics Support: Maintain and scale the infrastructure for Apache Airflow and Redash. You will ensure the reliability of ETL pipelines and visualization tools, bridging the gap between raw infrastructure and the data analytics team.
- Reliability as Code: Implement SRE practices in data management. Replace manual incident response with automated self-healing mechanisms. Define and implement SLO/SLI for all databases.
- Stack Modernization: Lead the migration process from legacy solutions to modern cloud patterns. Participate in decision-making regarding the implementation of Kubernetes operators for stateful workloads.
- Expertise & Mentorship: Serve as the technical authority for product teams, helping them optimize data schemas and SQL queries for high-load systems.
Our Tech Stack:
- Databases: PostgreSQL 15+ (Patroni, PgBouncer), ClickHouse (Sharded/Replicated), MongoDB, Redis, Kafka
- Data & Analytics: Apache Airflow, Redash (Infrastructure & Integration).
- Infrastructure: Own 3+DC colocation (OpenNebula, Kubernetes, Bare Metal), AWS, Google Cloud, Azure, DO – Hybrid Cloud.
- Automation & IaC: Terraform, Ansible, Python/Go, GitLab, Jenkins, Gerrit.
- Observability: VictoriaMetrics, Grafana, Loki.
Why CloudLinux?
- Culture: A Remote-first company with an "Employees First" principle. We value results, not hours in the office.
- Impact: Your architectural decisions will determine the stability of services used by thousands of companies around the world.
- Growth: We support professional development and pay for training and conferences.