Zettabyte Space·about 9 hours ago
At Zettabyte, we’re building the infrastructure layer for the AI-first world. Our mission is to make AI compute ubiquitous, seamless, and limitless by operating a cloud where AI workloads run securely at massive scale—anywhere, anytime.
We run a multi-tenant GPU cloud for AI developers and enterprises. Security isn’t a support function here—it’s a core platform capability.
Zettabyte is scaling a shared, high-performance AI compute platform in a space where traditional cloud security models break down. Multi-tenant GPUs, high-speed networking, and untrusted customer workloads introduce security challenges that don’t have off-the-shelf answers.
We’re hiring a Staff Security Engineer to define and own the security architecture of our platform. You’ll operate with wide latitude, shaping how isolation, detection, and trust are built into the system from day one.
This role is ideal for someone who thrives in early-to-mid stage environments, enjoys working through ambiguity, and wants to build security systems that scale with the business—not slow it down.
Own the end-to-end security architecture for multi-tenant Kubernetes GPU clusters
Design tenant isolation, egress control, and network segmentation across compute, storage, and networking layers
Define and implement runtime security and intrusion detection for untrusted AI workloads
Build security primitives (identity, secrets, encryption, policy enforcement) that platform teams build on
Secure the software supply chain, from CI/CD pipelines to container admission
Lead threat modeling and security design reviews for new platform features
Drive compliance readiness (SOC 2, ISO 27001) without slowing engineering velocity
Act as a force multiplier: unblock teams, set standards, and raise the security bar across the org
Lead security incident response and turn incidents into systemic improvements
7+ years of experience in security engineering for cloud-native, infrastructure, or distributed systems
Deep, hands-on expertise in Kubernetes security (RBAC, PSA, network policies, admission controllers)
Strong understanding of cloud security primitives in AWS, GCP, or Azure
Experience building or operating runtime security and policy enforcement (Falco, Cilium, OPA, Calico, eBPF-based tools)
Solid grounding in network security and zero-trust architectures
Practical experience with secrets management and key systems (Vault, cloud KMS)
Strong automation skills in Go, Python, or Bash
Proven ability to operate autonomously, make architectural decisions, and deliver in ambiguous environments
Experience partnering deeply with platform, infra, and SRE teams
GPU isolation and virtualization security (MIG, SR-IOV)
InfiniBand, RDMA, or high-performance networking
HPC or large-scale multi-tenant compute platforms
Security for AI/ML systems or data-intensive workloads
Incident response leadership or red team experience
Security certifications (CKS, OSCP, CISSP)
Open-source contributions in security or cloud infrastructure
You’ll define the security model, not just implement tickets
You’ll work on real isolation problems in GPU and high-speed networking environments
You’ll influence architecture across the platform, not sit in a silo
You’ll help build a security culture at a company where speed and safety both matter
Competitive salary — commensurate with your experience and aligned with industry standards
Meaningful equity — be part of the upside as we build a category-defining company. Your grant will align with your role and the experience you bring.