Tensorwave·29 days ago
Our mission at Tensorwave Cloud is to build seamless, secure, reliable, and resilient AI infrastructure at scale, eliminating barriers and challenging the status quo to empower builders and support AI innovation.
About the role
We are seeking a Network Principal to own the front-end network architecture for large-scale AI and GPU-accelerated infrastructure. This role is responsible for the design, implementation, and evolution of customer-facing, service, and control-plane networks that interface with large GPU and storage clusters.
This role operates as a peer to the Back End Network Principal, with shared responsibility for end-to-end platform scalability, reliability, and performance.
Responsibilities
Own front-end network architecture: DCI, edge, ingress/egress, and control-plane networks
Architect and operate large edge and service networks
Design scalable Ethernet architectures
Define routing, segmentation, and isolation strategies
Lead hands-on deployment, validation, and troubleshooting in new data centers
Define and maintain reference architectures, standards, and long-term growth models
Own relationships with network carriers and service providers
Work in collaboration with the platform team to design and deliver network solutions for Kubernetes-centric use cases
Partner closely with the Back End Network Principal to define clean interface boundaries between front-end and RDMA back-end fabrics
Required Experience
Bachelor of Science in Computer Science, Computer Engineering, or a related technical field, or equivalent practical experience
10+ years of data center networking experience
Proven experience with very large Ethernet fabrics and large-scale edge networks
Strong hands-on experience with BGP, traffic engineering, and high-availability designs
Experience with 100G–400G+ Ethernet environments
Familiarity with optical standards and transceiver types (e.g., 100G/400G/800G, SR/LR/ER, DWDM)
Demonstrated experience working with carriers and service providers to deliver production connectivity
Multi-vendor experience: Juniper, Cisco, Arista, white box
NOS experience: Junos, IOS/IOS-XE, NX-OS, EOS, SONiC
Preferred Experience
Automation or scripting experience in Python, Go, Bash, or equivalent
AI platform or GPU cluster environments
Multi-tenant or customer-facing platforms
Strong familiarity with Kubernetes networking concepts
Exposure to network automation and programmability
100G+ environments
AI, GPU, or HPC exposure
What We Bring
Mission-driven company
Competitive Salary
Stock Options
100% paid Medical, Dental, and Vision insurance
Flexible PTO
Paid Holidays
401(k)
Parental Leave
Flexible Spending Account
Short Term Disability Insurance
Life and Voluntary Supplemental Insurance
Mental Health Benefits through Spring Health
We’re looking for resilient, adaptable people to join our team: people who believe in the mission and think at massive scale. The solutions that worked on a handful of devices will not work at exascale. Be prepared to be pushed daily, to learn a lot, and to literally build the future.
Tensorwave is an equal opportunity employer, committed to fostering an inclusive and supportive workplace. All qualified applicants and candidates will receive consideration for employment without regard to race, color, religion, sex, disability, age, national origin, or veteran status.