Job Description
We are building the infrastructure for the 2026 AI revolution. As autonomous agents and multimodal models reshape the industry, we are seeking a visionary AI Systems Architect to design scalable, high-performance backend systems. You will work at the intersection of machine learning and distributed systems, ensuring our AI can handle petabyte-scale data while maintaining sub-millisecond latency. Join us in defining the standard for future-proof artificial intelligence.
Responsibilities
- Design and deploy scalable inference engines for next-generation large language models and autonomous agents.
- Optimize deep learning workloads using Rust, Go, or C++ to maximize GPU utilization.
- Architect fault-tolerant, distributed systems capable of handling real-time, high-throughput data streams.
- Collaborate with ML researchers to translate theoretical models into production-ready software.
- Implement robust CI/CD pipelines for model versioning and A/B testing infrastructure.
- Drive technical strategy for the 2026 roadmap, identifying gaps in the current stack and proposing innovative solutions.
Qualifications
- 8+ years of experience in backend engineering or distributed systems architecture.
- Deep expertise in Python, PyTorch, and at least one systems language (Rust, Go, or C++).
- Strong understanding of Kubernetes, Docker, and cloud-native architectures (AWS/GCP).
- Experience deploying and optimizing high-performance GPU clusters.
- Proven track record of working in high-growth, fast-paced startup environments.
- Excellent communication skills, including the ability to explain complex technical concepts to non-technical stakeholders.