Job Description
Are you ready to architect the backbone of the next generation of Artificial Intelligence?
Nexus Horizon Systems is at the forefront of innovation, preparing for the paradigm shift of 2026. We are seeking a visionary Lead AI Infrastructure Architect to design scalable, resilient, and high-performance computing environments that will power the future of generative models.
In this role, you will not just manage servers; you will engineer the ecosystems that allow AI to scale from prototypes to production-ready global platforms. If you are passionate about pushing the boundaries of what is possible in cloud infrastructure and machine learning, we want to meet you.
Responsibilities
- Design and implement next-generation AI inference pipelines capable of handling petabyte-scale data volumes.
- Lead the architectural strategy for GPU clusters and distributed computing frameworks to optimize model training and deployment efficiency.
- Collaborate with ML researchers to translate complex model requirements into robust, scalable infrastructure solutions.
- Ensure system security, compliance, and high availability across all cloud and on-premises environments.
- Drive the adoption of cutting-edge technologies, including quantum-ready architectures and edge computing nodes.
Qualifications
- Bachelor’s or Master’s degree in Computer Science, Electrical Engineering, or a related technical field.
- 10+ years of experience in infrastructure engineering, with at least 5 years specifically focused on Machine Learning Operations (MLOps) and AI infrastructure.
- Deep expertise in Python, PyTorch, TensorFlow, and modern containerization technologies (Docker, Kubernetes).
- Strong proficiency in cloud platforms (AWS, GCP, or Azure) with a focus on high-performance computing (HPC) services.
- Experience with cost optimization strategies for large-scale AI workloads.