Job Description
We are on a mission to define the future of intelligent systems. As a Senior Generative AI Architect, you will be at the forefront of developing the next generation of Large Language Models (LLMs) and autonomous agents. Join a team of world-class engineers and researchers dedicated to solving the most complex challenges in artificial intelligence for 2026 and beyond.
Why Join Us?
- Work on cutting-edge Generative AI projects that redefine human-computer interaction.
- Competitive compensation package with equity options.
- Flexible remote-first policy with a vibrant office in San Francisco.
Responsibilities
- Architect and deploy scalable, high-performance Generative AI models tailored for enterprise applications.
- Lead the research and implementation of Retrieval-Augmented Generation (RAG) pipelines and fine-tuning strategies.
- Optimize model inference latency and reduce token costs using advanced quantization and distillation techniques.
- Mentor junior engineers and data scientists, fostering a culture of innovation and technical excellence.
- Collaborate closely with product teams to translate technical requirements into production-ready AI solutions.
Qualifications
- PhD or Masterβs degree in Computer Science, Machine Learning, or a related quantitative field.
- 5+ years of professional experience in building, deploying, and optimizing Large Language Models.
- Deep expertise in Python, PyTorch, TensorFlow, and modern MLOps tools (MLflow, Kubeflow, Weights & Biases).
- Proven track record of working with Transformer architectures and state-of-the-art open-source models.
- Strong understanding of data privacy, security, and ethical AI principles.