Job Description
We are at the forefront of the AI revolution, building next-generation generative models that redefine human-machine interaction. We are looking for a visionary Senior AI Engineer to join our elite engineering team in San Francisco. You will be responsible for architecting, training, and deploying state-of-the-art Large Language Models (LLMs) that power our flagship products.
In this role, you will bridge the gap between cutting-edge research and production-scale deployment. If you are passionate about LLMs, fine-tuning, and building ethical AI systems, we want to hear from you.
Responsibilities
- Model Development: Design and implement advanced NLP models, specifically focusing on LLMs, RAG pipelines, and fine-tuning strategies using Transformers.
- Deployment & Scaling: Deploy models to production environments using Kubernetes and MLOps best practices to ensure high availability and low latency.
- Optimization: Optimize model inference speed and accuracy using quantization, pruning, and efficient attention mechanisms.
- Research Integration: Stay abreast of the latest research in arXiv and implement novel techniques to improve our model performance.
- Collaboration: Work closely with data scientists and product managers to translate business requirements into technical AI solutions.
Qualifications
- Education: MS or PhD in Computer Science, Machine Learning, or a related quantitative field.
- Experience: 5+ years of professional experience in AI/ML engineering, with at least 2 years specifically in LLM or NLP.
- Technical Skills: Proficiency in Python, PyTorch, or TensorFlow. Deep understanding of Hugging Face Transformers, LangChain, and vector databases (Pinecone, Milvus).
- Infrastructure: Experience with cloud platforms (AWS/GCP/Azure) and containerization (Docker, Kubernetes).
- Communication: Excellent written and verbal communication skills with the ability to explain complex technical concepts to non-technical stakeholders.