Job Description
Are you ready to architect the next generation of intelligent systems? Aether Dynamics is on the hunt for a visionary Senior Generative AI Engineer to join our elite AI Research division. As we accelerate towards the 2026 AI revolution, we are building the infrastructure that will define the future of human-machine interaction. If you thrive on pushing the boundaries of Large Language Models (LLMs), fine-tuning proprietary architectures, and deploying scalable AI solutions, this is your opportunity to lead.
In this role, you won't just be writing code; you will be shaping the cognitive architecture of our products. You will work directly with our Lead Scientists to refine our core models, optimize inference pipelines, and integrate cutting-edge Generative AI into our enterprise ecosystem. Join us in building the intelligent future.
Responsibilities
- Design, develop, and deploy state-of-the-art Generative AI models, focusing on LLMs and diffusion models.
- Optimize model inference latency and cost through advanced quantization, pruning, and distillation techniques.
- Build and maintain robust Retrieval-Augmented Generation (RAG) pipelines to enhance model accuracy and reduce hallucinations.
- Collaborate with cross-functional teams of data scientists, ML engineers, and product managers to translate research into production-ready features.
- Experiment with novel training methodologies, including Reinforcement Learning from Human Feedback (RLHF) and Constitutional AI.
- Ensure the security, ethics, and scalability of all AI deployments within the cloud infrastructure.
Qualifications
- Masterβs degree or PhD in Computer Science, Mathematics, or a related field, or equivalent practical experience.
- 5+ years of professional experience in Machine Learning, Deep Learning, or Natural Language Processing.
- Proficiency in Python, PyTorch, TensorFlow, or JAX with a deep understanding of computational graphs and automatic differentiation.
- Extensive experience fine-tuning pre-trained models (e.g., BERT, GPT, Llama) for specific domain applications.
- Strong understanding of distributed computing, GPU acceleration, and cloud infrastructure (AWS, GCP, or Azure).
- Demonstrated ability to publish in top-tier conferences (NeurIPS, ICML, ACL) or open-source communities.