We are looking for a highly experienced and hands-on AI Lead to drive the development of advanced AI solutions, with a core focus on Large Language Models (LLMs), Retrieval-Augmented Generation (RAG), and agentic AI systems. The ideal candidate will lead a small team, architect scalable systems, and collaborate across departments to build production-grade AI applications.
Lead and architect LLM projects including fine-tuning, distillation, and deployment.
Design and prototype RAG and GraphRAG pipelines with vector and graph databases.
Build agentic AI frameworks using LangGraph, CrewAI, AutoGen, and Pydantic agents.
Mentor junior engineers and guide technical research and implementation strategies.
Optimize inference pipelines via quantization, pruning, and caching.
Evaluate models such as GPT, Claude, Mistral, LLaMA for custom use cases.
Experiment with multimodal inputs, document processing, and prompt engineering.
Deep understanding of LLM internals, transformer architecture, Hugging Face ecosystem.
Proficient in Python, Pydantic, FastAPI (or equivalent frameworks).
Experience with LangChain, LangGraph, CrewAI, AutoGen, or similar.
Familiar with RAG systems, vector databases (FAISS, Pinecone, Weaviate), and graph DBs.
Strong hands-on skills with LoRA, QLoRA, PEFT, and model evaluation.
Knowledge of Docker, cloud platforms (GCP, AWS, Azure), and GPU orchestration.
Minimum 3 years of total experience, with 2+ years in LLM-focused R&D.
Bachelor’s or Master’s degree in Computer Science, AI, Data Science, or relevant field.