AI Engineer
We are seeking an AI Engineer to design, fine-tune, deploy, and scale LLM-based systems. You’ll optimize model inference performance and cost, implement parameter-efficient fine-tuning (LoRA/QLoRA/adapters), develop diverse prompt engineering strategies, and build RAG pipelines. You’ll create robust backend infrastructure for AI-powered applications and implement end-to-end MLOps workflows, automated evaluation, and monitoring. You’ll turn experiments into production-grade AI solutions with scalable, low-latency serving across cloud environments. Proficiency in Python, FastAPI, and cloud ML services is required, along with hands-on experience in model serving frameworks (vLLM, TensorRT, SGLang), containerization (Docker/Kubernetes), and distributed training/inference. Strong problem-solving and communication skills and an ownership mindset are essential.
