AI and LLM Deployment Engineer, RAG Chatbots, FastAPI Backends
Indien
Englisch
Einige Informationen werden in englischer Sprache angezeigt.
Über mich
I deploy open-source LLMs to production — quantized models on GPU infra (RunPod, AWS), streaming FastAPI endpoints, and RAG chatbots grounded in your documents.
What I deliver:
- RAG chatbots that answer from YOUR docs — not hallucinations
- LLM deployment & quantization (Llama, Qwen, Mistral)
- FastAPI backends, automation, document data extraction
- WhatsApp & chat integrations
Every delivery includes a README and reproducible setup — no lock-in.
8+ yrs in software & data engineering. Python, FastAPI, LangChain, PostgreSQL, Docker, AWS.... Mehr lesen