Skip to content
View sanjaychelliah's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report sanjaychelliah

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
sanjaychelliah/README.md

Senior ML Engineer Β· LLM Infrastructure Β· AI Platform

LinkedIn GitHub Email Chennai


🧠 About Me

Building the infrastructure that makes AI run fast, cheap, and reliably at scale.

I'm a Senior MLOps / AI Platform Engineer with 5+ years shipping production systems across two deep specializations:

  • πŸš€ LLM Inference Infrastructure β€” vLLM, SGLang, MCP-based agents, RAG architectures.
  • πŸ‘οΈ Computer Vision Pipelines β€” Real-time object detection, multi-object tracking, segmentation at millions of frames per week.

I've led teams of 4+ engineers, contributed to open-source SDKs increasing downloads by 10x, and pushed models to top throughput performance.


⚑ Career Highlights

Achievement Detail
πŸ† LLM Inference Optimization Low Latency, High Throughput Models
πŸ“¦ 10x SDK Growth Contributed to Clarifai Python SDK driving 10x download increase
🎯 0.97 mAP Car Dent Detection & Segmentation for insurance client
🧠 3x Latency Reduction Multi-modal RAG platform over 1M+ document knowledge base
πŸ… NVIDIA Hackathon Smart City Hackathon (Asia-Pacific) β€” Pothole Detection with RT-DETR
βš™οΈ 80% GPU Memory Savings LoRA/PEFT adapters enabling cost-efficient production fine-tuning

πŸ› οΈ Tech Stack

LLM & GenAI

vLLM SGLang LangChain RAG MCP LoRA TensorRT-LLM HuggingFace

Models I've Worked With

GPT Claude Llama Qwen DeepSeek Minimax Gemma Whisper

Computer Vision

OpenCV YOLO SAM PyTorch TensorFlow

MLOps & Infrastructure

Docker Kubernetes GitHub Actions ONNX TensorRT AWS Azure GCP

Data & Storage

PostgreSQL Qdrant Python


πŸ’Ό Experience

πŸ”· Senior MLOps Engineer β€” Clarifai (Mar 2024 – Present)

Built and optimized Clarifai's LLM inference engine to world-class benchmark performance.

  • Experience in vLLM, SGLang, MCP-based agents, RAG architectures
  • Architected MCP-based agent serving with LLM-as-a-judge evaluation and HITL feedback loops
  • Built sports analytics pipelines (Object Detection, MOT) achieving 90% MOTA at millions of frames/week
  • Implemented LoRA/PEFT reducing GPU memory by up to 80% vs full-parameter fine-tuning
  • Drove 10x SDK download growth via Clarifai Python SDK & CLI improvements

πŸ”· MLOps Engineer β€” Clarifai (Apr 2023 – Feb 2024)

  • Designed end-to-end multi-modal RAG platform (LLMs + VLMs) for document Q&A β€” 3x latency reduction over 1M+ document KB
  • Built scalable vector search pipelines with Qdrant for semantic retrieval at production scale

πŸ”· AI Engineer β€” Pavo & Tusker Innovations (Jun 2021 – Mar 2023)

Built the Kandula.ai cognitive computer vision SaaS platform from the ground up.

Project Result
πŸš— Car Dent Detection & Segmentation 0.97 mAP using YOLO-based models for insurance client
πŸ”₯ Fire Detection Droid Real-time YOLO_v3 deployment
πŸ•³οΈ Pothole Detection 95% accuracy on 100k-image dataset with RT-DETR; NVIDIA Hackathon finalist
πŸ‘οΈ Gaze Tracking R&D of SOTA algorithms, heatmap data products for retail analytics
πŸ₯ Crowd Detection Hospital infection rate reduction β€” threshold alerting system
βš™οΈ Model Deployment Exported 50+ models in ONNX/TensorRT with INT8/FP16 quantization

πŸŽ“ Education

Degree Institution Year Grade
M.Sc. Data Science Loyola College, Chennai 2019–2021 8.3 CGPA β€” First Class with Distinction
B.Sc. Mathematics Vivekananda College, Chennai 2016–2019 6.1 CGPA

πŸ“œ Certifications

  • πŸŽ“ Machine Learning β€” Coursera
  • 🐍 Python Data Structures β€” Coursera
  • πŸŽ₯ Building Real-Time Video AI Applications β€” NVIDIA
  • πŸ€– Mastering LLMs β€” Analytics Vidhya

πŸ“Š GitHub Stats

Sanjay's GitHub Stats

Top Languages


πŸš€ Featured Projects

Project Description Stack
Docwhisper Doc Question answering using RAG Python, FastAPI, Ollama
PyTorch Object Detect & Track Real-time multi-object detection and tracking in video Python, PyTorch, YOLO

πŸ”’ Most production work lives in private/org repos at Clarifai β€” benchmarks and results linked in experience above.


πŸ’¬ Let's Build Something

Open to collaborations on LLM infrastructure, computer vision systems, and AI platform engineering.

LinkedIn Email


Pinned Loading

  1. Clarifai/clarifai-python Clarifai/clarifai-python Public

    Experience the power of Clarifai’s AI platform with the python SDK. 🌟 Star to support our work!

    Python 40 8

  2. Clarifai/clarifai-python-datautils Clarifai/clarifai-python-datautils Public

    Extract Transform and Load unstructured data into the Clarifai's AI platform

    Python 7