Skip to content
View questinrest's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report questinrest

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
questinrest/README.md
Aman Mishra — AI/ML Engineer

LinkedIn   Email   Portfolio   Newsletter

Profile Views

About

AI/ML Engineer specializing in Agentic Architectures, Production RAG systems, and LLM Observability. Proven track record of building autonomous multi-agent pipelines, real-time feedback loops, and highly scalable backend infrastructure. Strong foundation in machine learning, grounded in first-principles execution—from orchestrating distributed, stateful LLM deployments to building zero-shot machine unlearning frameworks. Passionate about engineering reliable, deterministic, and tightly monitored GenAI solutions that solve complex enterprise challenges.

Master's in Data Science — Defence Institute of Advanced Technology (DIAT), Pune

Bachelor's in Mathematics — Indira Gandhi National Open University (IGNOU), New Delhi


Experience

Aug 2025
Jan 2026
AI Engineer — Evntro · Observo Tech Studio
• Contributed to a production-grade agentic B2B outreach system integrating email sending, receiving, replying, and campaign-level tracking across multiple mailboxes
• Designed and implemented the LLM tool layer (send/track/reply) with persistent conversation state in PostgreSQL, enabling autonomous, multi-threaded communication workflows
• Engineered a real-time feedback pipeline capturing 16+ delivery and engagement signals, enabling adaptive, data-driven campaign optimization
• Developed a context-aware RAG pipeline leveraging prospect data for personalized email generation, improving relevance over template-based outreach
• Refactored the FastAPI backend to resolve critical issues and deployed services on Azure Function Apps, enabling a scalable, event-driven system
Apr 2025
Jul 2025
GenAI Engineer (R&D) · Confedo AI
• Built multi-stage RAG pipelines and agentic workflows to simulate real-world LLM usage and enable systematic evaluation of faithfulness, relevance, and failure modes in complex pipelines
• Designed advanced retrieval pipelines (query routing, HyDE, reranking) to generate challenging, multistep queries, improving robustness and realism of evaluation benchmarks
• Reverse engineered 3 LLM observability frameworks to understand tracing architectures and extract design insights for building in-house evaluation and observability tooling
• Built a PoC for an in-house LLM tracing system to validate design decisions and inform the product team in developing a production-grade tracing SDK
Jan 2025
Jun 2025
Research Intern · Usable Security Group (USG) Lab @ IIIT Delhi
• Designed and validated a data-free, zero-shot machine unlearning framework to enable class-level data removal without requiring access to original training data, supporting privacy compliance (e.g., GDPR)
• Leveraged contrastive model inversion to generate 5K synthetic samples, enabling effective targeted unlearning while preserving model performance on unaffected classes
• Achieved 90%+ unlearning efficacy across 8 classes on the SVHN dataset, while maintaining 95%+ accuracy on retained classes, minimizing unintended knowledge loss
• Improved unlearning efficiency with a 77% reduction in runtime compared to the GKT baseline, enhancing feasibility for real-world deployment
• Conducted comparative evaluation against 4 baseline methods (EMMN, GKT, zMuGAN, retraining) and explored subset/coreset strategies to optimize scalability and effectiveness of unlearning

Tech Stack

Languages Python  SQL
ML / DL PyTorch  Hugging Face  scikit-learn  YOLO  Weights & Biases
LLM / RAG LangChain  LangGraph  LangSmith  Pinecone  FAISS  Groq  Ollama
Backend FastAPI  Docker  Celery  Redis  REST APIs
Cloud Azure  Supabase  Linux  Git
Data PostgreSQL  MongoDB  Pandas

Featured Projects

Enterprise-grade Multi-Tiered Retrieval-Augmented Generation System. Integrates dynamic fallback strategies and diverse retrieval architectures to ensure high-fidelity context extraction across complex datasets.

Python  LangChain  Vector DB

Custom-built, local-only LLM tracing and observability tooling. Engineered for async agentic workflows using contextvars to provide granular telemetry, span tracking, and metric evaluation with minimal overhead.

Python  LLMOps  Observability

Multi-path autonomous RAG pipeline featuring intelligent query routing, self-corrective retrieval loops, and iterative web search fallbacks to actively detect and eliminate hallucinations in real time.

Python  LangGraph  Groq

On-device RAG engineered around LEANN's graph-based search. Indexes millions of documents locally in gigabytes without significant accuracy loss, achieving zero cloud dependency and strict privacy.

Python  LEANN  Graph Search

Structure-aware documentation RAG pipeline built from scratch. Crawls 104 FastAPI pages, preserves section hierarchy, generates 971 metadata-rich chunks, and delivers citation-grounded answers via Pinecone + Groq.

Python  LangChain  Pinecone  FastAPI

RAG system with SHA-256 document deduplication, namespace isolation, semantic reranking, and inline-cited answers. Upload a 200-page manual, ask a question, get a precise cited response.

Python  Pinecone  FastAPI  PyMuPDF


Currently Working On

> Engineering multi-tiered, self-correcting Agentic RAG architectures
> Developing lightweight, local-only LLM observability and tracing SDKs
> Exploring hardware-efficient, vectorless RAG approaches (PageIndex & LEANN)
> Scaling distributed, stateful LLM tool layers for autonomous workflows
Footer

Pinned Loading

  1. TierRAG TierRAG Public

    Multi-Tiered Retrieval-Augmented Generation System

    Python 1

  2. precision-rag-with-deduplication precision-rag-with-deduplication Public

    Jupyter Notebook

  3. doc-struct-rag doc-struct-rag Public

    Structure-aware documentation RAG pipeline with custom ingestion, hierarchical chunking, and citation-grounded answers.

    Jupyter Notebook

  4. offline-rag-bot offline-rag-bot Public

    Jupyter Notebook

  5. transformer_from_scratch transformer_from_scratch Public

    Reimplementation of decoder based transformer in pure PyTorch

    Jupyter Notebook