Hello, I'm

Saurabh Chauhan

I completed my Master of Science in Computer Science at the University of Illinois Springfield (May 2026, GPA 3.8), building on my Bachelor of Engineering in Computer Engineering from Pune University. Now based in Denver, CO, I am an AI Engineer at MedLaunch Concepts building LLM-powered healthcare AI products. With 4+ years of hands-on experience in AI/ML engineering, I have built and deployed production-grade systems from RAG pipelines and multi-agent LLM workflows to computer vision serving thousands of users with measurable business impact.

Work Authorization: Currently on OPT (started June 2026). Eligible for STEM OPT extension with 3 years total US work authorization.

Get In Touch Download Resume

About Me

AI/ML Engineer

I'm currently pursuing my M.S. in Computer Science at the University of Illinois, building on my B.E. in Computer Engineering from Pune University. With 3+ years of specialized experience in artificial intelligence and machine learning, I've successfully developed and deployed production-grade AI systems that have driven measurable business impact.

At MedLaunch Concepts in Denver, I'm building LLM-powered clinical decision support tools using LangChain, LangGraph, and FastAPI. I design RAG pipelines over proprietary medical knowledge bases (FAISS + OpenAI API) and implement responsible AI guardrails to ensure clinical accuracy and hallucination mitigation by applying production AI skills directly to healthcare device commercialization workflows.

My expertise spans the entire machine learning lifecycle from data preprocessing and feature engineering to model development, optimization, and production deployment. At Product Dossier Solutions (Kytes), I architected a production RAG system using LangChain, Mistral-7B, and FAISS that serves 10,000+ users across six enterprise clients with 95% query accuracy and sub-50ms retrieval latency. I have built AI inference APIs using gRPC integrated with Spring Boot microservices, handling 110,000+ daily requests with 99.5% uptime. I specialize in TensorFlow, PyTorch, LangChain, and deploying ML models on cloud platforms (AWS, GCP) to serve thousands of concurrent users.

Years Experience

10+

Projects

10+

Technologies

Technical Skills

🤖 AI/ML Frameworks

TensorFlow 2.16 PyTorch 2.6 LangChain 2.0 LangGraph Claude SDK Scikit-learn OpenAI API RAG Hugging Face spaCy NLTK

⚙️ Backend Architecture

Microservices System Design gRPC RESTful APIs GraphQL Distributed Systems WebSockets Load Balancing

💻 Programming Languages

Python 3.12+ (Advanced) R 3.6 SQL Java 11 C++ TypeScript

🌐 Web & Frameworks

FastAPI Flask Streamlit Django React.js Next.js

☁️ Cloud & Infrastructure

AWS (ECS, EKS, EC2, S3, Lambda, SageMaker, ) GCP Docker Kubernetes CI/CD (Jenkins, GitHub Actions)

🗄️ Database & Caching

PostgreSQL SQLAlchemy SQL Server 2019 Oracle Redis MongoDB Vector Databases (FAISS, ChromaDB)

🛠️ DevOps & Testing

Git/GitHub Jenkins GitLab CI Robot Framework Playwright Linux/Unix

📊 Data Science & MLOps

Pandas NumPy Apache Airflow PySpark MLflow

Work Experience

June 2026 – Present

AI Engineer

MedLaunch Concepts – Denver, CO

Healthcare LLM Products: Designing and deploying LLM-powered clinical decision support tools using LangChain, LangGraph, and FastAPI; implementing responsible AI guardrails for clinical accuracy and hallucination mitigation in healthcare device commercialization workflows.
RAG Pipeline Engineering: Building RAG pipelines over proprietary medical knowledge bases using FAISS and OpenAI API to surface evidence-based answers with high precision for domain-specific healthcare queries.

2023 – 2024

Software Engineer (AI-ML)

Product Dossier Solutions Pvt. Ltd.

Production RAG System: Architected enterprise RAG system using LangChain, Mistral-7B, and FAISS, integrated with RASA chatbot to serve 10,000+ users across six enterprise clients with 95% query accuracy and sub-50ms retrieval latency.
Distributed ML Pipeline: Migrated legacy single-threaded Apache Airflow DAGs to distributed PySpark architecture, enabling parallel execution that reduced data processing time by 65% for enterprise project management platform.
ML Microservices: Built AI inference APIs using gRPC integrated with Spring Boot microservices, handling 110,000+ daily requests with intelligent caching and load balancing, achieving 99.5% uptime.
Vector Database Implementation: Implemented FAISS vector database for semantic document search, processing 200+ PDFs per client with optimized embedding generation and sub-50ms retrieval latency.

2021 – 2023

Software Engineer

Dasha Kirt Technologies Pvt. Ltd. (D10X)

SaaS Architecture: Programmed a Django multi-tenant workflow application from scratch, contributing 70% to the core codebase and leading the system design.
Data Automation: Created automated data pipelines using Python and SQLAlchemy to ingest daily NIFTY 50 market data, providing clients with immediate access to historical datasets for strategy backtesting and paper trading.
Test Automation Framework: Developed comprehensive test automation framework with 100+ test cases using Robot Framework and Playwright, reducing QA cycle time by 60% through data-driven testing methodology.

Featured Projects

🤟

ASL to Text Recognition

Problem: Real-time American Sign Language translation using computer vision and deep learning.

Solution: Fine-tuned VideoMAE transformer on 239-class ASL dataset, improving accuracy from 62% to 82% (+32% improvement) through novel Universal Temporal Sub-sampling technique optimized for GPU-constrained training. Implemented AdamW optimizer and cosine learning rate decay for optimal convergence. Supports multi-sign prediction from long-form videos with temporal context awareness.

Python PyTorch VideoMAE Hugging Face Computer Vision

View on GitHub →

🏛️

Montgomery AI Navigator Hackathon

Problem: Montgomery County residents struggle to navigate complex government services and information.

Solution: Built a Retrieve-Reason-Validate multi-agent pipeline using LangGraph and Google Gemini 2.5 Flash that decomposes queries, retrieves county-specific information, reasons over it, and validates answers before responding. React 18 + TypeScript frontend with FastAPI backend — designed and shipped end-to-end in a 24-hour hackathon.

Python LangGraph Gemini 2.5 Flash FastAPI React 18 TypeScript

View on GitHub →

🔬

Research Assistant (Multi-Agent AI System)

Problem: Standard RAG systems perform single-chain retrieval with no source credibility evaluation or confidence scoring, producing unreliable research outputs.

Solution: Built a production-grade multi-agent research system using AutoGen where specialized agents divide the pipeline - one retrieves and searches sources, a second evaluates credibility and assigns confidence scores, and a third synthesizes structured outputs with executive summaries, consensus/disagreement analysis, and numbered citations. Deployed on Amazon EKS with two replicas per service, ALB ingress, and zero-downtime rolling deployments representing a complete MLOps workflow.

Python AutoGen LangChain Mistral-7B FastAPI React Docker AWS EKS

View on GitHub →

🍽️

Restaurant POS System

Problem: Restaurants need real-time order management across multiple terminals. Solution: Developed high-concurrency Point of Sale system using Django/PostgreSQL backend with JavaScript frontend, implementing real-time WebSocket synchronization to handle 50+ concurrent orders. Built secure role-based access control (RBAC) with 6 distinct user roles (admin, manager, waiter, kitchen, cashier, inventory). System processes 500+ daily transactions with 99.9% uptime and sub-200ms response times.

Django PostgreSQL WebSocket JavaScript RBAC

View on GitHub →

📄

Invoice Processing Scanner

Problem: Manual invoice processing bottlenecks accounting workflows. Solution: Built end-to-end invoice extraction pipeline with Flask REST API, integrating GPU-accelerated Tesseract OCR and custom SpaCy NER model. Designed RESTful endpoints with proper error handling, request validation, and rate limiting. Achieved 95% accuracy extracting 8 key financial entities from 200+ documents with sub-2-second processing time per invoice.

Flask REST API PostgreSQL Docker OCR

View on GitHub →

Saurabh Chauhan

About Me

AI/ML Engineer

Backend Engineer

Technical Skills

🤖 AI/ML Frameworks

⚙️ Backend Architecture

💻 Programming Languages

🌐 Web & Frameworks

☁️ Cloud & Infrastructure

🗄️ Database & Caching

🛠️ DevOps & Testing

📊 Data Science & MLOps

Work Experience

AI Engineer

Software Engineer (AI-ML)

Software Engineer

Featured Projects

ASL to Text Recognition

Montgomery AI Navigator Hackathon

Research Assistant (Multi-Agent AI System)

Restaurant POS System

Invoice Processing Scanner

Get In Touch

Location

Email

Phone