Hi, I'm Sanjay Chouhan

|

Lead - AI Systems with 8+ years of experience in building scalable AI solutions. I specialize in GenAI, LLMs, MLOps, and distributed systems, while sharing my knowledge through in-depth AI/ML tutorials on YouTube.

Latest Tutorials

Join me on my YouTube channel AI & ML with Sanjay Chouhan for engaging tutorials and in-depth walkthroughs. Explore everything from cutting-edge NLP and LLMs to foundational machine learning principles.

Visit Channel β†—

About Me

I'm a Lead - AI Systems with 8+ years of experience designing and shipping production ML solutions. My expertise spans GenAI, LLMs, NLP, distributed training, and automated MLOps. As a published NLP researcher at ICPR with a Masters from IIIT Guwahati, I transform complex AI challenges into scalable, production-ready systems. Passionate about knowledge sharing, I also create in-depth AI & ML tutorials on YouTube.

🎯

Production ML Expert

8+ years shipping scalable AI solutions to production

πŸš€

Published Researcher

NLP research published at ICPR conference

πŸ’‘

AI & MLOps Specialist

Expert in generative AI, agentic systems, and automation

Technical Expertise

πŸ€–

Generative AI & Agents

LLM Fine-tuning, Pretraining, RAG Systems, Prompt Engineering, Model Optimization, Agentic Systems, AI Agents

Llama Mistral Mixtral AI Agents Huggingface
πŸ’¬

Natural Language Processing

BERT, Transformers, Text Classification, Sentiment Analysis, Language Understanding

Langchain BERT Transformers NLP
🧠

Machine Learning & Deep Learning

Neural Networks, Distributed Training, Model Development, Feature Engineering

PyTorch TensorFlow Scikit-learn
βš™οΈ

MLOps & Automation

CI/CD Pipelines, Model Monitoring, Automated Retraining, Workflow Orchestration

Jenkins Flyte MLFlow DVC
☁️

Cloud & Infrastructure

Distributed Systems, Container Orchestration, Scalable Deployments, Cloud Services

AWS Docker EKS Ray
πŸ“Š

Data Engineering

Big Data Processing, Data Pipelines, Analytics, Database Management

Spark Redshift MongoDB SQL

Work Experience

June 2022 - Present

Lead - AI Systems

ADF Data Science

Leading development of LLM/NLP solutions and distributed ML architectures. Designed scalable AI systems leveraging Jenkins, Ray, Spark, AWS EKS, and Flyte.

  • Developed LLM-based complaint classifier with automated retraining and dockerized deployment
  • Built Enterprise RAG system for internal use exploring multiple AWS approaches
  • Researched and compared Llama/Mistral/Mixtral; established fine-tuning best practices
  • Created low-latency, fault-tolerant model deployment server on AWS (ECS, ALB, ECR)
  • Reduced training time using Spark for data processing and Ray for distributed training
LLMs RAG Jenkins Ray AWS
June 2017 - May 2022

Senior Software Engineer (Machine Learning)

Webkul Software Pvt Ltd

Created ML-powered e-commerce solutions and led development team. Built personalized recommendation systems and NLP-based product features.

  • Created personalized product recommendation system
  • Built BERT-based classifier to flag inappropriate reviews
  • Developed smart Related Products section with NLP
  • Trained and managed team of developers
  • Collaborated on highly successful e-commerce sites (BuyAussieNow, Mealtemple)
BERT NLP Recommendations E-commerce

Education

πŸŽ“

M.Tech in Computer Science & Engineering

Indian Institute of Information Technology Guwahati (IIITG)

2016 - 2018

Specialized in machine learning and artificial intelligence. Published research on HindiLLM at ICPR conference.

Research: Large Language Model for Hindi (ICPR)
πŸ“œ

Diploma in Artificial Intelligence & Machine Learning

University of Hyderabad (UoH)

Completed

Advanced diploma focusing on AI and ML fundamentals, deep learning, and practical applications.

πŸŽ“

B.Tech in Computer Science & Engineering

Dr. A.P.J. Abdul Kalam Technical University (AKTU)

2012 - 2016

Foundation in computer science, algorithms, data structures, and software engineering principles.

Featured Projects

Neural Network Visualization
Deep Learning

Advanced Neural Architecture

Developed a custom neural network architecture achieving 95% accuracy on complex image classification tasks. Implemented novel attention mechanisms for improved performance.

PyTorch CUDA Python
NLP Model
NLP

Intelligent Text Analysis System

Built a state-of-the-art NLP system for multi-language sentiment analysis and entity extraction, processing millions of documents with high accuracy.

Transformers BERT FastAPI
Data Visualization
Data Science

Predictive Analytics Platform

Designed and deployed a comprehensive analytics platform using advanced ML algorithms to forecast trends and provide actionable business insights.

Scikit-learn Pandas AWS

Latest Articles

View All Articles

Let's Connect

I'm always interested in hearing about new opportunities, collaborations, or just chatting about AI and technology.