Sanjay Chouhan

EXPERIENCE

Machine Learning Engineer (Generative AI) @ ADF Data Science

Jun 2022 – Present

— Exploring LLMs such as Llama, Mistral, Mixtral, and Dolly for different use cases, and evaluating the best approaches to fine-tuning and inference.
— Created a knowledge management system by training and fine-tuning a Llama 2 model.
— Automating the generation of model development, validation, and monitoring documents with an LLM.
— Classifying communications between agents and clients as potential or non-potential complaints using an LLM.
— Building tools and infrastructure to support a scalable, time- and cost-efficient distributed modeling architecture with Jenkins, Ray, AWS, EKS, Flyte, and Spark.
— Worked on affiliate models to attract the right customers. Trained multiple models in a distributed environment, logged experiments with MLflow, and scheduled a periodic retraining pipeline with Jenkins.
— Worked on a net-response model to target the right customers. Reduced training time by using Spark to collect and process data from Redshift and by using data-parallel model training with Ray.
— Created a low-latency, scalable, Dockerized, fault-tolerant model deployment server on AWS with ECS, ALB, ECR, Go, FastAPI, Redis, CloudWatch, and other tools.
— Working on an internal ML library to handle the team's custom needs.
— Guiding a team of ML engineers and helping data scientists.
— Automating tasks such as retraining, prediction, and data updates with data and model pipelines.

Senior Software Engineer (Machine Learning) @ Webkul

Dec 2020 – Apr 2022

— Built a fast image-based product search mechanism with a convolutional neural network.
— Created a personalized product recommendation system.
— Built a BERT-based classifier to flag inappropriate reviews.
— Built a smart Related Products section with NLP.
— Trained and managed a small team of developers.

Software Engineer @ Webkul

Jun 2017 – Dec 2020

— Created an A/B testing mechanism to test the effectiveness of UI updates.
— Wrote the Magento Development Guide used for training new employees.
— Collaborated on and developed the highly successful e-commerce site BuyAussieNow.
— Created the hyperlocal food delivery platform Mealtemple.
— Helped 100+ e-commerce businesses deliver a smooth experience to their customers.
— Contributed to the Magento 2 codebase.

EDUCATION

M.Tech (Computer Science & Engineering)

Indian Institute of Information Technology Guwahati (IIITG)

2022 - 2024

Diploma in AI and ML at PG Level

University of Hyderabad (UoH)

2021 - 2022

B.Tech (Computer Science & Engineering)

Dr. A.P.J. Abdul Kalam Technical University (AKTU)

2013 - 2017

10+2 and Intermediate

Jawahar Navodaya Vidyalaya (JNV), Morigaon

2009 - 2013

SKILLS