Satvik Naren

Data Scientist

A strategic, goal-oriented, and self-motivated Azure-certified Data Scientist with 3.5 years of experience leading the full spectrum of Machine Learning projects. Proficient in deep learning and experienced in managing complex data and feature development within highly agile environments.

Demonstrated success in driving improvements from concept to production while maintaining high standards in code quality, security, and performance. Proven ability to own a project from inception to production, including proposal, discussion, and execution

Tools

Machine Learning Design

Deep Learning

SQL

Machine Learning Design

Deep Learning

SQL

Machine Learning Design

Deep Learning

SQL

Skills

Pytorch and tensorflow

Scikit-learn

Apache PySpark

MLflow and Kubeflow

Git,Docker,Kubernetes

Flask , ChromaDB

Experience

Senior Data Scientist

LAM Research

April 2024-Present

• Developed and fine-tuned QLoRA models to enhance the efficiency and accuracy of Retrieval Augmented Generation (RAG) systems • Led the setup and management of Azure Labelling projects, overseeing the entire lifecycle from initial data labeling to model deployment and operationalization • Spearheaded the creation and validation of machine learning models for Inclusion and Stain classification, ensuring robust performance through thorough model validation and User Acceptance Testing (UAT). • Conducted code reviews to ensure high standards in coding practices, maintainability, and adherence to security operations guidelines, Solved complex technical problems related to model deployment , scaling, and optimization.

Senior Data Scientist

LAM Research

April 2024-Present

• Developed and fine-tuned QLoRA models to enhance the efficiency and accuracy of Retrieval Augmented Generation (RAG) systems • Led the setup and management of Azure Labelling projects, overseeing the entire lifecycle from initial data labeling to model deployment and operationalization • Spearheaded the creation and validation of machine learning models for Inclusion and Stain classification, ensuring robust performance through thorough model validation and User Acceptance Testing (UAT). • Conducted code reviews to ensure high standards in coding practices, maintainability, and adherence to security operations guidelines, Solved complex technical problems related to model deployment , scaling, and optimization.

Senior Data Scientist

LAM Research

April 2024-Present

• Developed and fine-tuned QLoRA models to enhance the efficiency and accuracy of Retrieval Augmented Generation (RAG) systems • Led the setup and management of Azure Labelling projects, overseeing the entire lifecycle from initial data labeling to model deployment and operationalization • Spearheaded the creation and validation of machine learning models for Inclusion and Stain classification, ensuring robust performance through thorough model validation and User Acceptance Testing (UAT). • Conducted code reviews to ensure high standards in coding practices, maintainability, and adherence to security operations guidelines, Solved complex technical problems related to model deployment , scaling, and optimization.

Data Scientist II

Bombay Play

Jan 2023-Mar 2024

• Streamlined Data Processing with GCP: Led architecture and implementation of end-to-end data processing pipelines using Google Cloud Dataflow. Parallelized data ingestion, transformation, and storage with Apache Beam, achieving a remarkable 40% reduction in processing time. • Scalable Data Lake on GCP: Conceptualized and deployed a scalable data lake on Google Cloud Storage. Utilized the Parquet file format to streamline data storage and retrieval, resulting in reduced data footprint and expedited query processing. • Seamless Machine Learning Integration: Collaborated closely with Data Science teams, seamlessly integrating machine learning models into data pipelines. Leveraged Google Cloud ML Engine for model training, leading to a remarkable 25% boost in predictive accuracy. • Real-time Insights with Streaming: Leveraged Apache Kafka and Google Pub/Sub for real-time data streaming, providing stakeholders with instantaneous insights for agile decision-making. • Empowering GCP Mastery Across Teams: Facilitated knowledge transfer sessions, empowering cross-functional teams with GCP tools. Organized workshops on Dataflow, BigQuery, and ML Engine to democratize data insights and enhance agility. • Automated Deployment Excellence: Established CI/CD practices using Google Cloud Build and Cloud Functions, streamlining deployment workflows and ensuring consistent code quality, accelerating development cycles. • Optimized Data Processing with Dataproc: Orchestrated Apache Spark workflows on Google Cloud Dataproc, leveraging parallelized processing and scalability. Achieved a 20% reduction in data processing costs through dynamic cluster scaling. • Visualizing Insights with Looker: Collaborated with stakeholders to design interactive Looker dashboards, empowering informed decision-making with comprehensive visualizations.

Data Scientist II

Bombay Play

Jan 2023-Mar 2024

• Streamlined Data Processing with GCP: Led architecture and implementation of end-to-end data processing pipelines using Google Cloud Dataflow. Parallelized data ingestion, transformation, and storage with Apache Beam, achieving a remarkable 40% reduction in processing time. • Scalable Data Lake on GCP: Conceptualized and deployed a scalable data lake on Google Cloud Storage. Utilized the Parquet file format to streamline data storage and retrieval, resulting in reduced data footprint and expedited query processing. • Seamless Machine Learning Integration: Collaborated closely with Data Science teams, seamlessly integrating machine learning models into data pipelines. Leveraged Google Cloud ML Engine for model training, leading to a remarkable 25% boost in predictive accuracy. • Real-time Insights with Streaming: Leveraged Apache Kafka and Google Pub/Sub for real-time data streaming, providing stakeholders with instantaneous insights for agile decision-making. • Empowering GCP Mastery Across Teams: Facilitated knowledge transfer sessions, empowering cross-functional teams with GCP tools. Organized workshops on Dataflow, BigQuery, and ML Engine to democratize data insights and enhance agility. • Automated Deployment Excellence: Established CI/CD practices using Google Cloud Build and Cloud Functions, streamlining deployment workflows and ensuring consistent code quality, accelerating development cycles. • Optimized Data Processing with Dataproc: Orchestrated Apache Spark workflows on Google Cloud Dataproc, leveraging parallelized processing and scalability. Achieved a 20% reduction in data processing costs through dynamic cluster scaling. • Visualizing Insights with Looker: Collaborated with stakeholders to design interactive Looker dashboards, empowering informed decision-making with comprehensive visualizations.

Data Scientist II

Bombay Play

Jan 2023-Mar 2024

• Streamlined Data Processing with GCP: Led architecture and implementation of end-to-end data processing pipelines using Google Cloud Dataflow. Parallelized data ingestion, transformation, and storage with Apache Beam, achieving a remarkable 40% reduction in processing time. • Scalable Data Lake on GCP: Conceptualized and deployed a scalable data lake on Google Cloud Storage. Utilized the Parquet file format to streamline data storage and retrieval, resulting in reduced data footprint and expedited query processing. • Seamless Machine Learning Integration: Collaborated closely with Data Science teams, seamlessly integrating machine learning models into data pipelines. Leveraged Google Cloud ML Engine for model training, leading to a remarkable 25% boost in predictive accuracy. • Real-time Insights with Streaming: Leveraged Apache Kafka and Google Pub/Sub for real-time data streaming, providing stakeholders with instantaneous insights for agile decision-making. • Empowering GCP Mastery Across Teams: Facilitated knowledge transfer sessions, empowering cross-functional teams with GCP tools. Organized workshops on Dataflow, BigQuery, and ML Engine to democratize data insights and enhance agility. • Automated Deployment Excellence: Established CI/CD practices using Google Cloud Build and Cloud Functions, streamlining deployment workflows and ensuring consistent code quality, accelerating development cycles. • Optimized Data Processing with Dataproc: Orchestrated Apache Spark workflows on Google Cloud Dataproc, leveraging parallelized processing and scalability. Achieved a 20% reduction in data processing costs through dynamic cluster scaling. • Visualizing Insights with Looker: Collaborated with stakeholders to design interactive Looker dashboards, empowering informed decision-making with comprehensive visualizations.

Associate (Data science)

ZS Associates

Oct 2021 - Feb 2023

Led a team that achieved 4.8 % revenue growth for the Marketing Analytics team of a US Healthcare client, leveraging AWS Redshift, Spark-powered advance SQL analysis, and ETL pipelines to analyze healthcare data set Enhanced operational efficiency by saving 30 hours/week through Visual Basic Application (VBA)-powered automated reports Developed and optimized SQL queries for patient segmentation, incorporating strategic indexing and query restructuring techniques, resulting in a 20% improvement in query performance Utilized machine learning algorithms for patient segmentation project Collaborated with the development team to reproduce and verify bug fixes, ensuring a 95% accuracy.

Associate (Data science)

ZS Associates

Oct 2021 - Feb 2023

Led a team that achieved 4.8 % revenue growth for the Marketing Analytics team of a US Healthcare client, leveraging AWS Redshift, Spark-powered advance SQL analysis, and ETL pipelines to analyze healthcare data set Enhanced operational efficiency by saving 30 hours/week through Visual Basic Application (VBA)-powered automated reports Developed and optimized SQL queries for patient segmentation, incorporating strategic indexing and query restructuring techniques, resulting in a 20% improvement in query performance Utilized machine learning algorithms for patient segmentation project Collaborated with the development team to reproduce and verify bug fixes, ensuring a 95% accuracy.

Associate (Data science)

ZS Associates

Oct 2021 - Feb 2023

Led a team that achieved 4.8 % revenue growth for the Marketing Analytics team of a US Healthcare client, leveraging AWS Redshift, Spark-powered advance SQL analysis, and ETL pipelines to analyze healthcare data set Enhanced operational efficiency by saving 30 hours/week through Visual Basic Application (VBA)-powered automated reports Developed and optimized SQL queries for patient segmentation, incorporating strategic indexing and query restructuring techniques, resulting in a 20% improvement in query performance Utilized machine learning algorithms for patient segmentation project Collaborated with the development team to reproduce and verify bug fixes, ensuring a 95% accuracy.

Research Intern

IIT Hyderabad

Feb 2018 - Mar 2021

under super vision of Dr Sai Sidhardh and Dr. Konda Reddy Mopuri(mentor) research areas : Deep Learning, Computer Vision and Machine learning and finite element simulations Depts associated : Artificial Intelligence and finite element simulations Laplace and Inverse Laplace transforms and their properties, and working the solutions Numerical Analysis and Computer programming of Euler's equation of motion for inviscid flow; Stream-lines, the path of a particle; Potential flow; , Cauchy's method of characteristics; Linear partial differential equations of the second order with constant coefficients, the canonical form

Research Intern

IIT Hyderabad

Feb 2018 - Mar 2021

under super vision of Dr Sai Sidhardh and Dr. Konda Reddy Mopuri(mentor) research areas : Deep Learning, Computer Vision and Machine learning and finite element simulations Depts associated : Artificial Intelligence and finite element simulations Laplace and Inverse Laplace transforms and their properties, and working the solutions Numerical Analysis and Computer programming of Euler's equation of motion for inviscid flow; Stream-lines, the path of a particle; Potential flow; , Cauchy's method of characteristics; Linear partial differential equations of the second order with constant coefficients, the canonical form

Research Intern

IIT Hyderabad

Feb 2018 - Mar 2021

under super vision of Dr Sai Sidhardh and Dr. Konda Reddy Mopuri(mentor) research areas : Deep Learning, Computer Vision and Machine learning and finite element simulations Depts associated : Artificial Intelligence and finite element simulations Laplace and Inverse Laplace transforms and their properties, and working the solutions Numerical Analysis and Computer programming of Euler's equation of motion for inviscid flow; Stream-lines, the path of a particle; Potential flow; , Cauchy's method of characteristics; Linear partial differential equations of the second order with constant coefficients, the canonical form

Data Science Intern

Capgemini

Feb 2018 - Mar 2021

- writing complex SQL for data analysis and data profiling - Worked very closely with business analysts, development teams and project managers for requirements and business rules - Working in .NET framework and C# technology -Hands on ETL development using any combination of Ab Initio, Ab Initio Express IT and Ab Initio Continues Flows in an agile development environment.

Data Science Intern

Capgemini

Feb 2018 - Mar 2021

- writing complex SQL for data analysis and data profiling - Worked very closely with business analysts, development teams and project managers for requirements and business rules - Working in .NET framework and C# technology -Hands on ETL development using any combination of Ab Initio, Ab Initio Express IT and Ab Initio Continues Flows in an agile development environment.

Data Science Intern

Capgemini

Feb 2018 - Mar 2021

- writing complex SQL for data analysis and data profiling - Worked very closely with business analysts, development teams and project managers for requirements and business rules - Working in .NET framework and C# technology -Hands on ETL development using any combination of Ab Initio, Ab Initio Express IT and Ab Initio Continues Flows in an agile development environment.

Math Tutor

Chegg

2021-2023

Math Tutor

Chegg

2021-2023

Math Tutor

Chegg

2021-2023

Education

National Institute of Technology, Warangal (NITW)

Bachelor of Technology

2013 - 2016

Activities and societies: || Business Club || NSS || Table tennisActivities and societies: || Business Club || NSS || Table tennis Business Club : The Club's mission and core value is to inculcate the culture of making everyone financially conscious. • Organised both business-related seminars and events every month. + Gave many Sessions and Presentations • Discussed various case studies amongst the team members + Learnt trading and helped others to create awareness of Investments

National Institute of Technology, Warangal (NITW)

Bachelor of Technology

2013 - 2016

Activities and societies: || Business Club || NSS || Table tennisActivities and societies: || Business Club || NSS || Table tennis Business Club : The Club's mission and core value is to inculcate the culture of making everyone financially conscious. • Organised both business-related seminars and events every month. + Gave many Sessions and Presentations • Discussed various case studies amongst the team members + Learnt trading and helped others to create awareness of Investments

National Institute of Technology, Warangal (NITW)

Bachelor of Technology

2013 - 2016

Activities and societies: || Business Club || NSS || Table tennisActivities and societies: || Business Club || NSS || Table tennis Business Club : The Club's mission and core value is to inculcate the culture of making everyone financially conscious. • Organised both business-related seminars and events every month. + Gave many Sessions and Presentations • Discussed various case studies amongst the team members + Learnt trading and helped others to create awareness of Investments

Narayana Jr College

Intermediate (10+2)

2015-2017

-99.923 percentile in JEE mains and 98.23 in JEE advanced -secured 8.3k rank in JEE mains out of 8,32,000 - secured 7.2k In JEE Advanced out of 1,80,000 - 93.7% in Telangana State Board of Intermediate Education

Narayana Jr College

Intermediate (10+2)

2015-2017

-99.923 percentile in JEE mains and 98.23 in JEE advanced -secured 8.3k rank in JEE mains out of 8,32,000 - secured 7.2k In JEE Advanced out of 1,80,000 - 93.7% in Telangana State Board of Intermediate Education

Narayana Jr College

Intermediate (10+2)

2015-2017

-99.923 percentile in JEE mains and 98.23 in JEE advanced -secured 8.3k rank in JEE mains out of 8,32,000 - secured 7.2k In JEE Advanced out of 1,80,000 - 93.7% in Telangana State Board of Intermediate Education

Kendriya Vidyalaya

High School 10th

2013 - 2016

Activities and societies: Red Cross || NCC || Scouts and guides || Table Tennis || Swimming || Cricket ||Activities and societies: Red Cross || NCC || Scouts and guides || Table Tennis || Swimming || Cricket || -> 9.6/10 CGPA ->Represented state in swimming national meet in 2007 -> represented state twice ( AP ) in table tennis nationals meet in 2013 and 2012 -> Top 3 Percentile at the time of passout -> 3rd place in debate competitions -> House captain of the school

Kendriya Vidyalaya

High School 10th

2013 - 2016

Activities and societies: Red Cross || NCC || Scouts and guides || Table Tennis || Swimming || Cricket ||Activities and societies: Red Cross || NCC || Scouts and guides || Table Tennis || Swimming || Cricket || -> 9.6/10 CGPA ->Represented state in swimming national meet in 2007 -> represented state twice ( AP ) in table tennis nationals meet in 2013 and 2012 -> Top 3 Percentile at the time of passout -> 3rd place in debate competitions -> House captain of the school

Kendriya Vidyalaya

High School 10th

2013 - 2016

Activities and societies: Red Cross || NCC || Scouts and guides || Table Tennis || Swimming || Cricket ||Activities and societies: Red Cross || NCC || Scouts and guides || Table Tennis || Swimming || Cricket || -> 9.6/10 CGPA ->Represented state in swimming national meet in 2007 -> represented state twice ( AP ) in table tennis nationals meet in 2013 and 2012 -> Top 3 Percentile at the time of passout -> 3rd place in debate competitions -> House captain of the school

Got questions?

I’m always excited to collaborate on innovative and exciting projects!

E-mail

tsatvik@student.nitw.ac.in

Phone

+918125662478

Got questions?

I’m always excited to collaborate on innovative and exciting projects!

E-mail

tsatvik@student.nitw.ac.in

Phone

+918125662478

Got questions?

I’m always excited to collaborate on innovative and exciting projects!

E-mail

tsatvik@student.nitw.ac.in

Phone

+918125662478