Sagar Sidhwa Profile Photo

Hello, I’m

Sagar Sidhwa

AI Software Engineer at Cummins Inc | MS CS – Artificial Intelligence (AI) @ Binghamton University | AWS Certified

I'm an aspiring AI Software Engineer using cutting-edge AI and machine learning to transform data into actionable insights. With hands-on experience in both academic and industry settings by developing LLM-RAG solutions, scalable ML pipelines, and full-stack AI systems using Python, FastAPI, React, and AWS, I integrate software development with innovative AI solutions to tackle complex challenges. I’m actively looking for new and exciting opportunities in Software, AI, and ML roles to further contribute to impactful technology that shapes the future of tech.





(Completed)

Master's in Computer Science (Major: CS Artificial Intelligence) [MSCS-AI] @Binghamton University

Binghamton, NY, USA | (Aug 2023 – Aug 2025)

(Completed)

Bachelor's in Computer Engineering @University of Mumbai

Mumbai, India | (Aug 2017 – May 2021)

Coursework

  • Artificial Intelligence
  • Machine Learning
  • Data Mining
  • Social Media Data Science Pipeline
  • Design & Analysis Comp Algorithms (Data Structures and Algorithms)
  • Database Systems
  • Programming Languages
  • Operating Systems
  • Systems Programming

Programming Languages

Python C# SQL JavaScript TypeScript Java C HTML CSS XML PHP

Frameworks

ASP.NET Core ASP.NET Web API AngularJS Django Django REST API Framework Frontend and Backend development Blazor Apriso

Platforms: Software Applications, Programming tools, and Data Management tools

Visual Studio Code Visual Studio Git Postman Microsoft SQL Server Azure DevOps Jupyter Notebook GitHub GitLab MongoDB PostgreSQL PowerBI Tableau Microsoft Excel Microsoft Office Suite Linux

Additional Technologies, Skills, and Libraries Expertise

Machine Learning Algorithms Predictive Analysis JSON Parsing XML Parsing Data Pipeline UI/UX Development Client-Side Development Server-Side Development Back-End Development Bootstrap Data Visualization NumPy Pandas TensorFlow SciPy Matplotlib OpenCV Keras PyTorch Scikit-Learn XGBoost
Sagar Sidhwa

Hello, I’m Sagar Sidhwa, I'm an AI Software Engineer with a Master’s in Computer Science with a focus and specialization on Artificial Intelligence (as my major - CS Artificial Intelligence) at (SUNY) State University of New York at Binghamton | (Binghamton University). I have a robust background in Machine Learning, AI, Data Science, and Data Analysis, supported by over two years of professional AI Software Engineer and Development experience and a foundational degree in Computer Engineering from the University of Mumbai. I leverage these skills to develop innovative solutions that integrate software development with Artificial Intelligence and Machine Learning.

My academic journey has provided me with a thorough understanding of both foundational and advanced technologies. At Binghamton University, I am honing my Master's in Computer Science with an expertise (Major's) in Artificial Intelligence (MSCS-AI). My coursework, including Design & Analysis Comp Algorithms (Data Structures and Algorithms), Operating Systems, Introduction To Data Mining, Social Media Data Science Pipeline, Intro to Machine Learning, Intro to Artificial Intelligence, Database Systems, Programming Languages, Systems Programming, has further enhanced my expertise in complex data analysis, model development, and software development and programming.

📌 Current Role – AI Software Engineer at Cummins Inc.
At Cummins, I’m leading the design and development of enterprise-grade GenAI solutions. I built a GenAI-powered LLM-RAG system that leverages historical CRs, IRs, and Oracle SQL data to reduce issue resolution time by 65%. This involved building clean data pipelines, fine-tuning LLMs through cross-validation and A/B testing, and deploying continuous monitoring and feedback loops. I also designed and deployed a cross-platform Employee Card Access system using ReactJS and FastAPI, replacing a legacy solution to improve operational reliability and security. By integrating AWS services and GitHub CI/CD pipelines, I established a modern and secure intranet DevOps process. My role involves full-lifecycle system design, architecture planning, and close collaboration with cross-functional teams to deliver AI-enhanced tools in fast-paced manufacturing environments.

Professionally, I have worked as an AI-Software Engineer at LTIMindtree in Mumbai, where I designed and developed key solutions, including an automated assembly line sorting algorithm and a secure authentication process. I developed a predictive model for the assembly line, utilizing statistical modeling to enable dynamic data sorting and filtering, which enhanced AI-based systems, data handling, and process automation. I also fixed a critical security flaw by implementing encryption in the authentication process, securing user credentials and mitigating login failures, while automating metrics monitoring through Python scripts—recognized as a major enhancement by 20+ clients. My role involved collaborating with cross-functional teams, enhancing software functionality, and optimizing user experiences. I also contributed to deploying applications using Azure DevOps, GitLab, and GitHub, ensuring smooth operation in pre-production and production environments. Additionally, I automated machine learning lifecycle deployments with Azure Pipelines, reducing production time by 20% and cutting manual workload by 50%. As a result of my commitment to continuous learning and growth, as evidenced by my certification in Machine Learning, and my recognition with the “GoMX Annual Hall of Fame Awards” for exceptional contributions to my team and projects.

Throughout my academic and professional journey, I have led several significant projects, which are detailed on my GitHub profile (github.com/sagar-sidhwa). Notable projects include:

  • AI | Machine Learning | LLM | RAG | Projects
  • Analysis of Cultural Impact on Tech Discussions: Reddit vs YouTube
  • OK Cupid Profiles Analysis
  • Cyberbullying and Fake Account Detection on Social Media
  • Attendance Recognition System Using Facial Recognition
  • Audiobook System

These experiences have provided me with extensive expertise in software and web development, data gathering and analysis, and building machine learning models, demonstrating my ability to address diverse and impactful challenges.



Discover My



Cummins Inc, AI Software Engineer

New York, USA | (May 2025 – Present)

Built a GenAI-powered LLM-RAG application for real-time resolution of Change Requests (CRs) and Incident Reports (IRs) in manufacturing. Leveraged historical production tickets, Oracle SQL data, and past resolution logs to develop a context-aware model that significantly reduced ticket resolution time by 65% and helped streamline issue triaging on the shop floor.

Fine-tuned transformer-based LLMs using techniques such as k-fold cross-validation, confusion matrix analysis, and A/B testing. Optimized key hyperparameters including learning rate, batch size, and epochs to reduce overfitting and improve model generalization. Incorporated real-time feedback loops to monitor model outputs and drive continuous iteration based on user interaction and error patterns.

Designed and implemented robust data pipelines by integrating Oracle SQL queries and internal APIs from production and assembly lines. Built ETL pipelines to clean, structure, and transform data from multiple systems, creating a reliable backend source for LLM prompts and embedding generation.

Led architecture and development of a cross-platform Employee Card Access and Registration System using ReactJS (frontend) and FastAPI (backend). Migrated the system from a legacy environment to a modern web-based solution with improved role-based access control, photo processing integration, and error handling, ensuring operational scalability and compliance with security policies.

Integrated cloud-based DevOps solutions using AWS services (EC2, S3, Lambda) and GitHub Actions for CI/CD. Deployed AI and web applications within the intranet infrastructure, ensuring secure access, automated testing, and cross-environment debugging with full traceability.

Drove requirement gathering and full lifecycle system design in collaboration with cross-functional teams including software developers, security stakeholders, and infrastructure engineers. Participated in peer code reviews, provided architecture input, and enforced modular design principles to ensure maintainability and system responsiveness.

Binghamton University, Graduate Assistant

Binghamton, NY, USA | (August 2024 – May 2025)

Designed and developed a robust diagnostics portal that seamlessly integrates medicine-barcode mapping functionality. This innovation significantly enhanced delivery scheduling efficiency and improved inventory material tracking processes, resulting in a 40% optimization in overall logistics

Engineered an advanced audio frequency enhancement tool using Python and signal processing techniques. The solution simulated diverse patient voice variations, enabling more realistic medical training scenarios for healthcare professionals

Created a scalable remote API pipeline for medical data collection from platforms such as Reddit and YouTube. Leveraged Python for data processing and stored the retrieved data in AWS Snowflake, PostgreSQL, and MongoDB, ensuring efficient cloud-based storage and API access. This system successfully handled large-scale, real-time data retrieval tasks with precision

Implemented a BERT transformer model and performed exploratory data analysis (EDA) with attention mechanisms, time series analysis, and statistical modeling. Integrated generative AI (GenAI) and large language models (LLMs) with hyperparameter tuning, leading to a 38% improvement in model evaluation metrics

Designed a secure connected messaging protocol architecture to streamline communication between doctors and nurses. This architecture ensured efficient and encrypted data exchange within healthcare workflows, fostering improved collaboration and patient care

Built an interactive website featuring data analysis dashboards with advanced filtering algorithms. Users could adjust parameters by time ranges, compare historical and latest results, and access comprehensive reports with detailed insights, enhancing decision-making in healthcare processes

Conducted in-depth analysis of toxic behaviors and cultural influences using natural language processing (NLP) and machine learning techniques. This effort yielded predictive trends, sentiment analysis, and actionable insights, enabling better understanding of behavioral patterns and their implications

LTIMindtree, AI Software Engineer

Mumbai, India | (June 2021 – July 2023)

Designed and developed an automated assembly line sorting algorithm, rectified a critical security flaw in the authentication process, and deployed an equitable new assembly line orders assignment endpoint API. Implemented AI-driven solutions for dynamic data sorting, filtering, and process automation using Python, optimizing overall system efficiency.

Collaborated with a team of 5, working closely with managers and clients to translate requirements into actionable JIRA tasks, execute user stories, manage change requests, ensure adherence to acceptance criteria, and resolve newly reported issues and bugs

Employed a diverse set of technologies, including .NET, .NET Framework, .NET WebAPI, Angular, JavaScript, C#, Microsoft SQL, Postman, Kibana, and Python to enhance software functionality, user experience, and AI-based automation

Orchestrated deployment to the pre-production and production environment using Azure DevOps, involved clients in the testing phase, encouraged feedback, and applied further changes as required to ensure smooth application operation

Enhanced the user experience, streamlined production and deployment, and provided optimized solutions and process enhancements that improved efficiency, productivity, data protection, reduced manual workload, and earned client appreciation for the security enhancement

Implemented a pickle file-based caching algorithm that optimized the system’s speed by 60%, avoiding redundant model training and improving overall system performance

VESIT, Software-ML Engineer

Mumbai, India | (January 2020 – April 2021)

Developed a specialized website using Django and SQL Server tailored for visually impaired users in healthcare. The platform enabled users to learn from audio read-back of scanned text, improving accessibility and inclusivity in healthcare learning environments

Deployed a comprehensive text-to-speech system leveraging Pytesseract, pyttsx3 libraries, and advanced techniques such as text clustering, topic modeling, and text-to-audio alignment mapping. Designed a dynamic data upload section and a structured backend engine, achieving a remarkable 92% system accuracy and providing an intuitive user experience

Collaborated with a team of four to develop a cyberbullying detection and dummy account identification system. Utilized Scikit-Learn, Python, NLP, random forest, and decision tree classification models, along with advanced text classification and anomaly detection techniques. This project resulted in a 78% improvement in system performance, enhancing cybersecurity and user safety

Designed and implemented a UI/UX web page system for streamlined incident notifications to an admin page. Leveraged DevOps practices, Object-Oriented Programming principles, and JavaScript for dynamic functionality. Analyzed crucial features and utilized GitHub for distributed deployment, ensuring efficient and direct issue reporting to the administrative panel

Vivekanand Education Society's Institute of Technology (VESIT), Machine Learning Engineer Intern

Mumbai, India | (November 2019 – January 2020)

Developed an "Attendance Monitoring System" in a group of 3 using Django and Machine Learning, for precise and efficient student attendance tracking.

Performed NLP techniques that enhanced the analysis process and deeply analyzed user-generated content from online channels, identifying toxic behaviors and assessing regional cultural influences on technology, education, and advancements.

Implemented a pickle file-based caching algorithm which optimized the system’s speed by 60%, avoiding redundant model training.




Check Out My Recent


Project 1

AI | Machine Learning | GenAI | LLM | RAG | Projects

August 2024 – Present

A collection of hands-on projects in Machine Learning, AI, and Deep Learning, covering classification, regression, NLP, and Generative AI. The repository includes Python, Scikit-learn, TensorFlow, OpenAI APIs, and more. Projects demonstrate core ML concepts, model building, evaluation, and deployment. Topics span traditional ML, deep learning, LLM fine-tuning, and RAG (Retrieval-Augmented Generation).

Explore Project
Project 1

OK CUPID PROFILES ANALYSIS

January 2024 – May 2024

Analyzed OkCupid user profiles to identify patterns in matching preferences and profile popularity. Collaborated with a team to examine attributes such as age, height, and body type, while exploring factors like income and lifestyle choices affecting profile popularity. My contributions provided valuable insights into user behavior, enhancing the understanding of online dating dynamics and improved matchmaking strategies.

Explore Project
Project 1

Analysis of Cultural Impact on Tech Discussions: Reddit vs YouTube

August 2023 – December 2023

Developed an automated system for collecting and analyzing Reddit and YouTube data using Python, PostgreSQL, and machine learning models. Utilized NLP techniques for sentiment analysis and trend predictions, while also identifying toxic behaviors and assessing cultural influences. Automated data processing and visualized insights with Flask, Django, and Power BI, by data-informed decision-making

Explore Project
Project 1

Cyber Bullying and Fake Account Detection in Social Media

July 2020 – April 2021

Developed a Python-Django-based website utilizing advanced logic and NLP to detect cyberbullying and fake social media accounts. Used Python, SQL Server, and Matplotlib for data processing, while implementing a Random Forest model to analyze account authenticity. The solution improved online security, streamlined reporting, and enhanced user experience by identifying and addressing suspicious activities effectively.

Explore Project
Project 1

Face Recognition based Attendance Management System

July 2019 – April 2020

Designed a face recognition-based attendance management system using Python-Django and Machine Learning. Integrated advanced image and face recognition algorithms with an unique approach for accurate tracking, even with partial face visibility, using Python, SQL, Pickle, Pandas, and Matplotlib. Optimized system performance by 60% with a caching algorithm to avoid redundant model training.

Explore Project
Project 1

Audiobook System

January 2020 – April 2020

Created a Python-Django web application to assist visually impaired users by converting scanned text into speech using libraries like pyttsx3 and Pytesseract. Utilized Django and SQL Server to build and optimize the app for efficient PDF and text-to-speech conversion. Collaborated on planning, design, and troubleshooting with the team, ensuring seamless user experience and functionality.

Explore Project
Project 1

Web Development - Online Pizza Ordering Project

January 2019 – May 2019

Built a web-based application that enables customers to order pizzas online, view order history, modify orders, and track deliveries. Developed collaboratively by a team of three, I was primarily responsible for designing the major (UI/UX) user interface pages and implementing key Backend functionalities, contributing to a robust and efficient ordering system.

Explore Project

Explore More Porjects here

View All Projects Project




Papers Published

2

Certifications

5+

Awards

2

Papers Published

Joseph, Richard and Samtani, Jayesh and Sidhwa, Sagar and Tiwari, Somesh and Wadhwani, Riya, “Cyber bullying and Fake Account Detection in Social Media”, Jun’21, Proceedings of the 4th International Conference on Advances in Science & Technology (ICAST2021)

ELSEVIER

Doultani, Mannat, Yogesh Tekwani, Somesh Tiwari, and Sagar Sidhwa, "Attendance Recognition System using Face Appearance", Vol 12 No SUP 1 (2020): Jun’20, SAMRIDDHI: A Journal of Physical Sciences, Engineering and Technology

SAMRIDDHI

Certifications

Amazon Web Services (AWS) Certified Machine Learning Engineer - Associate
Probability for Data Science
● Completed the "Machine Learning using Python in Data Science" and "Android Developer Course" from Udemy.
● Completed multiple Oracle courses and certificates on Artificial Intelligence with Machine Learning, Database Design and Programming with SQL, Java Programming Cumulative Final Exam.

Awards

● Recognized with the prestigious "GoMX Annual Hall of Fame Awards 2022", achieving the title of "Business Unit of the Year" commended for exemplary dedication, ownership, and trust demonstrated in contributions to the MFGOG project and team.
● Achieved top position at the inter-department district level round of Avishkar-2019-20, an inter-university research project competition.





Feel Free to drop a Message :)


Thank you for taking the time to learn about me. I'm actively looking for new opportunities and collaborations. Feel free to reach out if you're interested in connecting or discussing potential projects!

Connect with me ()