Hello, I’m
AI Software Engineer at Cummins Inc | MS CS – Artificial Intelligence (AI) @ Binghamton University | AWS Certified
I'm an aspiring AI Software Engineer using cutting-edge AI and machine learning to transform data into actionable insights. With hands-on experience in both academic and industry settings by developing LLM-RAG solutions, scalable ML pipelines, and full-stack AI systems using Python, FastAPI, React, and AWS, I integrate software development with innovative AI solutions to tackle complex challenges. I’m actively looking for new and exciting opportunities in Software, AI, and ML roles to further contribute to impactful technology that shapes the future of tech.
Gain more insights
Master's in Computer Science (Major: CS Artificial Intelligence) [MSCS-AI] @Binghamton University
Binghamton, NY, USA | (Aug 2023 – Aug 2025)
Bachelor's in Computer Engineering @University of Mumbai
Mumbai, India | (Aug 2017 – May 2021)
Hello, I’m Sagar Sidhwa, I'm an AI Software Engineer with a Master’s in Computer Science with a focus and specialization on Artificial Intelligence (as my major - CS Artificial Intelligence) at (SUNY) State University of New York at Binghamton | (Binghamton University). I have a robust background in Machine Learning, AI, Data Science, and Data Analysis, supported by over two years of professional AI Software Engineer and Development experience and a foundational degree in Computer Engineering from the University of Mumbai. I leverage these skills to develop innovative solutions that integrate software development with Artificial Intelligence and Machine Learning.
My academic journey has provided me with a thorough understanding of both foundational and advanced technologies. At Binghamton University, I am honing my Master's in Computer Science with an expertise (Major's) in Artificial Intelligence (MSCS-AI). My coursework, including Design & Analysis Comp Algorithms (Data Structures and Algorithms), Operating Systems, Introduction To Data Mining, Social Media Data Science Pipeline, Intro to Machine Learning, Intro to Artificial Intelligence, Database Systems, Programming Languages, Systems Programming, has further enhanced my expertise in complex data analysis, model development, and software development and programming.
📌 Current Role – AI Software Engineer at Cummins Inc. At Cummins, I’m leading the design and development of enterprise-grade GenAI solutions. I built a GenAI-powered LLM-RAG system that leverages historical CRs, IRs, and Oracle SQL data to reduce issue resolution time by 65%. This involved building clean data pipelines, fine-tuning LLMs through cross-validation and A/B testing, and deploying continuous monitoring and feedback loops. I also designed and deployed a cross-platform Employee Card Access system using ReactJS and FastAPI, replacing a legacy solution to improve operational reliability and security. By integrating AWS services and GitHub CI/CD pipelines, I established a modern and secure intranet DevOps process. My role involves full-lifecycle system design, architecture planning, and close collaboration with cross-functional teams to deliver AI-enhanced tools in fast-paced manufacturing environments.
Professionally, I have worked as an AI-Software Engineer at LTIMindtree in Mumbai, where I designed and developed key solutions, including an automated assembly line sorting algorithm and a secure authentication process. I developed a predictive model for the assembly line, utilizing statistical modeling to enable dynamic data sorting and filtering, which enhanced AI-based systems, data handling, and process automation. I also fixed a critical security flaw by implementing encryption in the authentication process, securing user credentials and mitigating login failures, while automating metrics monitoring through Python scripts—recognized as a major enhancement by 20+ clients. My role involved collaborating with cross-functional teams, enhancing software functionality, and optimizing user experiences. I also contributed to deploying applications using Azure DevOps, GitLab, and GitHub, ensuring smooth operation in pre-production and production environments. Additionally, I automated machine learning lifecycle deployments with Azure Pipelines, reducing production time by 20% and cutting manual workload by 50%. As a result of my commitment to continuous learning and growth, as evidenced by my certification in Machine Learning, and my recognition with the “GoMX Annual Hall of Fame Awards” for exceptional contributions to my team and projects.
Throughout my academic and professional journey, I have led several significant projects, which are detailed on my GitHub profile (github.com/sagar-sidhwa). Notable projects include:
These experiences have provided me with extensive expertise in software and web development, data gathering and analysis, and building machine learning models, demonstrating my ability to address diverse and impactful challenges.
Discover My
New York, USA | (May 2025 – Present)
Built a GenAI-powered LLM-RAG application for real-time resolution of Change Requests (CRs) and Incident Reports (IRs) in manufacturing. Leveraged historical production tickets, Oracle SQL data, and past resolution logs to develop a context-aware model that significantly reduced ticket resolution time by 65% and helped streamline issue triaging on the shop floor.
Fine-tuned transformer-based LLMs using techniques such as k-fold cross-validation, confusion matrix analysis, and A/B testing. Optimized key hyperparameters including learning rate, batch size, and epochs to reduce overfitting and improve model generalization. Incorporated real-time feedback loops to monitor model outputs and drive continuous iteration based on user interaction and error patterns.
Designed and implemented robust data pipelines by integrating Oracle SQL queries and internal APIs from production and assembly lines. Built ETL pipelines to clean, structure, and transform data from multiple systems, creating a reliable backend source for LLM prompts and embedding generation.
Led architecture and development of a cross-platform Employee Card Access and Registration System using ReactJS (frontend) and FastAPI (backend). Migrated the system from a legacy environment to a modern web-based solution with improved role-based access control, photo processing integration, and error handling, ensuring operational scalability and compliance with security policies.
Integrated cloud-based DevOps solutions using AWS services (EC2, S3, Lambda) and GitHub Actions for CI/CD. Deployed AI and web applications within the intranet infrastructure, ensuring secure access, automated testing, and cross-environment debugging with full traceability.
Drove requirement gathering and full lifecycle system design in collaboration with cross-functional teams including software developers, security stakeholders, and infrastructure engineers. Participated in peer code reviews, provided architecture input, and enforced modular design principles to ensure maintainability and system responsiveness.
Binghamton, NY, USA | (August 2024 – May 2025)
Designed and developed a robust diagnostics portal that seamlessly integrates medicine-barcode mapping functionality. This innovation significantly enhanced delivery scheduling efficiency and improved inventory material tracking processes, resulting in a 40% optimization in overall logistics
Engineered an advanced audio frequency enhancement tool using Python and signal processing techniques. The solution simulated diverse patient voice variations, enabling more realistic medical training scenarios for healthcare professionals
Created a scalable remote API pipeline for medical data collection from platforms such as Reddit and YouTube. Leveraged Python for data processing and stored the retrieved data in AWS Snowflake, PostgreSQL, and MongoDB, ensuring efficient cloud-based storage and API access. This system successfully handled large-scale, real-time data retrieval tasks with precision
Implemented a BERT transformer model and performed exploratory data analysis (EDA) with attention mechanisms, time series analysis, and statistical modeling. Integrated generative AI (GenAI) and large language models (LLMs) with hyperparameter tuning, leading to a 38% improvement in model evaluation metrics
Designed a secure connected messaging protocol architecture to streamline communication between doctors and nurses. This architecture ensured efficient and encrypted data exchange within healthcare workflows, fostering improved collaboration and patient care
Built an interactive website featuring data analysis dashboards with advanced filtering algorithms. Users could adjust parameters by time ranges, compare historical and latest results, and access comprehensive reports with detailed insights, enhancing decision-making in healthcare processes
Conducted in-depth analysis of toxic behaviors and cultural influences using natural language processing (NLP) and machine learning techniques. This effort yielded predictive trends, sentiment analysis, and actionable insights, enabling better understanding of behavioral patterns and their implications
Mumbai, India | (June 2021 – July 2023)
Designed and developed an automated assembly line sorting algorithm, rectified a critical security flaw in the authentication process, and deployed an equitable new assembly line orders assignment endpoint API. Implemented AI-driven solutions for dynamic data sorting, filtering, and process automation using Python, optimizing overall system efficiency.
Collaborated with a team of 5, working closely with managers and clients to translate requirements into actionable JIRA tasks, execute user stories, manage change requests, ensure adherence to acceptance criteria, and resolve newly reported issues and bugs
Employed a diverse set of technologies, including .NET, .NET Framework, .NET WebAPI, Angular, JavaScript, C#, Microsoft SQL, Postman, Kibana, and Python to enhance software functionality, user experience, and AI-based automation
Orchestrated deployment to the pre-production and production environment using Azure DevOps, involved clients in the testing phase, encouraged feedback, and applied further changes as required to ensure smooth application operation
Enhanced the user experience, streamlined production and deployment, and provided optimized solutions and process enhancements that improved efficiency, productivity, data protection, reduced manual workload, and earned client appreciation for the security enhancement
Implemented a pickle file-based caching algorithm that optimized the system’s speed by 60%, avoiding redundant model training and improving overall system performance
Mumbai, India | (January 2020 – April 2021)
Developed a specialized website using Django and SQL Server tailored for visually impaired users in healthcare. The platform enabled users to learn from audio read-back of scanned text, improving accessibility and inclusivity in healthcare learning environments
Deployed a comprehensive text-to-speech system leveraging Pytesseract, pyttsx3 libraries, and advanced techniques such as text clustering, topic modeling, and text-to-audio alignment mapping. Designed a dynamic data upload section and a structured backend engine, achieving a remarkable 92% system accuracy and providing an intuitive user experience
Collaborated with a team of four to develop a cyberbullying detection and dummy account identification system. Utilized Scikit-Learn, Python, NLP, random forest, and decision tree classification models, along with advanced text classification and anomaly detection techniques. This project resulted in a 78% improvement in system performance, enhancing cybersecurity and user safety
Designed and implemented a UI/UX web page system for streamlined incident notifications to an admin page. Leveraged DevOps practices, Object-Oriented Programming principles, and JavaScript for dynamic functionality. Analyzed crucial features and utilized GitHub for distributed deployment, ensuring efficient and direct issue reporting to the administrative panel
Mumbai, India | (November 2019 – January 2020)
Developed an "Attendance Monitoring System" in a group of 3 using Django and Machine Learning, for precise and efficient student attendance tracking.
Performed NLP techniques that enhanced the analysis process and deeply analyzed user-generated content from online channels, identifying toxic behaviors and assessing regional cultural influences on technology, education, and advancements.
Implemented a pickle file-based caching algorithm which optimized the system’s speed by 60%, avoiding redundant model training.
Check Out My Recent
0
+ Completed
August 2024 – Present
A collection of hands-on projects in Machine Learning, AI, and Deep Learning, covering classification, regression, NLP, and Generative AI. The repository includes Python, Scikit-learn, TensorFlow, OpenAI APIs, and more. Projects demonstrate core ML concepts, model building, evaluation, and deployment. Topics span traditional ML, deep learning, LLM fine-tuning, and RAG (Retrieval-Augmented Generation).
Explore ProjectJanuary 2024 – May 2024
Analyzed OkCupid user profiles to identify patterns in matching preferences and profile popularity. Collaborated with a team to examine attributes such as age, height, and body type, while exploring factors like income and lifestyle choices affecting profile popularity. My contributions provided valuable insights into user behavior, enhancing the understanding of online dating dynamics and improved matchmaking strategies.
Explore ProjectAugust 2023 – December 2023
Developed an automated system for collecting and analyzing Reddit and YouTube data using Python, PostgreSQL, and machine learning models. Utilized NLP techniques for sentiment analysis and trend predictions, while also identifying toxic behaviors and assessing cultural influences. Automated data processing and visualized insights with Flask, Django, and Power BI, by data-informed decision-making
Explore ProjectJuly 2020 – April 2021
Developed a Python-Django-based website utilizing advanced logic and NLP to detect cyberbullying and fake social media accounts. Used Python, SQL Server, and Matplotlib for data processing, while implementing a Random Forest model to analyze account authenticity. The solution improved online security, streamlined reporting, and enhanced user experience by identifying and addressing suspicious activities effectively.
Explore ProjectJuly 2019 – April 2020
Designed a face recognition-based attendance management system using Python-Django and Machine Learning. Integrated advanced image and face recognition algorithms with an unique approach for accurate tracking, even with partial face visibility, using Python, SQL, Pickle, Pandas, and Matplotlib. Optimized system performance by 60% with a caching algorithm to avoid redundant model training.
Explore ProjectJanuary 2020 – April 2020
Created a Python-Django web application to assist visually impaired users by converting scanned text into speech using libraries like pyttsx3 and Pytesseract. Utilized Django and SQL Server to build and optimize the app for efficient PDF and text-to-speech conversion. Collaborated on planning, design, and troubleshooting with the team, ensuring seamless user experience and functionality.
Explore ProjectJanuary 2019 – May 2019
Built a web-based application that enables customers to order pizzas online, view order history, modify orders, and track deliveries. Developed collaboratively by a team of three, I was primarily responsible for designing the major (UI/UX) user interface pages and implementing key Backend functionalities, contributing to a robust and efficient ordering system.
Explore ProjectLook Through My Research
2
5+
2
Joseph, Richard and Samtani, Jayesh and Sidhwa, Sagar and Tiwari, Somesh and Wadhwani, Riya, “Cyber bullying and Fake Account Detection in Social Media”, Jun’21, Proceedings of the 4th International Conference on Advances in Science & Technology (ICAST2021)
ELSEVIERDoultani, Mannat, Yogesh Tekwani, Somesh Tiwari, and Sagar Sidhwa, "Attendance Recognition System using Face Appearance", Vol 12 No SUP 1 (2020): Jun’20, SAMRIDDHI: A Journal of Physical Sciences, Engineering and Technology
SAMRIDDHI● Amazon Web Services (AWS) Certified Machine Learning Engineer - Associate |
● Probability for Data Science |
● Completed the "Machine Learning using Python in Data Science" and "Android Developer Course" from Udemy. |
● Completed multiple Oracle courses and certificates on Artificial Intelligence with Machine Learning, Database Design and Programming with SQL, Java Programming Cumulative Final Exam. |
● Recognized with the prestigious "GoMX Annual Hall of Fame Awards 2022", achieving the title of "Business Unit of the Year" commended for exemplary dedication, ownership, and trust demonstrated in contributions to the MFGOG project and team. |
● Achieved top position at the inter-department district level round of Avishkar-2019-20, an inter-university research project competition. |
Thank you for taking the time to learn about me. I'm actively looking for new opportunities and collaborations. Feel free to reach out if you're interested in connecting or discussing potential projects!