About Me
I’m a passionate software engineer specializing in ML Ops and building scalable machine learning solutions. With experience in modern web technologies and cloud infrastructure, I deliver efficient, end-to-end AI-powered applications that solve complex problems.
My journey in technology began at the University of Mumbai, where I developed a strong passion for Machine Learning, AI, and deploying end-to-end solutions. Pursuing my Master’s in Computer Science at UC Irvine further deepened my interest in efficient cloud deployment. Since then, I’ve worked on a wide range of projects—from scalable ML pipelines and real-time analytics to LLM-powered applications—continuously honing my skills in building impactful, production-ready systems.
Skills & Technologies
Programming Languages
Frameworks & Libraries
Databases & Storage
Cloud & DevOps
Projects
E-Commerce Clickstream Analytics
FinAgent
LIRA(S): Legal document Information Retrieval, Analysis, and Summarization
Digimed
NoCode-ML
Commodity Price Predictor
Publications
This paper proposes a unique approach to cluster these documents using the mini batch k-means algorithm on dimensionally reduced sentence embeddings generated with the use of DistilBERT and UMAP. The proposed approach has been compared to state-of-the-art topic modelling and clustering approaches and has outperformed them.
The paper explores how the evolution of data — from structured to predominantly unstructured — has created a need for scalable, flexible storage solutions that traditional relational databases cannot provide. This has led to the rise of NoSQL databases, which are especially suitable for domains like finance that generate large, dynamic datasets daily. The study compares different types of NoSQL databases based on metrics such as data model, indexing, atomicity, and integrity. It also implements and evaluates MongoDB, Cassandra, and Redis using financial data, measuring their performance on various operations including basic and complex READ and WRITE queries, as well as aggregation tasks.
This paper focuses on performing sentiment analysis on chat messages from Twitch livestreams, which are challenging to analyze due to large volume and platform-specific language like emotes and memes. Several machine learning models were tested, including Support Vector Classifier, Logistic Regression, Decision Tree, Random Forest, and Multinomial Naïve Bayes. The Support Vector Classifier achieved the best performance, surpassing the previous state-of-the-art with notable improvements: 10.3% higher accuracy, 9.1% higher recall, and a 7.5% increase in F1-score.
Open Source Contributions
My Contributions:
- Learned about RDF, the Semantic Web, and ideas behind RESTful API interface.
- Improved API flexibility.
- Improved documentation by adding usage tutorials.
My Contributions:
- Contributed to the project as part of Hacktoberfest 2020.
- Developed a backend Flask API to efficiently serve PostgreSQL data.
- Created an ETL process to convert CSV data into PostgreSQL tables.
My Contributions:
- Contributed to the project as part of Hacktoberfest 2020.
- Improved code readability and maintainability.
My Contributions:
- Worked on a PR to correct a mistake in the network's implementation.
