Hi, I'm Ashish Rao

Building cool stuff with ML Ops, software, and generative AI—from start to finish.

Ashish Rao

About Me

I’m a passionate software engineer specializing in ML Ops and building scalable machine learning solutions. With experience in modern web technologies and cloud infrastructure, I deliver efficient, end-to-end AI-powered applications that solve complex problems.

My journey in technology began at the University of Mumbai, where I developed a strong passion for Machine Learning, AI, and deploying end-to-end solutions. Pursuing my Master’s in Computer Science at UC Irvine further deepened my interest in efficient cloud deployment. Since then, I’ve worked on a wide range of projects—from scalable ML pipelines and real-time analytics to LLM-powered applications—continuously honing my skills in building impactful, production-ready systems.

Skills & Technologies

Programming Languages

PythonJavaScriptTypeScriptSQLCJavaGoGraphQL

Frameworks & Libraries

ReactNext.jsDjangoFlaskPyTorchTensorFlowAngularTailwind CSSLangChainLangGraph

Databases & Storage

PostgreSQLMongoDBRedisCassandraRDSOpensearch

Cloud & DevOps

AWSGCPDockerKubernetesNginxLinuxKafkaRayMLflow

Projects

E-Commerce Clickstream Analytics

E-Commerce Clickstream Analytics
Built a scalable Go API middleware and Next.js frontend for an e-commerce platform, integrated real-time analytics with Redis and Kafka, and developed an ALS-based recommendation pipeline using Ray and MLflow for improved, experiment-tracked product recommendations.
GoNext.jsRedisKafkaRayMLflow

FinAgent

FinAgent
Built a RAG-based financial insights agent with LangChain and GPT-4o, served via a Flask/FastAPI backend with persistent PostgreSQL storage, and deployed using Docker and Kubernetes for scalable, real-time market explanations.
LangchainFastAPIDockerKubernetesPostgreSQL

LIRA(S): Legal document Information Retrieval, Analysis, and Summarization

LIRA(S): Legal document Information Retrieval, Analysis, and Summarization
Built a legal document processing pipeline with DistilBERT, UMAP, and Mini-batch K-Means for categorization, plus Pegasus-based summarization and retrieval, deployed via Docker on Google Cloud Run with a Django frontend for user access.
PythonHugging Face TransformersGoogle CloudDockerDjangoScikit-Learn

Digimed

Digimed
Developed a Django-based website with a team to digitize medical records, improving security and transferability. Hosted on AWS EC2 with Nginx and used Amazon S3 for scalable data storage. Added patient dashboards for easy appointment viewing and booking.
PythonAWSDjangoNginx

NoCode-ML

NoCode-ML
User-friendly no-code platform that empowers non-programmers to apply machine learning. Users can upload data, select relevant features and target columns, and choose from multiple models to generate predictions easily.
PythonReactDjangoPandasScikit-learn

Commodity Price Predictor

Commodity Price Predictor
Commodity Price Predictor generates accurate price models for Cotton, Rice, and Maize using historical state-level data. Built with Django, it lets users choose the commodity, state, and land area, reflecting regional factors that affect pricing.
PythonDjangoPandasScikit-learn

Publications

Topic Modelling-Based Approach for Clustering Legal Documents
Information and Communication Technology for Competitive Strategies (ICTCS 2021)2021
View Publication

This paper proposes a unique approach to cluster these documents using the mini batch k-means algorithm on dimensionally reduced sentence embeddings generated with the use of DistilBERT and UMAP. The proposed approach has been compared to state-of-the-art topic modelling and clustering approaches and has outperformed them.

Natural Language Processing
PyTorch
Hugging Face
Scikit-learn
Insights into NoSQL databases using financial data: A comparative analysis
Procedia Computer Science, Volume 215, 20222021
View Publication

The paper explores how the evolution of data — from structured to predominantly unstructured — has created a need for scalable, flexible storage solutions that traditional relational databases cannot provide. This has led to the rise of NoSQL databases, which are especially suitable for domains like finance that generate large, dynamic datasets daily. The study compares different types of NoSQL databases based on metrics such as data model, indexing, atomicity, and integrity. It also implements and evaluates MongoDB, Cassandra, and Redis using financial data, measuring their performance on various operations including basic and complex READ and WRITE queries, as well as aggregation tasks.

Database Management Systems
MongoDB
Cassandra
Redis
Sentiment Analysis of Twitch.tv Livestream Messages using Machine Learning Methods
2021 Fourth International Conference on Electrical, Computer and Communication Technologies (ICECCT)2021
View Publication

This paper focuses on performing sentiment analysis on chat messages from Twitch livestreams, which are challenging to analyze due to large volume and platform-specific language like emotes and memes. Several machine learning models were tested, including Support Vector Classifier, Logistic Regression, Decision Tree, Random Forest, and Multinomial Naïve Bayes. The Support Vector Classifier achieved the best performance, surpassing the previous state-of-the-art with notable improvements: 10.3% higher accuracy, 9.1% higher recall, and a 7.5% increase in F1-score.

Natural Language Processing
Scikit-Learn

Open Source Contributions

HTTP-APIs
Hydra Ecosystem is an organization that aims to create RESTful APIs and agents.
Contributor

My Contributions:

  • Learned about RDF, the Semantic Web, and ideas behind RESTful API interface.
  • Improved API flexibility.
  • Improved documentation by adding usage tutorials.
banjtheman/defundthepolice
A dashboard to make local police budgets transparent and help advocate for reallocating funds to strengthen communities.
Contributor

My Contributions:

  • Contributed to the project as part of Hacktoberfest 2020.
  • Developed a backend Flask API to efficiently serve PostgreSQL data.
  • Created an ETL process to convert CSV data into PostgreSQL tables.
dry-python/returns
This project attempts to bring types to Python.
Contributor

My Contributions:

  • Contributed to the project as part of Hacktoberfest 2020.
  • Improved code readability and maintainability.
jindongwang/Pytorch-CapsuleNet
An implementation of the CapsuleNet Neural Network for Image Classification tasks.
Contributor

My Contributions:

  • Worked on a PR to correct a mistake in the network's implementation.