About Projects Skills Experience Contact
Open to opportunities · Mumbai

Data Analyst
& ML Practitioner

Built end-to-end analytics systems on 96K+ records — from SQL pipelines to ML models — focused on solving real business problems like churn and retention.

about me

Who I Am

VC

I'm a second-year BCA student at Somaiya Vidyavihar University, Mumbai. I build end-to-end analytics systems on real-world datasets — from writing complex SQL queries to deploying machine learning models as live apps.

My work spans customer segmentation, churn prediction, BI dashboards, and instructor effectiveness modelling. I'm always focused on the business outcome, not just the model metric.

Previously interned at Riidl Incubation Centre evaluating 10+ early-stage startups and producing data-backed KPI reports.

📊

Business-first thinking

Every analysis ends with a recommendation. Identified 68% customers as one-time buyers, highlighting retention as the primary revenue growth opportunity

End-to-end builder

Raw SQL → cleaned data → ML model → deployed app. Non-technical stakeholders can query data in plain English.

🚀

Startup ecosystem

Evaluated EdTech, FinTech & SaaS startups for investment viability at Riidl Incubation Centre.

🎓

Academic excellence

CGPA 9.63 / 10.0 — Somaiya Vidyavihar University, 2024–2027.

work

Featured Projects

01 · ML + ROUTING

Mumbai Local Train Network Intelligence System

End-to-end ML pipeline for delay prediction and optimal route planning across Mumbai's 139-station suburban rail network — built entirely from scratch, no public dataset.

XGBoost delay predictor — MAE 1.16 min, R² of 0.82
Dijkstra routing across 4 lines, real-time per-station delay annotations
11,730-row synthetic timetable with statistically calibrated peak variance
Live Flask API with interactive map frontend
PythonPostgreSQLXGBoostFlaskPower BI
02 · Business Intelligence

E-Commerce BI System

End-to-end BI pipeline on 96K+ customer records — RFM segmentation, Power BI dashboards, and an AI natural language query engine.

15+ PostgreSQL queries across 7 joined tables
68% one-time buyers → retention is key growth lever
Champion / Loyal / Regular / Lost tiers via RFM
AI NL query engine for non-technical stakeholders
PythonPostgreSQLPower BIStreamlit
03 · Predictive Modelling

Instructor Effectiveness Model

Composite scoring model that tiers instructors into Low / Medium / High performance using 9 features across 3 weighted pillars.

Pillars: Learning Outcome, Engagement, Quality
96% accuracy — zero critical misclassifications
Random Forest beat Gradient Boosting & Logistic
PythonScikit-learnPandas
toolkit

Skills & Technologies

Languages
Python
SQL / PostgreSQL
Data Analysis
Pandas
NumPy
Statistics
Machine Learning
Scikit-learn
Random Forest
Logistic Regression
Visualisation
Power BI
Plotly / Seaborn
Matplotlib
Tools
Streamlit
Git / GitHub
Jupyter
background

Experience & Education

Startup Analyst Intern
Riidl Incubation Centre · Mumbai
May – Aug 2025
  • Evaluated 10+ early-stage startups on business model viability, scalability, and competitive positioning
  • Produced data-backed KPI reports that reduced analysis turnaround time and supported investment decisions
  • Researched market differentiation across EdTech, FinTech, and SaaS for portfolio startups
Bachelor of Computer Applications (BCA)
Somaiya Vidyavihar University
2024 – 2027 · Mumbai, India
9.63
CGPA / 10.0
Certifications
Exploratory Data Analysis for Machine Learning
IBM / Coursera
Aug 2025
AI Product Manager
Microsoft / Coursera
Sep 2025
Elements of AI
University of Helsinki
Jul 2025
Designing User Interfaces and Experiences (UI/UX)
IBM / Coursera
Mar 2026
Getting Started with Git and Github
IBM / Coursera
Mar 2026
Data Structures and Performance
UC San Diego / Coursera
Mar 2026
contact

Let's Connect

Say hello 👋

I'm open to data analyst internship opportunities and interesting projects. If you want to talk data, analytics, or just connect — reach out.