Projects

A collection of my work across AI, machine learning, data engineering, business intelligence, and full-stack development.

AI & LLM Projects

Exploring large language models, vector search, and AI agents using LangChain, OpenAI, and RAG pipelines.

PrepMe – AI-Powered Interview Platform

PrepMe – AI-Powered Interview Platform

Built a responsive web platform using Next.js, Vapi AI, and Gemini that simulates real-time mock interviews with voice agents and provides instant feedback and transcripts.

Next.js Vapi AI Gemini Firebase TailwindCSS
SupportBot – AI-Powered Customer Assistant

SupportBot – AI-Powered Customer Assistant

Developed a multi-agent chatbot using LangFlow and RAG, capable of resolving order tracking, FAQs, and product queries with 92% automation accuracy.

LangFlow OpenAI RAG Flask Docker
AI Agent Benchmarking Tool

AI Agent Benchmarking Tool

Compared performance of CrewAI, Phidata, and Autogen for financial data extraction. Benchmarked agents based on latency, precision, and cost using real stock data from YFinance.

Phidata CrewAI Autogen LangChain SQLite YFinance

Machine Learning Projects

Classical ML models built with scikit-learn, XGBoost, and evaluation pipelines for structured datasets.

Credit Card Approval Prediction

Credit Card Approval Prediction

Built classification models including Logistic Regression and Random Forest to predict approvals based on applicant data. Achieved up to 96% accuracy.

Python scikit-learn pandas GridSearchCV
California Wildfire Risk Prediction

California Wildfire Risk Prediction

Used weather and vegetation data to train Random Forest and XGBoost models in R. Reached 89% accuracy for predicting high-risk zones.

R XGBoost Random Forest Data Wrangling

Data Engineering Projects

ETL pipelines, PySpark jobs, Snowflake workflows, and data orchestration with tools like Airflow.

Spotify Listening ETL Pipeline

Spotify Listening ETL Pipeline

Pulled and processed over 10K+ streaming records from Spotify API. Modeled genre trends, time-based clusters, and visualized usage via Power BI.

Spotify API Python AWS S3 PostgreSQL Power BI
Customer Data Lakehouse Pipeline

Customer Data Lakehouse Pipeline

Engineered a modular PySpark pipeline for clickstream logs, storing outputs in Delta Lake on AWS S3 and enabling low-latency Spark SQL queries.

PySpark Delta Lake AWS S3 SQL GitHub Actions
Logistics & Fulfillment Data Pipeline

Logistics & Fulfillment Data Pipeline

Built an ETL pipeline to clean and load simulated retail logistics data into Redshift, optimizing dashboard latency by 40%.

Python SQL AWS Redshift Data Modeling

Business Intelligence & Dashboards

Dashboards built in Tableau & Power BI, SQL-based KPIs, and interactive reporting solutions.

AI Stocks Performance Dashboard

AI Stocks Performance Dashboard

Built an automated ETL system to pull data from Yahoo Finance API, store results in S3, and visualize stock trends in Tableau.

Python Yahoo Finance API AWS S3 Tableau
Marketing Campaign Analysis Dashboard

Marketing Campaign Analysis Dashboard

Analyzed campaign effectiveness using SQL and Excel, segmented user behavior, and visualized KPIs across regions and channels in Power BI.

SQL Power BI Excel
Operational Efficiency Dashboard (Retail)

Operational Efficiency Dashboard (Retail)

Used Power BI to track revenue, inventory turnover, and staffing metrics across multiple retail branches.

SQL Excel Power BI
Yelp Review Analytics

Yelp Review Analytics

Processed over 7M reviews, loaded data into Snowflake, and built dashboards to identify sentiment trends and top-rated businesses.

Python Snowflake SQL Tableau