PySpark, Databricks, AWS Glue, AWS Athena (SQL), AWS QuickSight, MS Excel, GA, Looker Studio (Google Data Studio), R - RStudio, Python - Jupyter, AWS SageMaker, Dataiku, Adobe Analytics, Tableau
Batch-based Data Ingestion and Processing Pipeline
End-to-end AWS pipeline for ingesting and analyzing video analytics data.
Real-time Customer Relationship Management (CRM) Lead Processing and Notification System
Event-driven lead processing and notification system on AWS.
Low-latency Cross-platform Data Engineering System for Marketing Spend and Customer Activity Analysis
Data engineering pipeline that derives business insights from event and spend data, using a cross-platform system design (AWS, Databricks, Streamlit).
Energy Mix Over Time - EIA
A trend analysis of electricity generation from renewable and non-renewable sources in the US.
Quality of Life Analysis - OECD Nations
A quality-of-life comparison between OECD countries and selected countries of interest, focused on recent years and covering indicators such as job quality, social interactions, and health.
People Analytics Data Python Package
This package is a port of an R package associated with the free online book Handbook of Regression Modeling in People Analytics.
Digital Advertising Spend Optimization - Media Mix Modeling
Covers the full path from raw clickstream data to estimating spend-to-sales effects at a million-dollar scale.
Google Merchandise Store - Discovery
A prototype of a C-level report for understanding a company's website performance, here the Google Merchandise Store.
Mobile Game Analytics A/B Testing
A quick-reference framework for deciding whether to run a test.
Rating Prediction for Google Local - User reviews
Sentiment-driven rating prediction to better recommend places for users to visit.
Regression in Microsoft Excel
Before diving into writing code for regression, this work highlights the concepts and assumptions using Excel.
Google Merchandise Store - Discovery - GA4 Edition
A high-level performance report on Google Merchandise Store data using GA4 functionality.
SQL Data Exploration on Jupyter
A simple initiative to set up a local environment and run queries against it. The blog post also covers some basic SQL clauses.
What-is-what in Statistics and Data Science - Tableau Dashboard
A collation of introductory information on different concepts in statistics and data science.
All the works published and thoughts shared are my own.