PySpark, Databricks, AWS Glue, AWS Athena (SQL), AWS Quicksight, MS Excel, GA, Looker Studio (Google Data Studio), R - R Studio, Python - Jupyter, AWS Quicksight, AWS SageMaker, Dataiku, Adobe Analytics, Tableau
Batch-based Data Ingestion and Processing Pipeline
End-to-end AWS pipeline for ingesting and analyzing video analytics data.
Real-time Customer Relationship Management (CRM) Lead Processing and Notification System
Event-driven lead processing and notification system on AWS.
Low-latency cross platform Data Engineering System for Marketing Spend and Customer Activity Analysis
Data engineering pipeline for business insights based on events and spend data leveraging cross platform system design (AWS, Databricks, Streamlit).
Energy Mix Over Time - EIA
This is a trend analyis of electricity generation across Renewable and Non Renewable sources in the US.
Quality of Life Analysis - OECD Nations
This is a quality of life analysis/comparison between OECD countries and select countries of interest focused on the recent years across different indicators like job quality, social interactions, health, etc
People Analytics Data Python Package
This package is port of an R package associated with the free online book Handbook of Regression Modeling in People Analytics
Digital Advertising Spend Optimization - Media Mix Modeling
Goes all the way from raw clickstream data to estimating spend-to-sales effects at a million-dollar scale.
Google Merchandise Store - Discovery
This is a prototype of C-level report to understand website performance of a company. Here, it’s Google Merchandise Store.
Mobile Game Analytics A/B Testing
A quick-to-refer framework to make decision whether to run a test.
All the works published and thoughts shared are my own.