Explore my sample collection of open-source projects, code samples, and development resources.
Analyzed lockdown policy effectiveness using GCP, Apache Kafka, and Spark. The system combined real-time and batch processing for advanced analytics.
Analyzed U.S. alternative fuel stations data to uncover trends in EV charging infrastructure and forecasted future growth patterns.
Built a web scraping pipeline using Scrapy and Requests to collect and process air quality data from EPA AirNow for geospatial analysis.
A robust document processing system for handling various file formats, including PDFs, with text extraction and analysis capabilities.
A collection of data engineering exercises and solutions, covering ETL processes, data pipelines, and big data technologies.
Implementation of ML/AI algorithms including autoencoders, and Isolation Forest for predictive analysis.