Some things I've worked on:
Data engineering
- Big data AWS web app that displays Chicago's street traffic, crash history, and traffic violation history. Languages/platforms used: Hadoop DFS, HBase, HiveQL, Kafka, Spark, Scala. Final project for MPCS 53014: Big Data Application Architecture. GitHub
- Created and integrated database models into a web app to display the NIH's energy consumption data on an internal server. The Django web app shows an input form for engineers to enter and search utility records, and visualizations using R, Qlik, and MS Report Builder summarise energy usage and cost avoidance. For NIH as a Civic Digital Fellow in summer 2020. GitHub
- Built a Django web app that outputs an energy source table and visual representations of state energy consumption trends. The state-level data from US EIA was imported via web scraping and API. Final project for CAPP 30122: Computer Science with Applications 2. Link GitHub
- Web-browser automation tool that counts and de-duplicates orders for student test name purchasing. It was my first introduction to programming and, despite its simplicity, one of my proudest projects. Built primarily using selenium and openpyxl.
etc.
- R Shiny web app that displays City of Chicago employee salary and overtime pay for 2020. It shows a treemap of salary allocation and a table that can be filtered by department and pay. Link GitHub
- Identified Chicago neighborhoods where affordable housing is most likely to be an issue with current trends in housing prices. Using open local government datasets and machine learning models, predicted areas with highest increases in price-to-income ratios in several years. Final project for CAPP 30254: Machine Learning for Public Policy. GitHub
- This personal website, built to experiment with lightweight web frameworks. GitHub
- Predictive models to calculate the probability of Hall of Fame induction for all active and recently retired players. GitHub