This project demonstrates a simple ETL (Extract, Transform, Load) pipeline using the GitHub REST API, Python, Pandas, and PostgreSQL. The pipeline extracts GitHub user information, performs ...
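As a rough illustration of the extract, transform, and load stages named in this description, here is a minimal sketch. It is not the project's code. To stay standard-library only, it uses `sqlite3` as a stand-in for PostgreSQL and plain dictionaries instead of a Pandas DataFrame. The table name `github_users`, the `FIELDS` selection, and the cleaning rules are all assumptions made for the example; the endpoint `https://api.github.com/users/{username}` is the public GitHub REST API route for a single user.

```python
import json
import sqlite3
import urllib.request

API_URL = "https://api.github.com/users/{username}"

# Assumed subset of profile fields to keep (hypothetical choice for this sketch).
FIELDS = ("login", "id", "name", "public_repos", "followers", "created_at")


def extract(username):
    """Fetch one user's public profile from the GitHub REST API."""
    request = urllib.request.Request(
        API_URL.format(username=username),
        headers={
            "Accept": "application/vnd.github+json",
            "User-Agent": "etl-sketch",  # GitHub rejects requests without a User-Agent
        },
    )
    with urllib.request.urlopen(request, timeout=10) as response:
        return json.load(response)


def transform(raw):
    """Keep only the chosen fields and normalize missing values."""
    row = {field: raw.get(field) for field in FIELDS}
    row["name"] = (row["name"] or "").strip()
    row["public_repos"] = int(row["public_repos"] or 0)
    row["followers"] = int(row["followers"] or 0)
    return row


def load(conn, rows):
    """Upsert rows into a hypothetical github_users table (SQLite stand-in)."""
    conn.execute(
        """CREATE TABLE IF NOT EXISTS github_users (
               id INTEGER PRIMARY KEY,
               login TEXT NOT NULL,
               name TEXT,
               public_repos INTEGER,
               followers INTEGER,
               created_at TEXT
           )"""
    )
    conn.executemany(
        """INSERT OR REPLACE INTO github_users
               (id, login, name, public_repos, followers, created_at)
           VALUES (:id, :login, :name, :public_repos, :followers, :created_at)""",
        rows,
    )
    conn.commit()
```

A run would look like `load(sqlite3.connect("users.db"), [transform(extract("octocat"))])`. Against PostgreSQL, the same shape applies with a PostgreSQL driver and `INSERT ... ON CONFLICT (id) DO UPDATE` in place of SQLite's `INSERT OR REPLACE`.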
An end-to-end Data Engineering and Analytics platform that ingests startup funding data, performs ETL processing, stores the data in a relational Data Warehouse (Star Schema), generates analytical ...
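To make the star-schema idea concrete, the sketch below builds one fact table surrounded by two dimension tables and runs a typical analytical query against them. Everything in it is assumed for illustration: the table and column names (`dim_startup`, `dim_date`, `fact_funding`, `amount_usd`), the helper functions, and the use of in-process `sqlite3` rather than whatever warehouse the project actually targets.

```python
import sqlite3

# Hypothetical star schema: one fact table referencing two dimensions.
SCHEMA = """
CREATE TABLE dim_startup (
    startup_id INTEGER PRIMARY KEY,
    name       TEXT UNIQUE NOT NULL,
    sector     TEXT
);
CREATE TABLE dim_date (
    date_id   INTEGER PRIMARY KEY,
    full_date TEXT UNIQUE NOT NULL,
    year      INTEGER NOT NULL
);
CREATE TABLE fact_funding (
    funding_id INTEGER PRIMARY KEY,
    startup_id INTEGER NOT NULL REFERENCES dim_startup(startup_id),
    date_id    INTEGER NOT NULL REFERENCES dim_date(date_id),
    amount_usd REAL NOT NULL
);
"""


def load_round(conn, name, sector, full_date, amount_usd):
    """Insert one funding round, creating dimension rows only if missing."""
    conn.execute(
        "INSERT OR IGNORE INTO dim_startup (name, sector) VALUES (?, ?)",
        (name, sector),
    )
    conn.execute(
        "INSERT OR IGNORE INTO dim_date (full_date, year) VALUES (?, ?)",
        (full_date, int(full_date[:4])),  # expects an ISO date such as 2023-01-15
    )
    startup_id = conn.execute(
        "SELECT startup_id FROM dim_startup WHERE name = ?", (name,)
    ).fetchone()[0]
    date_id = conn.execute(
        "SELECT date_id FROM dim_date WHERE full_date = ?", (full_date,)
    ).fetchone()[0]
    conn.execute(
        "INSERT INTO fact_funding (startup_id, date_id, amount_usd) VALUES (?, ?, ?)",
        (startup_id, date_id, amount_usd),
    )


def total_by_sector(conn):
    """Aggregate the fact table along the startup dimension."""
    return conn.execute(
        """SELECT s.sector, SUM(f.amount_usd)
           FROM fact_funding AS f
           JOIN dim_startup AS s USING (startup_id)
           GROUP BY s.sector
           ORDER BY s.sector"""
    ).fetchall()
```

The point of the layout is that descriptive attributes live once in the dimensions, so the fact table stays narrow and analytical queries reduce to joins plus a `GROUP BY`.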
**Data Cleaning & Quality Automation Tool** Data preprocessing typically consumes most of an analyst's time, which led me to develop a fully automated data quality pipeline in Python that ...
**Build A Web Scraper And Sell Data** Web scraping pulls data from websites. You turn this data into profit. You sell it to businesses. Use Python. It is simple. Use these ...