The Serverless Student Performance Analytics System is an AWS-based serverless application designed to automate the processing, storage, analysis, and export of student performance data. Faculty ...
PySpark provides a flexible and powerful DataFrame API to read data from different formats such as: CSV JSON Parquet ORC Delta Databases (JDBC) Understanding how to read data efficiently is important ...
A comprehensive Command Line Interface (CLI) based census portal for admin and client stakeholders with full CRUD operations using Python, Pandas, and NumPy.
**Data Cleaning & Quality Automation Tool** Preprocessing the data typically consumes most of the time of analysts. This led me to develop a fully automated data quality pipeline in Python that ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results