This is a performance testing framework for Spark SQL in Apache Spark 2.2+. The framework contains twelve benchmarks that can be executed in local mode. They are organized into three classes and ...
End-to-end Azure Data Engineering portfolio project built around the Intel Berkeley Lab IoT sensor dataset. The project implements a lakehouse-style analytics platform using Azure Data Factory, Azure ...
๐Ÿง‘๐Ÿ’ป ๐’๐๐‹ ๐‚๐ก๐ž๐š๐ญ๐ฌ๐ก๐ž๐ž๐ญ ๐Ÿ๐จ๐ซ ๐ƒ๐š๐ญ๐š & ๐๐š๐œ๐ค๐ž๐ง๐ ๐…๐จ๐ฅ๐ค๐ฌ ๐Ÿ’ฏโฃ โฃ If you're ...
Senior Data Engineer | Databricks · PySpark · Python · AWS · Azure · SQL 6+ yrs · Banking · Healthcare · IT Services · Dual Databricks Certified (DE Associate + Gen AI Engineer) ...