Published inCodeXData Reliability with Soda CoreData is critical for all organizations and yet many struggles with getting quality data. The most common reasons are either lack of…Aug 21, 2022Aug 21, 2022
Published inCodeXHow to Setup a Local MWAA Development EnvironmentApache Airflow is an open-source tool to programmatically author, schedule, and monitor workflows. It is one of the most robust platforms…Oct 10, 20212Oct 10, 20212
Published inCodeXVersion Control your Data Lake with LakeFSData Lake brings revolution to the data world, it allows you to store relational data from business applications and operational databases…Apr 1, 20211Apr 1, 20211
Published inCodeXBuild Data Pipeline with Apache HopApache Airflow has become a de facto tool for Data Engineering, but don’t overlook other tools out there that can boost your productivity…Jan 31, 20214Jan 31, 20214
Published inCodeXHow to Scale-out Apache Airflow 2.0 with Redis and CeleryApache Airflow has become one of the most prevalent tools in the Data Engineering space. It is a platform that offers you to…Jan 19, 20214Jan 19, 20214
Published inCodeXModern Data Platform Using Open Source TechnologiesAre you looking to build a modern data warehouse or a data lake for your organization? If so, should you build one from the ground up or…Jan 11, 20211Jan 11, 20211
Published inCodeXHow to Build a Modern Data Lake with MinIOIn this article, the focus is to build a modern data lake using only open source technologies. I will walk-through a step-by-step process…Jan 10, 2021Jan 10, 2021