Tools

  • Dremio - The Data Lake Engine - https://www.dremio.com

  • Apache Airflow - Programmatically author, schedule and monitor workflows - https://airflow.apache.org

  • Prefect - The New Standard in Dataflow Automation - https://www.prefect.io

  • Flyte - Develop, execute, and monitor distributed workflows reliably at scale - https://github.com/lyft/flyte

  • Amundsen - A metadata-driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data - https://github.com/lyft/amundsen