Apache Airflow

Open-source workflow orchestration — schedule and monitor data + ML pipelines

MLOps Apache 2.0 Self-hosted / Cloud Intermediate

Stats

GitHub stars: ★ 38k+
License: Apache 2.0
Hosting: Self-hosted / Cloud
Difficulty: Intermediate

Get started

Official docs and GitHub repo

Visit Apache Airflow ↗ View on GitHub ↗

What is Apache Airflow?

Apache Airflow is a de facto standard open-source platform for orchestrating complex data and ML workflows as DAGs (Directed Acyclic Graphs), where each node is a task and each edge is a dependency between tasks. It schedules pipelines, resolves task dependencies, monitors execution, and retries failed tasks automatically. It is used by Netflix, Airbnb, Twitter, and many major Indian tech companies.
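To make the DAG idea concrete, here is a minimal sketch in plain Python (standard library only, no Airflow involved) of how a set of task dependencies determines a valid run order. The task names are hypothetical, and this only illustrates the concept; it is not how Airflow's scheduler is implemented.

```python
from graphlib import CycleError, TopologicalSorter

# Hypothetical pipeline: each task maps to the set of tasks it depends on.
dependencies = {
    "extract": set(),
    "transform": {"extract"},
    "train_model": {"transform"},
    "report": {"transform"},
}


def run_order(deps):
    """Return one valid execution order: every task comes after its dependencies."""
    return list(TopologicalSorter(deps).static_order())


def has_cycle(deps):
    """A workflow with a cycle is not a DAG and has no valid run order."""
    try:
        run_order(deps)
    except CycleError:
        return True
    return False


order = run_order(dependencies)
print(order)
```

"Acyclic" matters because a cycle (for example, `transform` depending on `report` while `report` depends on `transform`) leaves no task that can safely run first.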

Quick start

1. pip install apache-airflow

2. airflow standalone # Dev setup

3. Open http://localhost:8080

4. Write DAGs as Python files in ~/airflow/dags/

Use cases

ML pipeline automation

Data engineering

ETL workflows

Model retraining schedules

Compatible models

Framework agnostic — orchestrates any Python code

Why this matters for India

Most senior data engineering and MLOps roles in India require Airflow, making it a core skill for the ₹20L+ salary band.