8000 etl-pipeline · GitHub Topics · GitHub
[go: up one dir, main page]

Skip to content
#

etl-pipeline

Here are 2,428 public repositories matching this topic...

Apache Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and metadata. Runs and scales everywhere python does.

  • Updated Oct 7, 2025
  • Jupyter Notebook
Udacity-Data-Engineering-Projects

Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.

  • Updated Aug 26, 2022
  • Python

The agentic AI platform for enterprise. Built for availability, scalability, and security. Complete end-to-end context engineering and LLM orchestration infrastructure. Run anywhere - local, cloud, or bare metal.

  • Updated Oct 6, 2025
  • Python

An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Kafka, Apache Zookeeper, Apache Spark, and Cassandra. All components are containerized with Docker for easy deployment and scalability.

  • Updated Feb 14, 2025
  • Python

Improve this page

Add a description, image, and links to the etl-pipeline topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the etl-pipeline topic, visit your repo's landing page and select "manage topics."

Learn more

0