Trending

Content tagged with "etl"

etl

Hacker News

Top stories from the Hacker News community• Updated 3 minutes ago

Reddit

Top posts from tech subreddits• Updated 3 minutes ago

Hugging Face Trending

Popular models from Hugging Face• Updated about 1 hour ago

No models found

Try removing the tag filter or searching for different content.

GitHub Trending

Popular repositories from GitHub• Updated about 1 hour ago

pathway

Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.

airflow

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

unstructured

Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise grade Platform product for production grade workflows, partitioning, enrichments, chunking and embedding.

dlt

data load tool (dlt) is an open source Python library that makes data loading easy 🛠️

dbt-utils

Utility functions for dbt projects.

Makefile
1,616
556

arrow

Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics

dolphinscheduler

Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code

seatunnel

SeaTunnel is a multimodal, high-performance, distributed, massive data integration tool.