Trending

Content tagged with "data-engineering"

data-engineering

Hacker News

Top stories from the Hacker News community• Updated 4 minutes ago

Reddit

Top posts from tech subreddits• Updated 1 minute ago

Hugging Face Trending

Popular models from Hugging Face• Updated 1 minute ago

No models found

Try removing the tag filter or searching for different content.

GitHub Trending

Popular repositories from GitHub• Updated 15 minutes ago

llama_index

LlamaIndex is the leading framework for building LLM-powered agents over your data.

Data-Science-For-Beginners

10 Weeks, 20 Lessons, Data Science for All!

airflow

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

doris

Apache Doris is an easy-to-use, high performance and unified analytics database.

rclone

"rsync for cloud storage" - Google Drive, S3, Dropbox, Backblaze B2, One Drive, Swift, Hubic, Wasabi, Google Cloud Storage, Azure Blob, Azure Files, Yandex Files

polars

Dataframes powered by a multithreaded, vectorized query engine, written in Rust

datahub

The Metadata Platform for your Data and AI Stack

unstructured

Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise grade Platform product for production grade workflows, partitioning, enrichments, chunking and embedding.