Trending

Content tagged with "data-engineering"

data-engineering

Hacker News

Top stories from the Hacker News community• Updated 4 minutes ago

Reddit

Top posts from tech subreddits• Updated 1 minute ago

Hugging Face Trending

Popular models from Hugging Face• Updated 1 minute ago

No models found

Try removing the tag filter or searching for different content.

GitHub Trending

Popular repositories from GitHub• Updated 15 minutes ago

rclone

"rsync for cloud storage" - Google Drive, S3, Dropbox, Backblaze B2, One Drive, Swift, Hubic, Wasabi, Google Cloud Storage, Azure Blob, Azure Files, Yandex Files

superset

Apache Superset is a Data Visualization and Data Exploration Platform

arrow

Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics

starrocks

The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance for multi-dimensional analytics, real-time analytics, and ad-hoc queries. A Linux Foundation project.

ClickHouse

ClickHouse® is a real-time analytics database management system

pathway

Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.

polars

Dataframes powered by a multithreaded, vectorized query engine, written in Rust

datahub

The Metadata Platform for your Data and AI Stack