Trending

Content tagged with "data-engineering"

data-engineering

Hacker News

Top stories from the Hacker News community• Updated 12 minutes ago

Reddit

Top posts from tech subreddits• Updated 3 minutes ago

Hugging Face Trending

Popular models from Hugging Face• Updated 39 minutes ago

No models found

Try removing the tag filter or searching for different content.

GitHub Trending

Popular repositories from GitHub• Updated about 1 hour ago

rclone

"rsync for cloud storage" - Google Drive, S3, Dropbox, Backblaze B2, One Drive, Swift, Hubic, Wasabi, Google Cloud Storage, Azure Blob, Azure Files, Yandex Files

superset

Apache Superset is a Data Visualization and Data Exploration Platform

arrow

Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics

starrocks

The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance for multi-dimensional analytics, real-time analytics, and ad-hoc queries. A Linux Foundation project.

ClickHouse

ClickHouse® is a real-time analytics database management system

pathway

Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.

polars

Dataframes powered by a multithreaded, vectorized query engine, written in Rust

datahub

The Metadata Platform for your Data and AI Stack