Trending

Content tagged with "data-engineering"

data-engineering

Hacker News

Top stories from the Hacker News community• Updated 4 minutes ago

117

Reddit

Top posts from tech subreddits• Updated less than a minute ago

Reddit

Batch processing

reddit.com

Hugging Face Trending

Popular models from Hugging Face• Updated 1 minute ago

No models found

Try removing the tag filter or searching for different content.

GitHub Trending

Popular repositories from GitHub• Updated 15 minutes ago

airflow

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

llama_index

LlamaIndex is the leading framework for building LLM-powered agents over your data.

milvus

Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search

doris

Apache Doris is an easy-to-use, high performance and unified analytics database.

great_expectations

Always know what to expect from your data.

shardingsphere

Empowering Data Intelligence with Distributed SQL for Sharding, Scalability, and Security Across All Databases.

superset

Apache Superset is a Data Visualization and Data Exploration Platform

starrocks

The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance for multi-dimensional analytics, real-time analytics, and ad-hoc queries. A Linux Foundation project.

seatunnel

SeaTunnel is a next-generation super high-performance, distributed, massive data integration tool.