Trending

Content tagged with "data-engineering"

Hacker News

Top stories from the Hacker News community • Updated 6 minutes ago

Reddit

Top posts from tech subreddits • Updated 36 minutes ago

Hugging Face Trending

Popular models from Hugging Face • Updated 19 minutes ago

No models found

GitHub Trending

Popular repositories from GitHub • Updated 33 minutes ago

redisson

Redisson - Valkey & Redis Java client. Real-Time Data Platform. Sync/Async/RxJava/Reactive API. Over 50 Valkey- and Redis-based Java objects and services: Set, Multimap, SortedSet, Map, List, Queue, Deque, Semaphore, Lock, AtomicLong, Map Reduce, Bloom filter, Spring, Tomcat, Scheduler, JCache API, Hibernate, RPC, local cache.

pathway

Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.

airflow

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

ClickHouse

ClickHouse® is a real-time analytics database management system

lance

Modern columnar data format for ML and LLMs implemented in Rust. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, PyArrow, and PyTorch, with more integrations coming.

datahub

The Metadata Platform for your Data and AI Stack

doris

Apache Doris is an easy-to-use, high-performance, and unified analytics database.

llama_index

LlamaIndex is the leading framework for building LLM-powered agents over your data.

superset

Apache Superset is a Data Visualization and Data Exploration Platform