Trending

Content tagged with "data-engineering"

data-engineering

Hacker News

Top stories from the Hacker News community• Updated less than a minute ago

Reddit

Top posts from tech subreddits• Updated less than a minute ago

Hugging Face Trending

Popular models from Hugging Face• Updated 43 minutes ago

No models found

Try removing the tag filter or searching for different content.

GitHub Trending

Popular repositories from GitHub• Updated about 1 hour ago

flatbuffers

FlatBuffers: Memory Efficient Serialization Library

airflow

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

llama_index

LlamaIndex is the leading framework for building LLM-powered agents over your data.

milvus

Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search

ClickHouse

ClickHouse® is a real-time analytics database management system

lance

Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, and PyTorch with more integrations coming..

faiss

A library for efficient similarity search and clustering of dense vectors.

flink-cdc

Flink CDC is a streaming data integration tool