Trending

Content tagged with "data-engineering"

data-engineering

Hacker News

Top stories from the Hacker News community• Updated 7 minutes ago

Reddit

Top posts from tech subreddits• Updated 7 minutes ago

Hugging Face Trending

Popular models from Hugging Face• Updated about 1 hour ago

No models found

Try removing the tag filter or searching for different content.

GitHub Trending

Popular repositories from GitHub• Updated 3 minutes ago

superset

Apache Superset is a Data Visualization and Data Exploration Platform

flatbuffers

FlatBuffers: Memory Efficient Serialization Library

airflow

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

llama_index

LlamaIndex is the leading framework for building LLM-powered agents over your data.

milvus

Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search

ClickHouse

ClickHouse® is a real-time analytics database management system

lance

Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, and PyTorch with more integrations coming..

faiss

A library for efficient similarity search and clustering of dense vectors.

flink-cdc

Flink CDC is a streaming data integration tool