Trending

Content tagged with "data-engineering"

data-engineering

Hacker News

Top stories from the Hacker News community• Updated 12 minutes ago

HN

The two versions of Parquet

jeronimo.dev

Reddit

Top posts from tech subreddits• Updated 12 minutes ago

Hugging Face Trending

Popular models from Hugging Face• Updated about 1 hour ago

No models found

Try removing the tag filter or searching for different content.

GitHub Trending

Popular repositories from GitHub• Updated 8 minutes ago

doris

Apache Doris is an easy-to-use, high performance and unified analytics database.

llama_index

LlamaIndex is the leading framework for building LLM-powered agents over your data.

superset

Apache Superset is a Data Visualization and Data Exploration Platform

pathway

Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.

lance

Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, and PyTorch with more integrations coming..

velero

Backup and migrate Kubernetes applications and their persistent volumes

chroma

Open-source search and retrieval database for AI applications.

mindsdb

AI Analytics and Knowledge Engine for RAG over large-scale, heterogeneous data. - The only MCP Server you'll ever need