Trending

Content tagged with "data-engineering"

data-engineering

Hacker News

Top stories from the Hacker News community• Updated 8 minutes ago

Reddit

Top posts from tech subreddits• Updated 38 minutes ago

Reddit

Archived JSON of NYT Crosswords

reddit.com
42
13
Brilliant-Kick2708
about 1 month ago

Hugging Face Trending

Popular models from Hugging Face• Updated 21 minutes ago

No models found

Try removing the tag filter or searching for different content.

GitHub Trending

Popular repositories from GitHub• Updated 34 minutes ago

doris

Apache Doris is an easy-to-use, high performance and unified analytics database.

llama_index

LlamaIndex is the leading framework for building LLM-powered agents over your data.

superset

Apache Superset is a Data Visualization and Data Exploration Platform

pathway

Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.

lance

Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, and PyTorch with more integrations coming..

velero

Backup and migrate Kubernetes applications and their persistent volumes

chroma

Open-source search and retrieval database for AI applications.

mindsdb

AI Analytics and Knowledge Engine for RAG over large-scale, heterogeneous data. - The only MCP Server you'll ever need