Trending

Content tagged with "data-engineering"


Hacker News

Top stories from the Hacker News community • Updated 11 minutes ago


Reddit

Top posts from tech subreddits • Updated about 2 hours ago

Hugging Face Trending

Popular models from Hugging Face • Updated 38 minutes ago

No models found


GitHub Trending

Popular repositories from GitHub • Updated about 1 hour ago

unstructured

Convert documents to structured data effortlessly. Unstructured is an open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise-grade Platform product for production-grade workflows, partitioning, enrichments, chunking, and embedding.

pathway

Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.

milvus

Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search.

yfinance

Download market data from Yahoo! Finance's API.

simdjson

Parsing gigabytes of JSON per second: used by Facebook/Meta Velox, the Node.js runtime, ClickHouse, WatermelonDB, Apache Doris, Milvus, StarRocks

datavzrd

A tool to create visual HTML reports from collections of CSV/TSV tables.

seatunnel

SeaTunnel is a next-generation, high-performance, distributed tool for massive data integration.