Trending
Content tagged with "data-engineering"
Hacker News
Top stories from the Hacker News community• Updated 7 minutes ago
Top posts from tech subreddits• Updated 7 minutes ago
Zstandard Compression in Python 3.14: Why It Is a Big Deal for Developers
Google to invest $6 billion in India for Asia’s biggest datacentre project: Report - The Times of India
Cheyenne to host massive AI data center using more electricity than all Wyoming homes combined
Data center construction sparks NIMBY opposition amid Korea's big AI push
Hugging Face Trending
Popular models from Hugging Face• Updated about 1 hour ago
No models found
Try removing the tag filter or searching for different content.
GitHub Trending
Popular repositories from GitHub• Updated 3 minutes ago
unstructured
Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise grade Platform product for production grade workflows, partitioning, enrichments, chunking and embedding.
pathway
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
milvus
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
simdjson
Parsing gigabytes of JSON per second : used by Facebook/Meta Velox, the Node.js runtime, ClickHouse, WatermelonDB, Apache Doris, Milvus, StarRocks
seatunnel
SeaTunnel is a next-generation super high-performance, distributed, massive data integration tool.