Trending

Content tagged with "data-engineering"

data-engineering

Hacker News

Top stories from the Hacker News community• Updated 7 minutes ago

Reddit

Top posts from tech subreddits• Updated 5 minutes ago

Reddit

[D] Where to do study ML Infra?

reddit.com
Reddit

Respect the CSV

github.com

Hugging Face Trending

Popular models from Hugging Face• Updated 5 minutes ago

No models found

Try removing the tag filter or searching for different content.

GitHub Trending

Popular repositories from GitHub• Updated 19 minutes ago

arrow

Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics

superset

Apache Superset is a Data Visualization and Data Exploration Platform

dice

DiceDB is an open-source, fast, reactive, in-memory database optimized for modern hardware.

polars

Dataframes powered by a multithreaded, vectorized query engine, written in Rust

pimcore

Core Framework for the Open Core Data & Experience Management Platform (PIM, MDM, CDP, DAM, DXP/CMS & Digital Commerce)

starrocks

The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance for multi-dimensional analytics, real-time analytics, and ad-hoc queries. A Linux Foundation project.

llama_index

LlamaIndex is the leading framework for building LLM-powered agents over your data.

dolphinscheduler

Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code

cvat

Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.