Trending

Content tagged with "data-engineering"

data-engineering

Hacker News

Top stories from the Hacker News community• Updated 10 minutes ago

InfoQ

Latest articles from InfoQ• Updated 2 minutes ago

InfoQ

Magika 1.0: Smarter, Faster File Detection with Rust and AI

Google has just released version 1.0 of Magika, a substantial rewrite of its open-source file type detection system. The new version leverages AI to support a broader range of file types and is built in Rust for maximum speed and security. By Sergio De Simone

infoq.com
Sergio De Simone
about 2 hours ago
InfoQ

Breaking Silos: Netflix Introduces Upper Metamodel to Bring Consistency Across Content Engineering

Netflix has introduced the Upper metamodel within its Unified Data Architecture (UDA) to standardize domain definitions and generate consistent data container representations. UDA links conceptual models to GraphQL, Avro, SQL, and Java artifacts, supporting projections, mappings, and knowledge graph-based discovery across content, advertising, and operational systems. By Leela Kumili

infoq.com
InfoQ

Learnings from Cultivating Machine Learning Engineers as a Team Manager

As an AI team manager, Vivek Gupta stays broadly informed to guide AI experts effectively and drive the team. Engineers need feedback on both technical and interpersonal skills, Gupta mentioned at Dev Summit Boston. He stresses learning time, asking for help, and cross-team collaboration. Mentorship, data handling, and human-in-the-loop validation are key to success for machine learning engineers. By Ben Linders

infoq.com
InfoQ

Agentic Postgres: Postgres for Agentic Apps with Fast Forking and AI-Ready Features

Tiger Data, the company behind TimescaleDB, has launched Agentic Postgres, a Postgres-based database designed for both AI agents and developers. It extends Postgres with fast forking, an MCP server, native BM25 and vector search, and includes a CLI for terminal access. By Sergio De Simone

infoq.com

Reddit

Top posts from tech subreddits• Updated 2 minutes ago

Hugging Face Trending

Popular models from Hugging Face• Updated 38 minutes ago

No models found

Try removing the tag filter or searching for different content.

GitHub Trending

Popular repositories from GitHub• Updated about 1 hour ago

mindsdb

Federated query engine for AI - The only MCP Server you'll ever need

cvat

Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.

superset

Apache Superset is a Data Visualization and Data Exploration Platform

presidio

An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.

posthog

🦔 PostHog is an all-in-one developer platform for building successful products. We offer product analytics, web analytics, session replay, error tracking, feature flags, experimentation, surveys, data warehouse, a CDP, and an AI product assistant to help debug your code, ship features faster, and keep all your usage and customer data in one stack.

shardingsphere

Empowering Data Intelligence with Distributed SQL for Sharding, Scalability, and Security Across All Databases.

ClickHouse

ClickHouse® is a real-time analytics database management system

starrocks

The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance for multi-dimensional analytics, real-time analytics, and ad-hoc queries. A Linux Foundation project.