Trending
Content tagged with "data-engineering"
Hacker News
Top stories from the Hacker News community• Updated 4 minutes ago
InfoQ
Latest articles from InfoQ• Updated 4 minutes ago
Presentation: Ecologies and Economics of Language AI in Practice
Jade Abbott discusses the shift from massive, resource-heavy models to "Little LMs" that prioritize efficiency and cultural sustainability. She explains how techniques like LoRA, quantization, and GRPO allow for high performance with less compute. By sharing the "Ubuntu Punk" philosophy, she shares how to move beyond extractive data practices toward human-centric, sustainable AI systems. By Jade Abbott
Toad: A Unified CLI Tool for All Your LLMs That Promises Improved UX From Existing Ones
During his sabbatical, Will McGugan, maker of Rich and Textual( frameworks for making Textual User Interfaces (TUI)), put his UI skills to work to build Toad. The newly publicly released tool aims to provide a unified, “beautiful” GUI for multiple coding agents in your terminal, accessible via the same tool via the Agent Communication Protocol (ACP). By Olimpiu Pop
Decathlon Switches to Polars to Optimize Data Pipelines and Infrastructure Costs
Decathlon, one of the world's leading sports retailers, recently shared why it adopted the open source library Polars to optimize its data pipelines. The Decathlon Digital team found that migrating from Apache Spark to Polars for small input datasets provides significant speed and cost savings. By Renato Losio
Presentation: Lessons Learned From Shipping AI-Powered Healthcare Products
Clara Matos discusses the journey of shipping AI-powered healthcare products at Sword Health. She explains how to implement input/output guardrails for regulated industries and shares a framework for robust evaluations using human and LLM-based ratings. From prompt engineering to RAG and user feedback loops, she shares a data-driven roadmap for building reliable AI care agents at scale. By Clara Matos
Article: Architecture in a Flow of AI-Augmented Change
While AI adoption is surging, most organizations fail to scale past pilots. The solution lies in organizational structure, not just technology. This article details how architects can enable "fast flow" by defining clear domains and guardrails. Learn how to shift from controlling outcomes to curating context, allowing AI to drive continuous, valuable business change. By Jonathan McPhail, Juan Medina, Jake DeCrane, Isuru Wijesundara
QCon AI New York 2025: Moving Mountains: Migrating Legacy Code in Weeks Instead of Years
David Stein, Principal AI Engineer at ServiceTitan, presented “Moving Mountains: Migrating Legacy Code in Weeks instead of Years” at QCon AI New York 2025. Stein demonstrated how migrations don’t have to be synonymous to “moving mountains” and introduced the concepts of the Principle of Acceleration and the Assembly Line Pattern. By Michael Redlich
Article: NextGen Search - Where AI Meets OpenSearch Through MCP
In this article, authors Srikanth Daggumalli and Arun Lakshmanan discuss next-generation context-aware conversational search using OpenSearch and AI agents powered by Large Language Models (LLMs) and Model Context Protocol (MCP). By Srikanth Daggumalli, Arun Lakshmanan
TornadoVM 2.0 Brings Automatic GPU Acceleration and LLM support to Java
The TornadoVM project recently reached version 2.0, a major milestone for the open-source project that aims to provide a heterogeneous hardware runtime for Java. The project automatically accelerates Java programs on multi-core CPUs, GPUs, and FPGAs. This release is likely to be of particular interest to teams developing LLM solutions on the JVM. By Ben Evans
Presentation: Powering Enterprise AI Applications with Data and Open Source Software
Francisco Javier Arceo explored Feast, the open-source feature store designed to address common data challenges in the AI/ML lifecycle, such as feature redundancy, and low-latency serving at scale. By Francisco Javier Arceo
Top posts from tech subreddits• Updated about 1 hour ago
The Most Powerful Way to Build AI Agents: LangGraph + Pydantic AI (Detailed Example)
Hugging Face Trending
Popular models from Hugging Face• Updated about 1 hour ago
No models found
Try removing the tag filter or searching for different content.
GitHub Trending
Popular repositories from GitHub• Updated 1 minute ago
pathway
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
starrocks
The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance for multi-dimensional analytics, real-time analytics, and ad-hoc queries. A Linux Foundation project.
rclone
"rsync for cloud storage" - Google Drive, S3, Dropbox, Backblaze B2, One Drive, Swift, Hubic, Wasabi, Google Cloud Storage, Azure Blob, Azure Files, Yandex Files
posthog
🦔 PostHog is an all-in-one developer platform for building successful products. We offer product analytics, web analytics, session replay, error tracking, feature flags, experimentation, surveys, data warehouse, a CDP, and an AI product assistant to help debug your code, ship features faster, and keep all your usage and customer data in one stack.
simdjson
Parsing gigabytes of JSON per second : used by Facebook/Meta Velox, the Node.js runtime, ClickHouse, WatermelonDB, Apache Doris, Milvus, StarRocks