Tracy Bannon's QCon AI NY 2025 talk revealed how the rise of AI agents risks amplifying common architectural failures. She emphasized the distinctions between bots, assistants, and agents, highlighting the need for governance, clear identity controls, and disciplined decision-making to address “agentic debt.” Bannon called for architects to apply foundational principles amid rapid AI adoption. By Andrew Hoblitzell

infoq.com

Andrew Hoblitzell

about 4 hours ago

Software Development ai mlops architecture ai-ethics system-design

InfoQ

Cactus v1: Cross-Platform LLM Inference on Mobile with Zero Latency and Full Privacy

Cactus, a Y Combinator-backed startup, brings local AI inference to mobile phones, wearables, and other low-power devices through cross-platform, energy-efficient kernels and a native runtime. It delivers sub-50ms time-to-first-token for on-device inference, eliminates network latency, and defaults to complete privacy. By Sergio De Simone

infoq.com

Sergio De Simone

1 day ago

Wearable Tech ai llm mobile-ai mobile mobile-privacy mlops edge-computing privacy on-device-inference mobile-ml

InfoQ

Presentation: Ecologies and Economics of Language AI in Practice

Jade Abbott discusses the shift from massive, resource-heavy models to "Little LMs" that prioritize efficiency and cultural sustainability. She explains how techniques like LoRA, quantization, and GRPO allow for high performance with less compute. Drawing on the "Ubuntu Punk" philosophy, she shows how to move beyond extractive data practices toward human-centric, sustainable AI systems. By Jade Abbott

infoq.com

Jade Abbott

1 day ago

Transcripts llm ai-ethics data-engineering ai mlops nlp transformers

InfoQ

OpenAI and Anthropic Donate AGENTS.md and Model Context Protocol to New Agentic AI Foundation

OpenAI and Anthropic have donated their AGENTS.md and Model Context Protocol projects to the Agentic AI Foundation (AAIF), a new directed fund under the Linux Foundation. Block contributed their agent framework, goose, as another founding project, and several other tech companies have joined as Platinum members. By Anthony Alford

infoq.com

Anthony Alford

2 days ago

Anthropic ai mlops openapi open-source ai-ethics ai-research openai

InfoQ

Pinecone Introduces Dedicated Read Nodes in Public Preview for Predictable Vector Workloads

Pinecone recently announced the public preview of Dedicated Read Nodes (DRN), a new capacity mode for its vector database designed to deliver predictable performance and cost at scale for high-throughput applications such as billion-vector semantic search, recommendation systems, and mission-critical AI services. By Craig Risi

infoq.com

Craig Risi

2 days ago

vector databases vector-search databases cloud ai vector-databases mlops

InfoQ

Toad: A Unified CLI Tool for All Your LLMs That Promises Improved UX From Existing Ones

During his sabbatical, Will McGugan, maker of Rich and Textual (frameworks for building text-based user interfaces, or TUIs), put his UI skills to work to build Toad. The newly released tool aims to provide a unified, “beautiful” interface for multiple coding agents in your terminal, all accessible from the same tool via the Agent Communication Protocol (ACP). By Olimpiu Pop

infoq.com

Olimpiu Pop

3 days ago

Agent Communication Protocol ai llm cli-tools developer-tools devops data-engineering mlops development-news

InfoQ

IBM Research Introduces CUGA, an Open-Source Configurable Agent Framework on Hugging Face

IBM Research has released CUGA (Configurable Generalist Agent) on Hugging Face Spaces, making its enterprise-oriented agent framework easier to evaluate with open models and real workflows. The move positions CUGA as a practical alternative to brittle, tightly coupled agent frameworks that often struggle with tool misuse, long-horizon reasoning, and recovery from failure. By Robert Krzaczyński

infoq.com

Robert Krzaczyński

4 days ago

Agents ai mlops transformers llm agents model-serving agent-framework ai-ethics generative-ai nlp knowledge-graphs

InfoQ

QConAI NY 2025 - Designing AI Platforms for Reliability: Tools for Certainty, Agents for Discovery

Aaron Erickson at QCon AI NYC 2025 emphasized treating agentic AI as an engineering challenge, focusing on reliability through a blend of probabilistic and deterministic systems. He argued for clear operational structures to minimize risks and optimize performance, highlighting the importance of specialized agents and deterministic paths for improving accuracy and control in AI workflows. By Andrew Hoblitzell

infoq.com

Andrew Hoblitzell

5 days ago

Machine Learning ai mlops ai-ethics model-serving ai-research

InfoQ

Google Metrax Brings Predefined Model Evaluation Metrics to JAX

Recently open-sourced by Google, Metrax is a JAX library providing standardized, performant metrics implementations for classification, regression, NLP, vision, and audio models. By Sergio De Simone

infoq.com

Sergio De Simone

5 days ago

JAX mlops ai tensorflow python model-serving ml model-evaluation transformers jax

Top posts from tech subreddits • Updated 15 minutes ago

Things Programmers Missed While Using AI

medium.com

delvin0

3 days ago

r/coding software-development ai programming developer-tools mlops

Is MCP Overhyped?

youtu.be

113

Helpful_Geologist430

4 days ago

r/programming ai model-serving scalability performance mlops system-design

[R] EGGROLL: trained a model without backprop and found it generalized better

reddit.com

Ok_Rub1689

4 days ago

r/MachineLearning neural-networks ai optimization model-optimization deep-learning mlops research model-training generative-ai

ai startup claims new model debugs software better than GPT-4 ... real or bs?

kodezi.com

rtbot2

4 days ago

r/realtech ai ai-research mlops generative-ai

Dataset quality is not improving much

huggingface.co

100

rekriux

4 days ago

r/LocalLLaMA data-science mlops data-quality data-engineering

NOAA's new AI weather system promises faster forecasts with less computing power

techspot.com

rtbot2

4 days ago

r/realtech ai data-science cloud big-data deep-learning mlops

MiniMax 2.1 release?

i.redd.it

112

_cttt_

5 days ago

r/LocalLLaMA ai model-serving mlops

I am new to this stuff where should I start

reddit.com

Vishu_8435

5 days ago

r/AI_Agents developer-experience ai data-science education beginners learning learning-resources mlops beginner

[D] [P] WrenAI System Architecture

reddit.com

jorgemaagomes

4 days ago

r/MachineLearning ai mlops system-design generative-ai system-architecture

Hugging Face Trending

Popular models from Hugging Face • Updated 15 minutes ago

StepFun

GELab-Zero-4B-preview

673

generative-ai ai mlops

ByteDance

BindWeave

ai mlops deep-learning

Microsoft

NextCoder-32B

Task: text-generation

297

mlops transformers generative-ai

MIT HAN Lab

nunchaku-flux.1-kontext-dev

ai mlops devops

Black Forest Labs

FLUX.1-Kontext-dev-onnx

ai mlops open-source

Google

videoprism

Task: video-classification

webdev ai mlops

SUFE-AIFLM-Lab

Fin-R1

179

1,575

ai mlops research

GitHub Trending

Popular repositories from GitHub • Updated 29 minutes ago

sympy

A computer algebra system written in pure Python

Python

14,221

4,965

python developer-tools open-source computer-vision math sympy computer-algebra mlops

microsoft

qlib

Qlib is an AI-oriented Quant investment platform that aims to use AI tech to empower Quant Research, from exploring ideas to implementing productions. Qlib supports diverse ML modeling paradigms, including supervised learning, market dynamics modeling, and RL, and is now equipped with https://github.com/microsoft/RD-Agent to automate R&D process.

Python

34,807

5,409

ai data-science reinforcement-learning mlops quantitative-finance quantitative-investment quantitative-investing deep-learning quantitative-research model-serving python

mindsdb

Federated query engine for AI - The only MCP Server you'll ever need

Python

37,873

6,059

ai data-science python data-engineering mlops knowledge-graphs rag model-serving database-tuning

DLR-RM

stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Python

12,318

2,017

reinforcement-learning pytorch python deep-learning ai mlops

n8n-io

n8n

Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.

TypeScript

163,457

52,227

automation self-hosting typescript workflow-automation ai mlops self-hosted webdev devops

feast-dev

feast

The Open Source Feature Store for AI/ML

Python

6,545

1,184

open-source ai mlops data-engineering python feature-engineering

huggingface

transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python

154,048

31,492

transformers pytorch tensorflow deep-learning nlp python mlops ai

activepieces

AI Agents & MCPs & AI Workflow Automation • (~400 MCP servers for AI agents) • AI Automation / AI Agent with MCPs • AI Workflows & AI Agents • MCPs for AI Agents

TypeScript

19,651

3,015

ai llm typescript open-source automation workflow-automation ai-ethics mlops ai-agents ai-research

deepspeedai

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python

40,983

4,664

deep-learning mlops python model-serving transformers
