Tracy Bannon's QCon AI NY 2025 talk revealed how the rise of AI agents risks amplifying common architectural failures. She emphasized the distinctions between bots, assistants, and agents, highlighting the need for governance, clear identity controls, and disciplined decision-making to address “agentic debt.” Bannon called for architects to apply foundational principles amid rapid AI adoption. By Andrew Hoblitzell

infoq.com

Andrew Hoblitzell

about 7 hours ago

Software Development ai mlops architecture ai-ethics system-design

InfoQ

How Artificial Intelligence Can Help Us Connect with Customers

In software development, success means going beyond meeting requirements. We must create products that surprise and delight users and are innovative, create impactful solutions, Ken Hughes said in the keynote “Connection is Everything”. AI can help us connect with customers and create better user experiences. By Ben Linders

infoq.com

Ben Linders

about 11 hours ago

Digital Focus ai nlp user-experience generative-ai webdev software-development mlops

InfoQ

Cactus v1: Cross-Platform LLM Inference on Mobile with Zero Latency and Full Privacy

Cactus, a Y Combinator-backed startup, enables local AI inference to mobile phones, wearables, and other low-power devices through cross-platform, energy-efficient kernels and a native runtime. It delivers sub-50ms time-to-first-token for on-device inference, eliminates network latency, and defaults to complete privacy. By Sergio De Simone

infoq.com

Sergio De Simone

1 day ago

Wearable Tech ai llm mobile-ai mobile mobile-privacy mlops edge-computing privacy on-device-inference mobile-ml mobile-inference

InfoQ

Presentation: Ecologies and Economics of Language AI in Practice

Jade Abbott discusses the shift from massive, resource-heavy models to "Little LMs" that prioritize efficiency and cultural sustainability. She explains how techniques like LoRA, quantization, and GRPO allow for high performance with less compute. By sharing the "Ubuntu Punk" philosophy, she shares how to move beyond extractive data practices toward human-centric, sustainable AI systems. By Jade Abbott

infoq.com

Jade Abbott

1 day ago

Transcripts llm ai-ethics data-engineering ai mlops nlp transformers

InfoQ

OpenAI and Anthropic Donate AGENTS.md and Model Context Protocol to New Agentic AI Foundation

OpenAI and Anthropic have donated their AGENTS.md and Model Context Protocol projects to the Agentic AI Foundation (AAIF), a new directed fund under the Linux Foundation. Block contributed their agent framework, goose, as another founding project, and several other tech companies have joined as Platinum members. By Anthony Alford

infoq.com

Anthony Alford

2 days ago

Anthropic ai mlops openapi open-source ai-ethics ai-research openai

InfoQ

Pinecone Introduces Dedicated Read Nodes in Public Preview for Predictable Vector Workloads

Pinecone recently announced the public preview of Dedicated Read Nodes (DRN), a new capacity mode for its vector database designed to deliver predictable performance and cost at scale for high-throughput applications such as billion-vector semantic search, recommendation systems, and mission-critical AI services. By Craig Risi

infoq.com

Craig Risi

2 days ago

vector databases vector-search databases cloud ai vector-databases mlops

InfoQ

Toad: A Unified CLI Tool for All Your LLMs That Promises Improved UX From Existing Ones

During his sabbatical, Will McGugan, maker of Rich and Textual( frameworks for making Textual User Interfaces (TUI)), put his UI skills to work to build Toad. The newly publicly released tool aims to provide a unified, “beautiful” GUI for multiple coding agents in your terminal, accessible via the same tool via the Agent Communication Protocol (ACP). By Olimpiu Pop

infoq.com

Olimpiu Pop

3 days ago

Agent Communication Protocol ai llm cli-tools developer-tools devops data-engineering mlops development-news

InfoQ

IBM Research Introduces CUGA, an Open-Source Configurable Agent Framework on Hugging Face

IBM Research has released CUGA (Configurable Generalist Agent) on Hugging Face Spaces, making its enterprise-oriented agent framework easier to evaluate with open models and real workflows. The move positions CUGA as a practical alternative to brittle, tightly coupled agent frameworks that often struggle with tool misuse, long-horizon reasoning, and recovery from failure. By Robert Krzaczyński

infoq.com

Robert Krzaczyński

4 days ago

Agents ai mlops transformers llm agents model-serving agent-framework ai-ethics generative-ai nlp knowledge-graphs

InfoQ

QConAI NY 2025 - Designing AI Platforms for Reliability: Tools for Certainty, Agents for Discovery

Aaron Erickson at QCon AI NYC 2025 emphasized treating agentic AI as an engineering challenge, focusing on reliability through the blend of probabilistic and deterministic systems. He argued for clear operational structures to minimize risks and optimize performance, highlighting the importance of specialized agents and deterministic paths to enhance accuracy and control in AI workflows. By Andrew Hoblitzell

infoq.com

Andrew Hoblitzell

5 days ago

Machine Learning ai mlops ai-ethics model-serving ai-research

Top posts from tech subreddits• Updated less than a minute ago

Kimi K2 Thinking at 28.3 t/s on 4x Mac Studio cluster

i.redd.it

388

110

geerlingguy

7 days ago

r/LocalLLaMA cluster ai cloud mlops

What is something AI still struggles with, in your experience?

reddit.com

Govind_goswami

7 days ago

r/artificial ai ai-ethics ai-research mlops nlp

Computational Creativity – Call for Papers for ICCC'26

computationalcreativity.net

ICCCConf-Publicity

9 days ago

r/coding ai ai-research mlops nlp research generative-ai

AWS CEO says replacing junior devs with AI is 'one of the dumbest ideas'

finalroundai.com

5110

222

ImpressiveContest283

8 days ago

r/programming ai cloud developer-tools aws mlops

Our AI sales agent has surprisingly brought in 29 new paying enterprise customers

reddit.com

Icy_Science1948

8 days ago

r/AI_Agents ai ai-research mlops nlp generative-ai

AI helps ship faster but it produces 1.7× more bugs

coderabbit.ai

476

128

rag1987

8 days ago

r/programming software-development ai ai-ethics performance mlops software-quality

Humans are now the minority online

euractiv.com

1342

108

Massimo25ore

7 days ago

r/technology ai data-science ai-ethics social-media internet-trends big-data mlops

Maestro – Run AI coding agents autonomously for days (Free/OSS)

i.redd.it

pedramamini

8 days ago

r/LocalLLaMA ai mlops open-source

Help me price my AI agent user usage

reddit.com

Worth_Ad8415

8 days ago

r/AI_Agents ai ai-ethics model-serving mlops generative-ai

157216

Hugging Face Trending

Popular models from Hugging Face• Updated 18 minutes ago

StepFun

GELab-Zero-4B-preview

673

generative-ai ai mlops

ByteDance

BindWeave

ai mlops deep-learning

Microsoft

NextCoder-32B

Task: text-generation

297

mlops transformers generative-ai

MIT HAN Lab

nunchaku-flux.1-kontext-dev

ai mlops devops

Black Forest Labs

FLUX.1-Kontext-dev-onnx

ai mlops open-source

Google

videoprism

Task: video-classification

webdev ai mlops

SUFE-AIFLM-Lab

Fin-R1

179

1,575

ai mlops research

GitHub Trending

Popular repositories from GitHub• Updated 32 minutes ago

sympy

A computer algebra system written in pure Python

Python

14,221

4,965

python developer-tools open-source computer-vision math sympy computer-algebra mlops

microsoft

qlib

Qlib is an AI-oriented Quant investment platform that aims to use AI tech to empower Quant Research, from exploring ideas to implementing productions. Qlib supports diverse ML modeling paradigms, including supervised learning, market dynamics modeling, and RL, and is now equipped with https://github.com/microsoft/RD-Agent to automate R&D process.

Python

34,807

5,409

ai data-science reinforcement-learning mlops quantitative-finance quantitative-investment quantitative-investing deep-learning quantitative-research model-serving python

mindsdb

Federated query engine for AI - The only MCP Server you'll ever need

Python

37,873

6,059

ai data-science python data-engineering mlops knowledge-graphs rag model-serving database-tuning

DLR-RM

stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Python

12,318

2,017

reinforcement-learning pytorch python deep-learning ai mlops

n8n-io

n8n

Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.

TypeScript

163,457

52,227

automation self-hosting typescript workflow-automation ai mlops self-hosted webdev devops

feast-dev

feast

The Open Source Feature Store for AI/ML

Python

6,545

1,184

open-source ai mlops data-engineering python feature-engineering

huggingface

transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python

154,048

31,492

transformers pytorch tensorflow deep-learning nlp python mlops ai

activepieces

AI Agents & MCPs & AI Workflow Automation • (~400 MCP servers for AI agents) • AI Automation / AI Agent with MCPs • AI Workflows & AI Agents • MCPs for AI Agents

TypeScript

19,651

3,015

ai llm typescript open-source automation workflow-automation ai-ethics mlops ai-agents ai-research

deepspeedai

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python

40,983

4,664

deep-learning mlops python model-serving transformers

Trending

Hacker News

InfoQ

Reddit

Hugging Face Trending

GitHub Trending