Trending

Content tagged with "mlops"

Hacker News

Top stories from the Hacker News community • Updated 9 minutes ago

The new calculus of AI-based coding

blog.joemag.dev

InfoQ

Latest articles from InfoQ • Updated less than a minute ago

AWS Expands Well-Architected Framework with Responsible AI and Updated ML and Generative AI Lenses

At AWS re:Invent 2025, AWS expanded its Well-Architected Framework with a new Responsible AI Lens and updated Machine Learning and Generative AI Lenses. The updates provide guidance on governance, bias mitigation, scalable ML workflows, and trustworthy AI system design across the full AI lifecycle. By Leela Kumili

infoq.com

QCon AI New York 2025: AI Platform Scaling at LinkedIn

At QCon AI NY 2025, LinkedIn's Prince Valluri and Karthik Ramgopal unveiled an internal platform for AI agents, prioritizing execution over intelligence. By using structured specifications within a robust orchestration layer, they enhance agent observability and interoperability while ensuring human accountability. By Andrew Hoblitzell

infoq.com
about 19 hours ago

Presentation: Lessons Learned From Shipping AI-Powered Healthcare Products

Clara Matos discusses the journey of shipping AI-powered healthcare products at Sword Health. She explains how to implement input/output guardrails for regulated industries and shares a framework for robust evaluations using human and LLM-based ratings. From prompt engineering to RAG and user feedback loops, she shares a data-driven roadmap for building reliable AI care agents at scale. By Clara Matos

infoq.com

Article: Where Architects Sit in the Era of AI

As AI evolves from tool to collaborator, architects must shift from manual design to meta-design. This article introduces the "Three Loops" framework (In, On, Out) to help navigate this transition. It explores how to balance oversight with delegation, mitigate risks like skill atrophy, and design the governance structures that keep AI-augmented systems safe and aligned with human intent. By Dave Holliday, João Carlos Gonçalves, Manoj Kumar Yadav

infoq.com
1 day ago

Google Cloud Launches Managed MCP Support

Google Cloud's introduction of fully managed Model Context Protocol (MCP) servers revolutionizes its API infrastructure, streamlining access for developers. This enterprise-ready solution enhances AI integration across services such as Google Maps and BigQuery while promoting wide-scale adoption. New tools ensure governance and security, and are currently in public preview. By Steef-Jan Wiggers

infoq.com

Article: Architecture in a Flow of AI-Augmented Change

While AI adoption is surging, most organizations fail to scale past pilots. The solution lies in organizational structure, not just technology. This article details how architects can enable "fast flow" by defining clear domains and guardrails. Learn how to shift from controlling outcomes to curating context, allowing AI to drive continuous, valuable business change. By Jonathan McPhail, Juan Medina, Jake DeCrane, Isuru Wijesundara

infoq.com
2 days ago

OpenAI at QCon AI NYC: Fine Tuning the Enterprise

At QCon AI NYC 2025, Will Hang from OpenAI unveiled Agent RFT—a cutting-edge reinforcement fine-tuning approach for tool-using agents. By optimizing prompts and tasks before model adjustments, Hang showcased effective strategies to enhance decision-making and efficiency, emphasizing a balanced grading system. The session revealed a future where smarter agents reduce latency and improve outcomes. By Andrew Hoblitzell

infoq.com

TornadoVM 2.0 Brings Automatic GPU Acceleration and LLM support to Java

The TornadoVM project recently reached version 2.0, a major milestone for the open-source project that aims to provide a heterogeneous hardware runtime for Java. The project automatically accelerates Java programs on multi-core CPUs, GPUs, and FPGAs. This release is likely to be of particular interest to teams developing LLM solutions on the JVM. By Ben Evans

infoq.com

Transformers v5 Introduces a More Modular and Interoperable Core

Hugging Face has released the first release candidate for Transformers v5, a significant evolution since v4 was released five years ago. The library has grown from a specialized model toolkit into a critical resource in AI development, with more than three million installations per day and over 1.2 billion total installs. By Robert Krzaczyński

infoq.com
4 days ago

Reddit

Top posts from tech subreddits • Updated 18 minutes ago

Hugging Face Trending

Popular models from Hugging Face • Updated 36 minutes ago

GitHub Trending

Popular repositories from GitHub • Updated about 1 hour ago

n8n

Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.

feast

The Open Source Feature Store for AI/ML

transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

mindsdb

Federated query engine for AI - The only MCP Server you'll ever need

stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

activepieces

AI agents, MCPs, and AI workflow automation (~400 MCP servers for AI agents)

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

sympy

A computer algebra system written in pure Python

cuda-python

CUDA Python: Performance meets Productivity