Trending

Content tagged with "mlops"

Hacker News

Top stories from the Hacker News community • Updated 4 minutes ago

InfoQ

Latest articles from InfoQ • Updated 4 minutes ago

Article: Where Architects Sit in the Era of AI

As AI evolves from tool to collaborator, architects must shift from manual design to meta-design. This article introduces the "Three Loops" framework (In, On, Out) to help navigate this transition. It explores how to balance oversight with delegation, mitigate risks like skill atrophy, and design the governance structures that keep AI-augmented systems safe and aligned with human intent. By Dave Holliday, João Carlos Gonçalves, Manoj Kumar Yadav

infoq.com
about 1 hour ago

Google Cloud Launches Managed MCP Support

Google Cloud's introduction of fully managed Model Context Protocol (MCP) servers revolutionizes its API infrastructure, streamlining access for developers. This enterprise-ready solution enhances AI integration across services such as Google Maps and BigQuery while promoting wide-scale adoption. New tools ensure governance and security, and are currently in public preview. By Steef-Jan Wiggers

about 24 hours ago

Article: Architecture in a Flow of AI-Augmented Change

While AI adoption is surging, most organizations fail to scale past pilots. The solution lies in organizational structure, not just technology. This article details how architects can enable "fast flow" by defining clear domains and guardrails. Learn how to shift from controlling outcomes to curating context, allowing AI to drive continuous, valuable business change. By Jonathan McPhail, Juan Medina, Jake DeCrane, Isuru Wijesundara

1 day ago

OpenAI at QCon AI NYC: Fine Tuning the Enterprise

At QCon AI NYC 2025, Will Hang from OpenAI unveiled Agent RFT—a cutting-edge reinforcement fine-tuning approach for tool-using agents. By optimizing prompts and tasks before model adjustments, Hang showcased effective strategies to enhance decision-making and efficiency, emphasizing a balanced grading system. The session revealed a future where smarter agents reduce latency and improve outcomes. By Andrew Hoblitzell

TornadoVM 2.0 Brings Automatic GPU Acceleration and LLM support to Java

The TornadoVM project recently reached version 2.0, a major milestone for the open-source project that aims to provide a heterogeneous hardware runtime for Java. The project automatically accelerates Java programs on multi-core CPUs, GPUs, and FPGAs. This release is likely to be of particular interest to teams developing LLM solutions on the JVM. By Ben Evans

Transformers v5 Introduces a More Modular and Interoperable Core

Hugging Face has released the first release candidate for Transformers v5, marking a significant evolution from v4 five years ago. The library has grown from a specialized model toolkit to a critical resource in AI development, achieving over three million installations daily and more than 1.2 billion total installs. By Robert Krzaczyński

3 days ago

Meta's Optimization Platform Ax 1.0 Streamlines LLM and System Optimization

Now stable, Ax is an open-source platform from Meta designed to help researchers and engineers apply machine learning to complex, resource-intensive experimentation. Over the past several years, Meta has used Ax to improve AI models, accelerate machine learning research, tune production infrastructure, and more. By Sergio De Simone

Lyft Rearchitects ML Platform with Hybrid AWS SageMaker-Kubernetes Approach

Lyft has rearchitected its machine learning platform LyftLearn into a hybrid system, moving offline workloads to AWS SageMaker while retaining Kubernetes for online model serving. Its decision to choose managed services where operational complexity was highest, while maintaining custom infrastructure where control mattered most, offers a pragmatic alternative to unified platform strategies. By Eran Stiller

Presentation: Powering Enterprise AI Applications with Data and Open Source Software

Francisco Javier Arceo explored Feast, the open-source feature store designed to address common data challenges in the AI/ML lifecycle, such as feature redundancy and low-latency serving at scale. By Francisco Javier Arceo

Reddit

Top posts from tech subreddits • Updated about 2 hours ago

Hugging Face Trending

Popular models from Hugging Face • Updated about 1 hour ago

GitHub Trending

Popular repositories from GitHub • Updated 1 minute ago

cube-studio

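Cube Studio is an open-source, cloud-native, one-stop AI platform for machine learning, deep learning, and large models, covering the full MLOps algorithm pipeline. It supports integration with big-data platforms, online notebook development, drag-and-drop pipeline orchestration, multi-node multi-GPU distributed training, hyperparameter search, inference serving with vGPU virtualization, edge computing, a labeling platform with automated annotation, SFT fine-tuning, reward-model, and reinforcement-learning training for large models such as DeepSeek, multi-node large-model inference with vllm/ollama/mindie, private knowledge bases, and an AI model marketplace. It also supports Chinese domestic CPU/GPU/NPU hardware and the Ascend ecosystem, RDMA, and distributed frameworks such as pytorch/tf/mxnet/deepspeed/paddle/colossalai/horovod/ray/volcano.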

Jupyter Notebook
4,403
758

faststream

FastStream is a powerful and easy-to-use Python framework for building asynchronous services interacting with event streams such as Apache Kafka, RabbitMQ, NATS and Redis.

best-of-ml-python

🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.

xla

A machine learning compiler for GPUs, CPUs, and ML accelerators

modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

autogluon

Fast and Accurate ML in 3 Lines of Code

Python
8,887
1,024

azureml-examples

Official community-driven Azure Machine Learning examples, tested with GitHub Actions.

Jupyter Notebook
1,872
1,536

evidently

Evidently is an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. From tabular data to Gen AI. 100+ metrics.

Jupyter Notebook
6,110
672