Trending

Content tagged with "llm"

llm

Hacker News

Top stories from the Hacker News community• Updated 11 minutes ago

InfoQ

Latest articles from InfoQ• Updated 11 minutes ago

InfoQ

Cactus v1: Cross-Platform LLM Inference on Mobile with Zero Latency and Full Privacy

Cactus, a Y Combinator-backed startup, enables local AI inference to mobile phones, wearables, and other low-power devices through cross-platform, energy-efficient kernels and a native runtime. It delivers sub-50ms time-to-first-token for on-device inference, eliminates network latency, and defaults to complete privacy. By Sergio De Simone

infoq.com
InfoQ

Presentation: Ecologies and Economics of Language AI in Practice

Jade Abbott discusses the shift from massive, resource-heavy models to "Little LMs" that prioritize efficiency and cultural sustainability. She explains how techniques like LoRA, quantization, and GRPO allow for high performance with less compute. By sharing the "Ubuntu Punk" philosophy, she shares how to move beyond extractive data practices toward human-centric, sustainable AI systems. By Jade Abbott

infoq.com
InfoQ

Target Improves Add to Cart Interactions by 11 Percent with Generative AI Recommendations

Target has deployed GRAM, a GenAI-powered accessory recommendation system for the Home category, using large language models to prioritize product attributes and capture aesthetic cohesion. The system helps shoppers find compatible accessories, integrates human-in-the-loop curation, and achieved measurable improvements in engagement and conversion. By Leela Kumili

infoq.com
InfoQ

Toad: A Unified CLI Tool for All Your LLMs That Promises Improved UX From Existing Ones

During his sabbatical, Will McGugan, maker of Rich and Textual( frameworks for making Textual User Interfaces (TUI)), put his UI skills to work to build Toad. The newly publicly released tool aims to provide a unified, “beautiful” GUI for multiple coding agents in your terminal, accessible via the same tool via the Agent Communication Protocol (ACP). By Olimpiu Pop

infoq.com
InfoQ

Meta Details GEM Ads Model Using LLM-Scale Training, Hybrid Parallelism, and Knowledge Transfer

Meta released details about its Generative Ads Model (GEM), a foundation model designed to improve ads recommendation across its platforms. The model addresses core challenges in recommendation systems (RecSys) by processing billions of daily user-ad interactions where meaningful signals such as clicks and conversions are very sparse. By Vinod Goje

infoq.com
InfoQ

IBM Research Introduces CUGA, an Open-Source Configurable Agent Framework on Hugging Face

IBM Research has released CUGA (Configurable Generalist Agent) on Hugging Face Spaces, making its enterprise-oriented agent framework easier to evaluate with open models and real workflows. The move positions CUGA as a practical alternative to brittle, tightly coupled agent frameworks that often struggle with tool misuse, long-horizon reasoning, and recovery from failure. By Robert Krzaczyński

infoq.com
InfoQ

Presentation: Lessons Learned From Shipping AI-Powered Healthcare Products

Clara Matos discusses the journey of shipping AI-powered healthcare products at Sword Health. She explains how to implement input/output guardrails for regulated industries and shares a framework for robust evaluations using human and LLM-based ratings. From prompt engineering to RAG and user feedback loops, she shares a data-driven roadmap for building reliable AI care agents at scale. By Clara Matos

infoq.com
InfoQ

Article: NextGen Search - Where AI Meets OpenSearch Through MCP

In this article, authors Srikanth Daggumalli and Arun Lakshmanan discuss next-generation context-aware conversational search using OpenSearch and AI agents powered by Large Language Models (LLMs) and Model Context Protocol (MCP). By Srikanth Daggumalli, Arun Lakshmanan

infoq.com
InfoQ

TornadoVM 2.0 Brings Automatic GPU Acceleration and LLM support to Java

The TornadoVM project recently reached version 2.0, a major milestone for the open-source project that aims to provide a heterogeneous hardware runtime for Java. The project automatically accelerates Java programs on multi-core CPUs, GPUs, and FPGAs. This release is likely to be of particular interest to teams developing LLM solutions on the JVM. By Ben Evans

infoq.com
2

Reddit

Top posts from tech subreddits• Updated 11 minutes ago

Reddit

GLM 4.7 is out on HF!

huggingface.co
417
98
KvAk_AKPlaysYT
2 days ago

Hugging Face Trending

Popular models from Hugging Face• Updated about 1 hour ago

GitHub Trending

Popular repositories from GitHub• Updated 8 minutes ago

pathway

Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.

langchain

🦜🔗 The platform for reliable agents.

openai-cookbook

Examples and guides for using the OpenAI API

khoj

Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.

awesome-chatgpt-prompts

Share, discover, and collect prompts from the community. Free and open source — self-host for your organization with complete privacy.

llama_index

LlamaIndex is the leading framework for building LLM-powered agents over your data.

Megatron-LM

Ongoing research training transformer models at scale

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

activepieces

AI Agents & MCPs & AI Workflow Automation • (~400 MCP servers for AI agents) • AI Automation / AI Agent with MCPs • AI Workflows & AI Agents • MCPs for AI Agents