Trending

Content tagged with "llm"


InfoQ

Latest articles from InfoQ

Article: NextGen Search - Where AI Meets OpenSearch Through MCP

In this article, authors Srikanth Daggumalli and Arun Lakshmanan discuss next-generation context-aware conversational search using OpenSearch and AI agents powered by Large Language Models (LLMs) and Model Context Protocol (MCP). By Srikanth Daggumalli, Arun Lakshmanan

infoq.com

TornadoVM 2.0 Brings Automatic GPU Acceleration and LLM Support to Java

The TornadoVM project recently reached version 2.0, a major milestone for the open-source project that aims to provide a heterogeneous hardware runtime for Java. The project automatically accelerates Java programs on multi-core CPUs, GPUs, and FPGAs. This release is likely to be of particular interest to teams developing LLM solutions on the JVM. By Ben Evans

infoq.com

Podcast: Building a More Appealing CLI for Agentic LLMs Based on Learnings from the Textual Framework

Will McGugan, the maker of the Textual and Rich frameworks, speaks about the reasoning behind developing the two libraries and the lessons learned. He also sheds light on Toad, his current project, which he envisions as a more visually appealing way of interacting with agentic LLMs through the command line. By Will McGugan

infoq.com

Podcast: Platform Engineering for AI: Scaling Agents and MCP at LinkedIn

QCon AI New York Chair Wes Reisz talks with LinkedIn’s Karthik Ramgopal and Prince Valluri about enabling AI agents at enterprise scale. They discuss how platform teams orchestrate secure, multi-agentic systems, the role of MCP, the use of foreground and background agents, improving developer experience, and reducing toil. By Karthik Ramgopal, Prince Valluri

infoq.com

OpenAI's New GPT-5.1 Models Are Faster and More Conversational

OpenAI recently released upgrades to its GPT-5 model family. GPT‑5.1 Instant, the default chat model, has improvements to instruction following. GPT‑5.1 Thinking, the reasoning model, is faster and gives more understandable responses. GPT‑5.1-Codex-Max, the coding model, is trained to use compaction to perform long-running tasks. By Anthony Alford

infoq.com

Replit Introduces New AI Integrations for Multi-Model Development

Replit has introduced Replit AI Integrations, a feature that lets users select third-party models directly inside the IDE and automatically generate the code needed to run inference. By Daniel Dominguez

infoq.com



GitHub Trending

Popular repositories from GitHub

RWKV-LM

RWKV (pronounced RwaKuv) is an RNN with great LLM performance that can also be trained directly like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". It combines the best of RNN and transformer: great performance, linear time, constant space (no kv-cache), fast training, infinite ctx_len, and free sentence embedding.

Python · 13,338 stars · 900 forks
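The "trained like a transformer, run like an RNN" duality that the RWKV description alludes to can be sketched with a toy linear-attention layer: the same causal computation runs in a parallel mode (transformer-style, good for training) or a recurrent mode (RNN-style, fixed-size state, no kv-cache). This is not RWKV's actual formulation; every name and formula below is an illustrative assumption.

```python
# Toy sketch (NOT RWKV's real equations): causal linear attention
# computed two ways that provably give identical outputs.
import numpy as np

def phi(x):
    # Positive feature map so the normaliser never vanishes.
    return np.maximum(x, 0.0) + 1e-6

def parallel_mode(Q, K, V):
    # Transformer-style: score every query against every key at once.
    # O(T^2) work, memory grows with sequence length -- parallelisable
    # across time steps, which is what makes training fast.
    A = np.tril(phi(Q) @ phi(K).T)     # causal mask: keep s <= t only
    A = A / A.sum(axis=1, keepdims=True)
    return A @ V

def recurrent_mode(Q, K, V):
    # RNN-style: fold the same computation into a fixed-size state
    # (S, z), so memory is constant regardless of sequence length.
    S = np.zeros((K.shape[1], V.shape[1]))   # running key-value summary
    z = np.zeros(K.shape[1])                 # running normaliser
    out = []
    for q, k, v in zip(Q, K, V):
        S += np.outer(phi(k), v)
        z += phi(k)
        out.append(phi(q) @ S / (phi(q) @ z))
    return np.array(out)
```

Both modes produce identical outputs for the same inputs; training in the parallel form and serving in the recurrent form is, roughly, where the "constant space (no kv-cache)" claim comes from.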