Trending

Content tagged with "transformers"

transformers

Hacker News

Top stories from the Hacker News community• Updated 9 minutes ago

HN

We Need Arabic Language Models

natureasia.com
9
3
thinkingemote
about 2 months ago
HN

Caveat Prompter

surfingcomplexity.blog
9
5
azhenley
about 2 months ago
HN

Codex Is Live in Zed

zed.dev
256
55
meetpateltech
about 2 months ago
HN

Claude Haiku 4.5

anthropic.com
688
271
adocomplete
about 2 months ago
HN

Recursive Language Models (RLMs)

alexzhang13.github.io
103
28
talhof8
about 2 months ago

InfoQ

Latest articles from InfoQ• Updated 9 minutes ago

InfoQ

OpenAI's New GPT-5.1 Models are Faster and More Conversational

OpenAI recently released upgrades to their GPT-5 model. GPT‑5.1 Instant, the default chat model, has improvements to instruction following. GPT‑5.1 Thinking, the reasoning model, is faster and gives more understandable responses. GPT‑5.1-Codex-Max, the coding model, is trained to use compaction to perform long-running tasks. By Anthony Alford

infoq.com
Anthony Alford
2 days ago

Reddit

Top posts from tech subreddits• Updated about 2 hours ago

Hugging Face Trending

Popular models from Hugging Face• Updated about 1 hour ago

GitHub Trending

Popular repositories from GitHub• Updated 5 minutes ago

whisper.cpp

Port of OpenAI's Whisper model in C/C++

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Megatron-LM

Ongoing research training transformer models at scale

awesome-chatgpt-prompts

This repo includes ChatGPT prompt curation to use ChatGPT and other LLM tools better.

pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

flash-attention

Fast and memory-efficient exact attention

llama_index

LlamaIndex is the leading framework for building LLM-powered agents over your data.

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.