Trending

Content tagged with "transformers"

transformers

Hacker News

Top stories from the Hacker News community• Updated 11 minutes ago

InfoQ

Latest articles from InfoQ• Updated 5 minutes ago

InfoQ

OpenAI's New GPT-5.1 Models are Faster and More Conversational

OpenAI recently released upgrades to their GPT-5 model. GPT‑5.1 Instant, the default chat model, has improvements to instruction following. GPT‑5.1 Thinking, the reasoning model, is faster and gives more understandable responses. GPT‑5.1-Codex-Max, the coding model, is trained to use compaction to perform long-running tasks. By Anthony Alford

infoq.com
Anthony Alford
1 day ago

Reddit

Top posts from tech subreddits• Updated 5 minutes ago

Hugging Face Trending

Popular models from Hugging Face• Updated 23 minutes ago

GitHub Trending

Popular repositories from GitHub• Updated 37 minutes ago

whisper.cpp

Port of OpenAI's Whisper model in C/C++

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Megatron-LM

Ongoing research training transformer models at scale

awesome-chatgpt-prompts

This repo includes ChatGPT prompt curation to use ChatGPT and other LLM tools better.

pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

flash-attention

Fast and memory-efficient exact attention

llama_index

LlamaIndex is the leading framework for building LLM-powered agents over your data.

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.