Trending

Content tagged with "transformers"

transformers

Hacker News

Top stories from the Hacker News community• Updated less than a minute ago

InfoQ

Latest articles from InfoQ• Updated 1 minute ago

InfoQ

OpenAI's New GPT-5.1 Models are Faster and More Conversational

OpenAI recently released upgrades to their GPT-5 model. GPT‑5.1 Instant, the default chat model, has improvements to instruction following. GPT‑5.1 Thinking, the reasoning model, is faster and gives more understandable responses. GPT‑5.1-Codex-Max, the coding model, is trained to use compaction to perform long-running tasks. By Anthony Alford

infoq.com
Anthony Alford
2 days ago

Reddit

Top posts from tech subreddits• Updated less than a minute ago

Hugging Face Trending

Popular models from Hugging Face• Updated 43 minutes ago

GitHub Trending

Popular repositories from GitHub• Updated about 1 hour ago

whisper.cpp

Port of OpenAI's Whisper model in C/C++

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Megatron-LM

Ongoing research training transformer models at scale

awesome-chatgpt-prompts

This repo includes ChatGPT prompt curation to use ChatGPT and other LLM tools better.

pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

flash-attention

Fast and memory-efficient exact attention

llama_index

LlamaIndex is the leading framework for building LLM-powered agents over your data.

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.