Trending

Content tagged with "transformers"

Hacker News

Top stories from the Hacker News community • Updated 14 minutes ago

Sora 2

openai.com

Claude Sonnet 4.5

anthropic.com
1502
746
adocomplete
2 months ago
11
surprisetalk
2 months ago

The QMA Singularity

scottaaronson.blog
73
28
frozenseven
2 months ago
48
36
pseudolus
2 months ago

Video models are zero-shot learners and reasoners

video-zero-shot.github.io
65
4
meetpateltech
3 months ago

Reddit

Top posts from tech subreddits • Updated 8 minutes ago

115
11
paf1138
about 5 hours ago

zai-org/GLM-4.6V-Flash (9B) is here

reddit.com
237
39
Cute-Sprinkles4911
1 day ago

Hugging Face Trending

Popular models from Hugging Face • Updated 26 minutes ago

GitHub Trending

Popular repositories from GitHub • Updated 40 minutes ago

whisper.cpp

Port of OpenAI's Whisper model in C/C++

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Megatron-LM

Ongoing research training transformer models at scale

awesome-chatgpt-prompts

This repo includes ChatGPT prompt curation to use ChatGPT and other LLM tools better.

pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

flash-attention

Fast and memory-efficient exact attention

llama_index

LlamaIndex is the leading framework for building LLM-powered agents over your data.

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.