Trending

Content tagged with "transformers"

transformers

Hacker News

Top stories from the Hacker News community• Updated 4 minutes ago

HN

Qwen3 30B-A3B

huggingface.co
HN

Playing with Open Source LLMs

alicegg.tech
48
35
zer0tonin
3 months ago
HN

Viral Language

lareviewofbooks.org
37
6
lermontov
3 months ago
HN

I am a SOTA 0-shot classifier of your slop

christopherkrapu.com
48
36
ckrapu
3 months ago
HN

Qwen3-235B-A22B-Thinking-2507

huggingface.co
130
40
tosh
3 months ago
34
3
kaycebasques
3 months ago

Reddit

Top posts from tech subreddits• Updated 4 minutes ago

Hugging Face Trending

Popular models from Hugging Face• Updated about 1 hour ago

Qwen3-VL-8B-Instruct

Task: image-text-to-text

Qwen3-VL-8B-Thinking

Task: image-text-to-text

GLM-4.6

Task: text-generation

UserLM-8b

Task: text-generation

Arch-Router-1.5B

Task: text-generation

Schematron-3B

Task: text-generation

GitHub Trending

Popular repositories from GitHub• Updated less than a minute ago

flash-attention

Fast and memory-efficient exact attention

llama_index

LlamaIndex is the leading framework for building LLM-powered agents over your data.

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Megatron-LM

Ongoing research training transformer models at scale

unstructured

Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise grade Platform product for production grade workflows, partitioning, enrichments, chunking and embedding.

pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

whisper.cpp

Port of OpenAI's Whisper model in C/C++

yolov7

Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors