Trending

Content tagged with "transformers"

Hacker News

Top stories from the Hacker News community • Updated 8 minutes ago

Reddit

Top posts from tech subreddits • Updated 2 minutes ago

Llama5 is cancelled long live llama

i.redd.it
324
67
SelectionCalm70
17 days ago

introducing tangled

blog.tangled.org

Qwen3 VL 4B to be released?

i.redd.it
164
16
Signal-Run7450
19 days ago

Hugging Face Trending

Popular models from Hugging Face • Updated 21 minutes ago

Qwen3-VL-2B-Instruct

Task: image-text-to-text

Qwen3-VL-8B-Instruct

Task: image-text-to-text

Qwen3-VL-32B-Instruct

Task: image-text-to-text

pokee_research_7b

Task: text-generation

GLM-4.6

Task: text-generation

Qwen3-VL-2B-Thinking

Task: image-text-to-text

Llama-3.1-8B-Instruct

Task: text-generation

gpt-oss-120b

Task: text-generation

GitHub Trending

Popular repositories from GitHub • Updated 35 minutes ago

transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

awesome-chatgpt-prompts

This repo includes ChatGPT prompt curation to use ChatGPT and other LLM tools better.

Megatron-LM

Ongoing research training transformer models at scale

flash-attention

Fast and memory-efficient exact attention

llama_index

LlamaIndex is the leading framework for building LLM-powered agents over your data.

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

unstructured

Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise grade Platform product for production grade workflows, partitioning, enrichments, chunking and embedding.

pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

whisper.cpp

Port of OpenAI's Whisper model in C/C++