Trending

Content tagged with "transformers"

Hacker News

Top stories from the Hacker News community • Updated 13 minutes ago

InfoQ

Latest articles from InfoQ • Updated 4 minutes ago

Transformers v5 Introduces a More Modular and Interoperable Core

Hugging Face has released the first release candidate of Transformers v5, a significant evolution from v4, released five years ago. The library has grown from a specialized model toolkit into a critical resource in AI development, with more than three million installations per day and over 1.2 billion total installs. By Robert Krzaczyński

infoq.com
Robert Krzaczyński
3 days ago

Meta's Optimization Platform Ax 1.0 Streamlines LLM and System Optimization

Now stable, Ax is an open-source platform from Meta designed to help researchers and engineers apply machine learning to complex, resource-intensive experimentation. Over the past several years, Meta has used Ax to improve AI models, accelerate machine learning research, tune production infrastructure, and more. By Sergio De Simone

infoq.com

OpenAI's New GPT-5.1 Models are Faster and More Conversational

OpenAI recently released upgrades to its GPT-5 model. GPT-5.1 Instant, the default chat model, has improved instruction following. GPT-5.1 Thinking, the reasoning model, is faster and gives more understandable responses. GPT-5.1-Codex-Max, the coding model, is trained to use compaction to perform long-running tasks. By Anthony Alford

infoq.com
Anthony Alford
10 days ago

Reddit

Top posts from tech subreddits • Updated 4 minutes ago

Google's Gemma models family

i.redd.it

New Google model incoming!!!

i.redd.it

Hugging Face Trending

Popular models from Hugging Face • Updated 40 minutes ago

GitHub Trending

Popular repositories from GitHub • Updated about 1 hour ago

pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

flash-attention

Fast and memory-efficient exact attention

unstructured

Convert documents to structured data effortlessly. Unstructured is an open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise-grade Platform product for production-grade workflows, partitioning, enrichments, chunking, and embedding.

yolov7

Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors

Promptify

Prompt Engineering | Prompt Versioning | Use GPT or other prompt-based models to get structured output. Join our Discord for prompt engineering, LLMs, and other recent research

Jupyter Notebook
4,200
333

optimum

🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM, and Sentence Transformers with easy-to-use hardware optimization tools

txtai

💡 All-in-one open-source AI framework for semantic search, LLM orchestration and language model workflows

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

vit-pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch