Trending

Content tagged with "computer-vision"

computer-vision

Hacker News

Top stories from the Hacker News community• Updated 11 minutes ago

Reddit

Top posts from tech subreddits• Updated 11 minutes ago

Reddit

Vision Language Models are Biased

vlmsarebiased.github.io

Hugging Face Trending

Popular models from Hugging Face• Updated about 1 hour ago

Hunyuan3D-2.1

Task: image-to-3d

MonkeyOCR

Task: visual-document-retrieval

vjepa2-vitl-fpc64-256

Task: video-classification

GitHub Trending

Popular repositories from GitHub• Updated 7 minutes ago

pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

yolov5

YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

mmpose

OpenMMLab Pose Estimation Toolbox and Benchmark.

yolov7

Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors

insightface

State-of-the-art 2D and 3D Face Analysis Project

boxmot

BoxMOT: pluggable SOTA tracking modules for segmentation, object detection and pose estimation models

Swin-Transformer

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

2