Trending

Content tagged with "computer-vision"

Hacker News

Top stories from the Hacker News community • Updated 7 minutes ago

Reddit

Top posts from tech subreddits • Updated 4 minutes ago

[R] Swapping image encoder in VLM

Posted by Amazing_NickName on reddit.com, about 2 months ago

Hugging Face Trending

Popular models from Hugging Face • Updated 4 minutes ago

Hunyuan3D-2.1

Task: image-to-3d

MonkeyOCR

Task: visual-document-retrieval

vjepa2-vitl-fpc64-256

Task: video-classification

GitHub Trending

Popular repositories from GitHub • Updated 18 minutes ago

pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Includes train, eval, inference, and export scripts, plus pretrained weights: ResNet, ResNeXt, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more.

yolov5

YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

mmpose

OpenMMLab Pose Estimation Toolbox and Benchmark.

yolov7

Implementation of the paper "YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors"

insightface

State-of-the-art 2D and 3D Face Analysis Project

boxmot

BoxMOT: pluggable SOTA tracking modules for segmentation, object detection and pose estimation models

Swin-Transformer

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
