Trending
Content tagged with "computer-vision"
Hacker News
Top stories from the Hacker News community
No stories found
Try removing the tag filter or searching for different content.
Top posts from tech subreddits• Updated 8 minutes ago
[P] Has anyone worked with CNNs and geo-spatial data? How do you deal with edge cases and Null/No Data values in CNNs?
nanoVLM: A minimal Vision-Language Model with a LLaMA-style decoder — now open source
Hugging Face Trending
Popular models from Hugging Face• Updated 26 minutes ago
GitHub Trending
Popular repositories from GitHub• Updated 40 minutes ago
vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
lama
🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022
boxmot
BoxMOT: pluggable SOTA tracking modules for segmentation, object detection and pose estimation models