SOTAVerified

Object Categorization

Object categorization identifies which label, from a given set, best corresponds to an image region defined by an input image and bounding box.
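
The task definition above can be sketched in code. This is a minimal illustration of the input/output contract only, not any listed paper's method: the names `crop`, `categorize`, `toy_score`, and `PROTOTYPES` are hypothetical, the image is assumed to be a grayscale grid of floats, and the scoring function is a toy stand-in for a real model.

```python
from typing import Callable, Dict, List, Sequence, Tuple

Image = Sequence[Sequence[float]]  # assumed: grayscale image as rows of pixel values
Box = Tuple[int, int, int, int]    # assumed: (x_min, y_min, x_max, y_max), max exclusive


def crop(image: Image, box: Box) -> List[List[float]]:
    """Return the image region inside the bounding box."""
    x_min, y_min, x_max, y_max = box
    return [list(row[x_min:x_max]) for row in image[y_min:y_max]]


def categorize(
    image: Image,
    box: Box,
    labels: Sequence[str],
    score: Callable[[List[List[float]], str], float],
) -> str:
    """Pick the label from `labels` that scores highest for the boxed region."""
    region = crop(image, box)
    return max(labels, key=lambda label: score(region, label))


# Hypothetical toy scorer: each label has a prototype mean intensity, and a
# label scores higher the closer the region's mean is to that prototype.
PROTOTYPES: Dict[str, float] = {"dark": 0.0, "bright": 1.0}


def toy_score(region: List[List[float]], label: str) -> float:
    pixels = [p for row in region for p in row]
    mean = sum(pixels) / len(pixels)
    return -abs(mean - PROTOTYPES[label])


# A 2x4 image whose left half is dark and right half is bright.
image = [
    [0.0, 0.0, 1.0, 1.0],
    [0.0, 0.0, 1.0, 1.0],
]
left_label = categorize(image, (0, 0, 2, 2), ["dark", "bright"], toy_score)
right_label = categorize(image, (2, 0, 4, 2), ["dark", "bright"], toy_score)
```

In a real system the `score` argument would be replaced by a trained model's per-label score for the cropped region; the label set and the bounding box remain inputs, as in the definition.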

Papers

Showing 1-25 of 80 papers

Title | Status | Hype
Learning Transferable Visual Models From Natural Language Supervision | Code | 2
Roboflow 100: A Rich, Multi-Domain Object Detection Benchmark | Code | 2
GRIT: General Robust Image Task Benchmark | Code | 1
SegNet: A Deep Convolutional Encoder-Decoder Architecture for Robust Semantic Pixel-Wise Labelling | Code | 1
Learning Physical Graph Representations from Visual Scenes | Code | 1
Unsupervised Domain Adaptation through Inter-modal Rotation for RGB-D Object Recognition | Code | 0
Improved object recognition using neural networks trained to mimic the brain's statistical properties | Code | 0
Vision CNNs trained to estimate spatial latents learned similar ventral-stream-aligned representations | Code | 0
Learning robust visual representations using data augmentation invariance | Code | 0
Learning Deep Visual Object Models From Noisy Web Data: How to Make it Work | Code | 0
RotationNet: Joint Object Categorization and Pose Estimation Using Multiviews from Unsupervised Viewpoints | Code | 0
Systematic evaluation of CNN advances on the ImageNet | Code | 0
OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework | Code | 0
Brain-Like Object Recognition with High-Performing Shallow Recurrent ANNs | Code | 0
Part-Aware Fine-grained Object Categorization using Weakly Supervised Part Detection Network | Code | 0
Collaborative Receptive Field Learning | Code | 0
Data augmentation instead of explicit regularization | Code | 0
Deep Learning Human Mind for Automated Visual Classification | Code | 0
Category-orthogonal object features guide information processing in recurrent neural networks trained for object categorization | Code | 0
IAUnet: Global Context-Aware Feature Learning for Person Re-Identification | Code | 0
Are we done with object recognition? The iCub robot's perspective | Code | 0
Cost-Effective Active Learning for Deep Image Classification | Code | 0
Enhancing Fine-Grained 3D Object Recognition using Hybrid Multi-Modal Vision Transformer-CNN Models | Code | 0
Recurrent Convolutional Fusion for RGB-D Object Recognition | Code | 0
Vocabulary-informed Zero-shot and Open-set Learning | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Unified-IO XL | Categorization (ablation) | 61.7 | | Unverified
2 | GPV-2 | Categorization (ablation) | 54.7 | | Unverified
3 | CLIP | Categorization (ablation) | 48.1 | | Unverified
4 | OFA_Large | Categorization (ablation) | 22.6 | | Unverified