SOTAVerified

Object Categorization

Object categorization identifies which label, from a given set, best corresponds to an image region defined by an input image and bounding box.

Papers

Showing 125 of 80 papers

TitleStatusHype
Roboflow 100: A Rich, Multi-Domain Object Detection BenchmarkCode2
Learning Transferable Visual Models From Natural Language SupervisionCode2
GRIT: General Robust Image Task BenchmarkCode1
Learning Physical Graph Representations from Visual ScenesCode1
SegNet: A Deep Convolutional Encoder-Decoder Architecture for Robust Semantic Pixel-Wise LabellingCode1
Vision CNNs trained to estimate spatial latents learned similar ventral-stream-aligned representationsCode0
Divide and Conquer: Improving Multi-Camera 3D Perception with 2D Semantic-Depth Priors and Input-Dependent Queries0
Comparing Apples to Oranges: LLM-powered Multimodal Intention Prediction in an Object Categorization Task0
Adversarial alignment: Breaking the trade-off between the strength of an attack and its relevance to human perception0
Towards Reliable Assessments of Demographic Disparities in Multi-Label Image Classifiers0
Vocabulary-informed Zero-shot and Open-set LearningCode0
Enhancing Fine-Grained 3D Object Recognition using Hybrid Multi-Modal Vision Transformer-CNN ModelsCode0
Unified-IO: A Unified Model for Vision, Language, and Multi-Modal Tasks0
OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning FrameworkCode0
Webly Supervised Concept Expansion for General Purpose Vision Models0
Category-orthogonal object features guide information processing in recurrent neural networks trained for object categorizationCode0
Open-Ended Fine-Grained 3D Object Categorization by Combining Shape and Texture Features in Multiple Colorspaces0
IAUnet: Global Context-Aware Feature Learning for Person Re-IdentificationCode0
Local-HDP: Interactive Open-Ended 3D Object Categorization in Real-Time Robotic Scenarios0
Unsupervised Domain Adaptation through Inter-modal Rotation for RGB-D Object RecognitionCode0
Brain-Like Object Recognition with High-Performing Shallow Recurrent ANNsCode0
Multiple Riemannian Manifold-valued Descriptors based Image Set Classification with Multi-Kernel Metric Learning0
Look Further to Recognize Better: Learning Shared Topics and Category-Specific Dictionaries for Open-Ended 3D Object Recognition0
Visualizing Representational Dynamics with Multidimensional Scaling Alignment0
Learning robust visual representations using data augmentation invarianceCode0
Show:102550
← PrevPage 1 of 4Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Unified-IOXLCategorization (ablation)61.7Unverified
2GPV-2Categorization (ablation)54.7Unverified
3CLIPCategorization (ablation)48.1Unverified
4OFA_LargeCategorization (ablation)22.6Unverified