SOTAVerified

Object Categorization

Object categorization selects, from a given set of labels, the one that best describes the image region specified by an input image and a bounding box.
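The task interface described above can be sketched as follows. This is a minimal illustration, not any listed paper's method: the names `crop`, `categorize_region`, and `brightness_score`, the grayscale nested-list image format, and the `(x_min, y_min, x_max, y_max)` box convention are all assumptions made for this example. A real system would replace the toy scoring function with a learned model.

```python
from typing import Callable, List, Sequence, Tuple

Image = Sequence[Sequence[float]]   # assumed format: grayscale rows of pixel values
Box = Tuple[int, int, int, int]     # assumed format: (x_min, y_min, x_max, y_max), max exclusive
Region = List[List[float]]


def crop(image: Image, box: Box) -> Region:
    """Return the pixels that fall inside the bounding box."""
    x0, y0, x1, y1 = box
    return [list(row[x0:x1]) for row in image[y0:y1]]


def categorize_region(
    image: Image,
    box: Box,
    labels: Sequence[str],
    score_fn: Callable[[Region, str], float],
) -> str:
    """Pick the label from `labels` whose score for the boxed region is highest."""
    region = crop(image, box)
    if not region or not region[0]:
        raise ValueError("bounding box selects an empty region")
    return max(labels, key=lambda label: score_fn(region, label))


# Hypothetical stand-in for a learned model: each label has a prototype
# mean brightness, and the score is the negative distance to it.
PROTOTYPES = {"dark": 0.1, "bright": 0.9}


def brightness_score(region: Region, label: str) -> float:
    pixels = [p for row in region for p in row]
    mean = sum(pixels) / len(pixels)
    return -abs(mean - PROTOTYPES[label])


image = [
    [0.0, 0.0, 0.9, 1.0],
    [0.0, 0.1, 1.0, 0.9],
]
labels = ["dark", "bright"]

print(categorize_region(image, (2, 0, 4, 2), labels, brightness_score))  # bright
print(categorize_region(image, (0, 0, 2, 2), labels, brightness_score))  # dark
```

Passing the scoring function as a parameter keeps the task definition (region in, one label from a fixed set out) separate from whichever model produces the scores.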

Papers

Showing 1-25 of 80 papers

| Title | Status | Hype |
|---|---|---|
| Vision CNNs trained to estimate spatial latents learned similar ventral-stream-aligned representations | Code | 0 |
| Divide and Conquer: Improving Multi-Camera 3D Perception with 2D Semantic-Depth Priors and Input-Dependent Queries | | 0 |
| Comparing Apples to Oranges: LLM-powered Multimodal Intention Prediction in an Object Categorization Task | | 0 |
| Adversarial alignment: Breaking the trade-off between the strength of an attack and its relevance to human perception | | 0 |
| Towards Reliable Assessments of Demographic Disparities in Multi-Label Image Classifiers | | 0 |
| Vocabulary-informed Zero-shot and Open-set Learning | Code | 0 |
| Roboflow 100: A Rich, Multi-Domain Object Detection Benchmark | Code | 2 |
| Enhancing Fine-Grained 3D Object Recognition using Hybrid Multi-Modal Vision Transformer-CNN Models | Code | 0 |
| Unified-IO: A Unified Model for Vision, Language, and Multi-Modal Tasks | | 0 |
| GRIT: General Robust Image Task Benchmark | Code | 1 |
| OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework | Code | 0 |
| Webly Supervised Concept Expansion for General Purpose Vision Models | | 0 |
| Category-orthogonal object features guide information processing in recurrent neural networks trained for object categorization | Code | 0 |
| Learning Transferable Visual Models From Natural Language Supervision | Code | 2 |
| Open-Ended Fine-Grained 3D Object Categorization by Combining Shape and Texture Features in Multiple Colorspaces | | 0 |
| IAUnet: Global Context-Aware Feature Learning for Person Re-Identification | Code | 0 |
| Local-HDP: Interactive Open-Ended 3D Object Categorization in Real-Time Robotic Scenarios | | 0 |
| Learning Physical Graph Representations from Visual Scenes | Code | 1 |
| Unsupervised Domain Adaptation through Inter-modal Rotation for RGB-D Object Recognition | Code | 0 |
| Brain-Like Object Recognition with High-Performing Shallow Recurrent ANNs | Code | 0 |
| Multiple Riemannian Manifold-valued Descriptors based Image Set Classification with Multi-Kernel Metric Learning | | 0 |
| Look Further to Recognize Better: Learning Shared Topics and Category-Specific Dictionaries for Open-Ended 3D Object Recognition | | 0 |
| Visualizing Representational Dynamics with Multidimensional Scaling Alignment | | 0 |
| Learning robust visual representations using data augmentation invariance | Code | 0 |
| Improved object recognition using neural networks trained to mimic the brain's statistical properties | Code | 0 |

Benchmark Results

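| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Unified-IO_XL | Categorization (ablation) | 61.7 | | Unverified |
| 2 | GPV-2 | Categorization (ablation) | 54.7 | | Unverified |
| 3 | CLIP | Categorization (ablation) | 48.1 | | Unverified |
| 4 | OFA_Large | Categorization (ablation) | 22.6 | | Unverified |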