SOTAVerified

Object

Replace the cat with a British Shorthair cat of the breed with bulging yellow eyes

Papers

Showing 51100 of 10696 papers

TitleStatusHype
Orientation Matters: Making 3D Generative Models Orientation-Aligned0
Hierarchical Neural Collapse Detection Transformer for Class Incremental Object Detection0
UA-Pose: Uncertainty-Aware 6D Object Pose Estimation and Online Object Completion with Partial References0
MapBERT: Bitwise Masked Modeling for Real-Time Semantic Mapping Generation0
SAM2Auto: Auto Annotation Using FLASH0
Domain Randomization for Object Detection in Manufacturing Applications using Synthetic Data: A Comprehensive StudyCode0
R3D2: Realistic 3D Asset Insertion via Diffusion for Autonomous Driving Simulation0
Multiple Object Stitching for Unsupervised Representation LearningCode1
HOI-PAGE: Zero-Shot Human-Object Interaction Generation with Part Affordance Guidance0
Object Navigation with Structure-Semantic Reasoning-Based Multi-level Map and Multimodal Decision-Making LLM0
Edge-Enabled Collaborative Object Detection for Real-Time Multi-Vehicle PerceptionCode0
EOC-Bench: Can MLLMs Identify, Recall, and Forecast Objects in an Egocentric World?0
Direct Numerical Layout Generation for 3D Indoor Scene Synthesis via Spatial Reasoning0
CIVET: Systematic Evaluation of Understanding in VLMs0
RaySt3R: Predicting Novel Depth Maps for Zero-Shot Object Completion0
Gen-n-Val: Agentic Image Data Generation and Validation0
Light and 3D: a methodological exploration of digitisation techniques adapted to a selection of objects from the Musée d'Archéologie Nationale0
Feature-Based Lie Group Transformer for Real-World Applications0
From Objects to Anywhere: A Holistic Benchmark for Multi-level Visual Grounding in 3D Scenes0
Object-X: Learning to Reconstruct Multi-Modal 3D Object Representations0
Rex-Thinker: Grounded Object Referring via Chain-of-Thought Reasoning0
MambaNeXt-YOLO: A Hybrid State Space Model for Real-time Object Detection0
Sounding that Object: Interactive Object-Aware Image to Audio Generation0
SemNav: A Model-Based Planner for Zero-Shot Object Goal Navigation Using Vision-Foundation Models0
ReSpace: Text-Driven 3D Scene Synthesis and Editing with Preference Alignment0
Tru-POMDP: Task Planning Under Uncertainty via Tree of Hypotheses and Open-Ended POMDPs0
InterRVOS: Interaction-aware Referring Video Object Segmentation0
unMORE: Unsupervised Multi-Object Segmentation via Center-Boundary ReasoningCode0
WoMAP: World Models For Embodied Open-Vocabulary Object Localization0
ComposeAnything: Composite Object Priors for Text-to-Image Generation0
SORCE: Small Object Retrieval in Complex EnvironmentsCode0
InteractAnything: Zero-shot Human Object Interaction Synthesis via LLM Feedback and Object Affordance Parsing0
Out of Sight, Not Out of Context? Egocentric Spatial Reasoning in VLMs Across Disjoint Frames0
Object Centric Concept Bottlenecks0
DexMachina: Functional Retargeting for Bimanual Dexterous Manipulation0
Conformal Object Detection by Sequential Risk Control0
FMG-Det: Foundation Model Guided Robust Object Detection0
Rooms from Motion: Un-posed Indoor 3D Object Detection as Localization and Mapping0
MOVi: Training-free Text-conditioned Multi-Object Video Generation0
Language-guided Learning for Object Detection Tackling Multiple Variations in Aerial Images0
Disrupting Vision-Language Model-Driven Navigation Services via Adversarial Object Fusion0
The Meeseeks Mesh: Spatially Consistent 3D Adversarial Objects for BEV Detector0
LPOI: Listwise Preference Optimization for Vision Language ModelsCode1
Right Side Up? Disentangling Orientation Understanding in MLLMs with Fine-grained Multi-axis Perception Tasks0
PartInstruct: Part-level Instruction Following for Fine-grained Robot Manipulation0
CoDA: Coordinated Diffusion Noise Optimization for Whole-Body Manipulation of Articulated Objects0
ReaMOT: A Benchmark and Framework for Reasoning-based Multi-Object TrackingCode1
Causal-LLaVA: Causal Disentanglement for Mitigating Hallucination in Multimodal Large Language ModelsCode0
Progressive Scaling Visual Object Tracking0
Category-Agnostic Neural Object Rigging0
Show:102550
← PrevPage 2 of 214Next →

No leaderboard results yet.