SOTAVerified

Object

Replace the cat with a British Shorthair cat of the breed with bulging yellow eyes

Papers

Showing 25512600 of 10696 papers

TitleStatusHype
RaySt3R: Predicting Novel Depth Maps for Zero-Shot Object Completion0
Gen-n-Val: Agentic Image Data Generation and Validation0
Light and 3D: a methodological exploration of digitisation techniques adapted to a selection of objects from the Musée d'Archéologie Nationale0
Object-X: Learning to Reconstruct Multi-Modal 3D Object Representations0
CIVET: Systematic Evaluation of Understanding in VLMs0
Direct Numerical Layout Generation for 3D Indoor Scene Synthesis via Spatial Reasoning0
Feature-Based Lie Group Transformer for Real-World Applications0
From Objects to Anywhere: A Holistic Benchmark for Multi-level Visual Grounding in 3D Scenes0
EOC-Bench: Can MLLMs Identify, Recall, and Forecast Objects in an Egocentric World?0
Rex-Thinker: Grounded Object Referring via Chain-of-Thought Reasoning0
MambaNeXt-YOLO: A Hybrid State Space Model for Real-time Object Detection0
SemNav: A Model-Based Planner for Zero-Shot Object Goal Navigation Using Vision-Foundation Models0
Sounding that Object: Interactive Object-Aware Image to Audio Generation0
Tru-POMDP: Task Planning Under Uncertainty via Tree of Hypotheses and Open-Ended POMDPs0
InterRVOS: Interaction-aware Referring Video Object Segmentation0
ReSpace: Text-Driven 3D Scene Synthesis and Editing with Preference Alignment0
WoMAP: World Models For Embodied Open-Vocabulary Object Localization0
unMORE: Unsupervised Multi-Object Segmentation via Center-Boundary ReasoningCode0
SORCE: Small Object Retrieval in Complex EnvironmentsCode0
ComposeAnything: Composite Object Priors for Text-to-Image Generation0
InteractAnything: Zero-shot Human Object Interaction Synthesis via LLM Feedback and Object Affordance Parsing0
Object Centric Concept Bottlenecks0
Out of Sight, Not Out of Context? Egocentric Spatial Reasoning in VLMs Across Disjoint Frames0
DexMachina: Functional Retargeting for Bimanual Dexterous Manipulation0
Conformal Object Detection by Sequential Risk Control0
Disrupting Vision-Language Model-Driven Navigation Services via Adversarial Object Fusion0
Language-guided Learning for Object Detection Tackling Multiple Variations in Aerial Images0
Rooms from Motion: Un-posed Indoor 3D Object Detection as Localization and Mapping0
MOVi: Training-free Text-conditioned Multi-Object Video Generation0
FMG-Det: Foundation Model Guided Robust Object Detection0
The Meeseeks Mesh: Spatially Consistent 3D Adversarial Objects for BEV Detector0
Right Side Up? Disentangling Orientation Understanding in MLLMs with Fine-grained Multi-axis Perception Tasks0
CoDA: Coordinated Diffusion Noise Optimization for Whole-Body Manipulation of Articulated Objects0
PartInstruct: Part-level Instruction Following for Fine-grained Robot Manipulation0
Progressive Scaling Visual Object Tracking0
Causal-LLaVA: Causal Disentanglement for Mitigating Hallucination in Multimodal Large Language ModelsCode0
Category-Agnostic Neural Object Rigging0
NEXT: Multi-Grained Mixture of Experts via Text-Modulation for Multi-Modal Object Re-ID0
MaskedManipulator: Versatile Whole-Body Control for Loco-Manipulation0
FusionTrack: End-to-End Multi-Object Tracking in Arbitrary Multi-View Environment0
ThinkVideo: High-Quality Reasoning Video Segmentation with Chain of ThoughtsCode0
EOTNet: Deep Memory Aided Bayesian Filter for Extended Object TrackingCode0
SD-OVON: A Semantics-aware Dataset and Benchmark Generation Pipeline for Open-Vocabulary Object Navigation in Dynamic Scenes0
Sampling Strategies for Efficient Training of Deep Learning Object Detection Algorithms0
Adapting SAM 2 for Visual Object Tracking: 1st Place Solution for MMVPR Challenge Multi-Modal Tracking0
RQR3D: Reparametrizing the regression targets for BEV-based 3D object detection0
Semantic Compression of 3D Objects for Open and Collaborative Virtual Worlds0
TextureSAM: Towards a Texture Aware Foundation Model for Segmentation0
MAFE R-CNN: Selecting More Samples to Learn Category-aware Features for Small Object Detection0
Embodied Agents Meet Personalization: Exploring Memory Utilization for Personalized Assistance0
Show:102550
← PrevPage 52 of 214Next →

No leaderboard results yet.