SOTAVerified

Object

Replace the cat with a British Shorthair cat of the breed with bulging yellow eyes

Papers

Showing 25512575 of 10696 papers

TitleStatusHype
Feature-Based Lie Group Transformer for Real-World Applications0
From Objects to Anywhere: A Holistic Benchmark for Multi-level Visual Grounding in 3D Scenes0
Gen-n-Val: Agentic Image Data Generation and Validation0
CIVET: Systematic Evaluation of Understanding in VLMs0
Light and 3D: a methodological exploration of digitisation techniques adapted to a selection of objects from the Musée d'Archéologie Nationale0
Direct Numerical Layout Generation for 3D Indoor Scene Synthesis via Spatial Reasoning0
Object-X: Learning to Reconstruct Multi-Modal 3D Object Representations0
RaySt3R: Predicting Novel Depth Maps for Zero-Shot Object Completion0
EOC-Bench: Can MLLMs Identify, Recall, and Forecast Objects in an Egocentric World?0
Rex-Thinker: Grounded Object Referring via Chain-of-Thought Reasoning0
SemNav: A Model-Based Planner for Zero-Shot Object Goal Navigation Using Vision-Foundation Models0
MambaNeXt-YOLO: A Hybrid State Space Model for Real-time Object Detection0
Sounding that Object: Interactive Object-Aware Image to Audio Generation0
Tru-POMDP: Task Planning Under Uncertainty via Tree of Hypotheses and Open-Ended POMDPs0
InterRVOS: Interaction-aware Referring Video Object Segmentation0
ReSpace: Text-Driven 3D Scene Synthesis and Editing with Preference Alignment0
unMORE: Unsupervised Multi-Object Segmentation via Center-Boundary ReasoningCode0
WoMAP: World Models For Embodied Open-Vocabulary Object Localization0
SORCE: Small Object Retrieval in Complex EnvironmentsCode0
Out of Sight, Not Out of Context? Egocentric Spatial Reasoning in VLMs Across Disjoint Frames0
ComposeAnything: Composite Object Priors for Text-to-Image Generation0
DexMachina: Functional Retargeting for Bimanual Dexterous Manipulation0
InteractAnything: Zero-shot Human Object Interaction Synthesis via LLM Feedback and Object Affordance Parsing0
Object Centric Concept Bottlenecks0
Disrupting Vision-Language Model-Driven Navigation Services via Adversarial Object Fusion0
Show:102550
← PrevPage 103 of 428Next →

No leaderboard results yet.