SOTAVerified

Object

Replace the cat with a British Shorthair cat with bulging yellow eyes

Papers

Showing 1751–1800 of 10696 papers

Title | Status | Hype
Learning to Visually Localize Sound Sources from Mixtures without Prior Source Knowledge | Code | 1
Comp4D: LLM-Guided Compositional 4D Scene Generation | - | 0
Exploiting Priors from 3D Diffusion Models for RGB-Based One-Shot View Planning | Code | 0
Co-Occurring of Object Detection and Identification towards unlabeled object discovery | - | 0
ASDF: Assembly State Detection Utilizing Late Fusion by Integrating 6D Pose Estimation | Code | 0
RCBEVDet: Radar-camera Fusion in Bird's Eye View for 3D Object Detection | Code | 3
Elysium: Exploring Object-level Perception in Videos via MLLM | Code | 2
V2X-PC: Vehicle-to-everything Collaborative Perception via Point Cluster | - | 0
Data-Efficient 3D Visual Grounding via Order-Aware Referring | - | 0
Benchmarks and Challenges in Pose Estimation for Egocentric Hand Interactions with Objects | Code | 3
Multiple Object Tracking as ID Prediction | Code | 3
DOCTR: Disentangled Object-Centric Transformer for Point Scene Understanding | Code | 0
Toward Open-Set Human Object Interaction Detection | Code | 0
Cross-domain Multi-modal Few-shot Object Detection via Rich Text | Code | 0
Realtime Robust Shape Estimation of Deformable Linear Object | - | 0
Fusion of Active and Passive Measurements for Robust and Scalable Positioning | - | 0
Inverse Rendering of Glossy Objects via the Neural Plenoptic Function and Radiance Fields | - | 0
Object Detectors in the Open Environment: Challenges, Solutions, and Outlook | Code | 1
Gaze-guided Hand-Object Interaction Synthesis: Dataset and Method | - | 0
Towards Two-Stream Foveation-based Active Vision Learning | - | 0
Temporal-Spatial Object Relations Modeling for Vision-and-Language Navigation | - | 0
SUP-NeRF: A Streamlined Unification of Pose Estimation and NeRF for Monocular 3D Object Reconstruction | Code | 1
PNAS-MOT: Multi-Modal Object Tracking with Pareto Neural Architecture Search | Code | 1
Inpainting-Driven Mask Optimization for Object Removal | - | 0
InterFusion: Text-Driven Generation of 3D Human-Object Interaction | Code | 2
Reasoning-Enhanced Object-Centric Learning for Videos | - | 0
SFOD: Spiking Fusion Object Detector | Code | 1
VRSO: Visual-Centric Reconstruction for Static Object Annotation | Code | 1
Pose-Aware Self-Supervised Learning with Viewpoint Trajectory Regularization | Code | 0
Survey on Modeling of Human-made Articulated Objects | - | 0
PseudoTouch: Efficiently Imaging the Surface Feel of Objects for Robotic Manipulation | - | 0
Zero-Shot Multi-Object Scene Completion | - | 0
External Knowledge Enhanced 3D Scene Generation from Sketch | - | 0
VAPO: Visibility-Aware Keypoint Localization for Efficient 6DoF Object Pose Estimation | - | 0
Leveraging Large Language Model-based Room-Object Relationships Knowledge for Enhancing Multimodal-Input Object Goal Navigation | - | 0
Robust 3D Shape Reconstruction in Zero-Shot from a Single Image in the Wild | - | 0
T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy | Code | 7
Scene-Graph ViT: End-to-End Open-Vocabulary Visual Relationship Detection | - | 0
3D Object Detection from Point Cloud via Voting Step Diffusion | Code | 0
EC-IoU: Orienting Safety for Object Detectors via Ego-Centric Intersection-over-Union | - | 0
DVMNet++: Rethinking Relative Pose Estimation for Unseen Objects | Code | 1
MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining | Code | 3
Few-shot Oriented Object Detection with Memorable Contrastive Learning in Remote Sensing Images | - | 0
EcoSense: Energy-Efficient Intelligent Sensing for In-Shore Ship Detection through Edge-Cloud Collaboration | - | 0
3D Semantic MapNet: Building Maps for Multi-Object Re-Identification in 3D | - | 0
SC-Diff: 3D Shape Completion with Latent Diffusion Models | - | 0
OV9D: Open-Vocabulary Category-Level 9D Object Pose and Size Estimation | - | 0
ComboVerse: Compositional 3D Assets Creation Using Spatially-Aware Diffusion Guidance | - | 0
Instance-Warp: Saliency Guided Image Warping for Unsupervised Domain Adaptation | Code | 0
DetToolChain: A New Prompting Paradigm to Unleash Detection Ability of MLLM | Code | 1
Page 36 of 214

No leaderboard results yet.