SOTAVerified

Object

Replace the cat with a British Shorthair cat of the breed with bulging yellow eyes

Papers

Showing 26512700 of 10696 papers

TitleStatusHype
FreeInsert: Disentangled Text-Guided Object Insertion in 3D Gaussian Scene without Spatial Priors0
Inconsistency-based Active Learning for LiDAR Object Detection0
HeAL3D: Heuristical-enhanced Active Learning for 3D Object Detection0
Learning to Borrow Features for Improved Detection of Small Objects in Single-Shot Detectors0
Enhancing Self-Supervised Fine-Grained Video Object Tracking with Dynamic Memory Prediction0
Stereo X-ray tomography on deformed object tracking0
DOPE: Dual Object Perception-Enhancement Network for Vision-and-Language Navigation0
Black-Box Visual Prompt Engineering for Mitigating Object Hallucination in Large Vision Language Models0
MoSAM: Motion-Guided Segment Anything Model with Spatial-Temporal Memory Selection0
Hierarchical Context Learning of object components for unsupervised semantic segmentationCode0
The Mean of Multi-Object Trajectories0
Category-Level and Open-Set Object Pose Estimation for Robotics0
LM-MCVT: A Lightweight Multi-modal Multi-view Convolutional-Vision Transformer Approach for 3D Object Recognition0
Dexonomy: Synthesizing All Dexterous Grasp Types in a Grasp Taxonomy0
A Review of 3D Object Detection with Vision-Language Models0
Multi-Sensor Fusion of Active and Passive Measurements for Extended Object Tracking0
Object Learning and Robust 3D Reconstruction0
PCF-Grasp: Converting Point Completion to Geometry Feature to Enhance 6-DoF Grasp0
DeepPD: Joint Phase and Object Estimation from Phase Diversity with Neural Calibration of a Deformable Mirror0
Visual Intention Grounding for Egocentric Assistants0
HMPE:HeatMap Embedding for Efficient Transformer-Based Small Object Detection0
Few-Shot Referring Video Single- and Multi-Object Segmentation via Cross-Modal Affinity with Instance Sequence MatchingCode0
Crossing the Human-Robot Embodiment Gap with Sim-to-Real RL using One Human Demonstration0
SAR Object Detection with Self-Supervised Pretraining and Curriculum-Aware Sampling0
ViTa-Zero: Zero-shot Visuotactile Object 6D Pose Estimation0
RF-DETR Object Detection vs YOLOv12 : A Study of Transformer-based and CNN-based Architectures for Single-Class and Multi-Class Greenfruit Detection in Complex Orchard Environments Under Label Ambiguity0
HiScene: Creating Hierarchical 3D Scenes with Isometric View Generation0
VLLFL: A Vision-Language Model Based Lightweight Federated Learning Framework for Smart Agriculture0
Generalized Visual Relation Detection with Diffusion Models0
A Review of YOLOv12: Attention-Based Enhancements vs. Previous Versions0
DM-OSVP++: One-Shot View Planning Using 3D Diffusion Models for Active RGB-Based Object Reconstruction0
RADLER: Radar Object Detection Leveraging Semantic 3D City Models and Self-Supervised Radar-Image Learning0
Recent Advance in 3D Object and Scene Generation: A Survey0
Object Placement for Anything0
Safe-Construct: Redefining Construction Safety Violation Recognition as 3D Multi-View Engagement Task0
Weather-Aware Object Detection Transformer for Domain Adaptation0
3D Object Reconstruction with mmWave Radars0
COUNTS: Benchmarking Object Detectors and Multimodal Large Language Models under Distribution Shifts0
HUMOTO: A 4D Dataset of Mocap Human Object Interactions0
Multi-Object Grounding via Hierarchical Contrastive Siamese Transformers0
DiffMOD: Progressive Diffusion Point Denoising for Moving Object Detection in Remote Sensing0
MASSeg : 2nd Technical Report for 4th PVUW MOSE TrackCode0
RICCARDO: Radar Hit Prediction and Convolution for Camera-Radar 3D Object DetectionCode0
Cut-and-Splat: Leveraging Gaussian Splatting for Synthetic Data GenerationCode0
Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization0
Digital Twin Catalog: A Large-Scale Photorealistic 3D Object Digital Twin Dataset0
WS-DETR: Robust Water Surface Object Detection through Vision-Radar Fusion with Detection Transformer0
Learning Object Focused Attention0
POEM: Precise Object-level Editing via MLLM control0
SAMJAM: Zero-Shot Video Scene Graph Generation for Egocentric Kitchen Videos0
Show:102550
← PrevPage 54 of 214Next →

No leaderboard results yet.