SOTAVerified

Object

Replace the cat with a British Shorthair cat of the breed with bulging yellow eyes

Papers

Showing 151200 of 10696 papers

TitleStatusHype
METOR: A Unified Framework for Mutual Enhancement of Objects and Relationships in Open-vocabulary Video Visual Relationship DetectionCode0
Underwater object detection in sonar imagery with detection transformer and Zero-shot neural architecture search0
An Edge AI Solution for Space Object Detection0
Enhancing Satellite Object Localization with Dilated Convolutions and Attention-aided Spatial PoolingCode0
A Simple Detector with Frame Dynamics is a Strong TrackerCode1
Visual Affordances: Enabling Robots to Understand Object Functionality0
PaniCar: Securing the Perception of Advanced Driving Assistance Systems Against Emergency Vehicle Lighting0
MDE-Edit: Masked Dual-Editing for Multi-Object Image Editing via Diffusion Models0
Web2Grasp: Learning Functional Grasps from Web Images of Hand-Object Interactions0
Low Resolution Next Best View for Robot Packing0
One2Any: One-Reference 6D Pose Estimation for Any Object0
CountDiffusion: Text-to-Image Synthesis with Training-Free Counting-Guidance Diffusion0
AS3D: 2D-Assisted Cross-Modal Understanding with Semantic-Spatial Scene Graphs for 3D Visual GroundingCode0
Corner Cases: How Size and Position of Objects Challenge ImageNet-Trained Models0
EOPose : Exemplar-based object reposing using Generalized Pose Correspondences0
Null Counterfactual Factor Interactions for Goal-Conditioned Reinforcement Learning0
Sim2Real Transfer for Vision-Based Grasp VerificationCode0
Hierarchical Compact Clustering Attention (COCA) for Unsupervised Object-Centric Learning0
RESAnything: Attribute Prompting for Arbitrary Referring Segmentation0
Probabilistic Interactive 3D Segmentation with Hierarchical Neural Processes0
FreeInsert: Disentangled Text-Guided Object Insertion in 3D Gaussian Scene without Spatial Priors0
CDFormer: Cross-Domain Few-Shot Object Detection Transformer Against Feature ConfusionCode1
Inconsistency-based Active Learning for LiDAR Object Detection0
HeAL3D: Heuristical-enhanced Active Learning for 3D Object Detection0
Enhancing Self-Supervised Fine-Grained Video Object Tracking with Dynamic Memory Prediction0
Stereo X-ray tomography on deformed object tracking0
MoSAM: Motion-Guided Segment Anything Model with Spatial-Temporal Memory Selection0
Learning to Borrow Features for Improved Detection of Small Objects in Single-Shot Detectors0
DOPE: Dual Object Perception-Enhancement Network for Vision-and-Language Navigation0
LLM-Empowered Embodied Agent for Memory-Augmented Task Planning in Household RoboticsCode1
Black-Box Visual Prompt Engineering for Mitigating Object Hallucination in Large Vision Language Models0
The Mean of Multi-Object Trajectories0
Hierarchical Context Learning of object components for unsupervised semantic segmentationCode0
Category-Level and Open-Set Object Pose Estimation for Robotics0
LM-MCVT: A Lightweight Multi-modal Multi-view Convolutional-Vision Transformer Approach for 3D Object Recognition0
Dexonomy: Synthesizing All Dexterous Grasp Types in a Grasp Taxonomy0
A Review of 3D Object Detection with Vision-Language Models0
Multi-Sensor Fusion of Active and Passive Measurements for Extended Object Tracking0
PCF-Grasp: Converting Point Completion to Geometry Feature to Enhance 6-DoF Grasp0
Object Learning and Robust 3D Reconstruction0
DeepPD: Joint Phase and Object Estimation from Phase Diversity with Neural Calibration of a Deformable Mirror0
HMPE:HeatMap Embedding for Efficient Transformer-Based Small Object Detection0
Few-Shot Referring Video Single- and Multi-Object Segmentation via Cross-Modal Affinity with Instance Sequence MatchingCode0
Visual Intention Grounding for Egocentric Assistants0
SAR Object Detection with Self-Supervised Pretraining and Curriculum-Aware Sampling0
VLLFL: A Vision-Language Model Based Lightweight Federated Learning Framework for Smart Agriculture0
RF-DETR Object Detection vs YOLOv12 : A Study of Transformer-based and CNN-based Architectures for Single-Class and Multi-Class Greenfruit Detection in Complex Orchard Environments Under Label Ambiguity0
Crossing the Human-Robot Embodiment Gap with Sim-to-Real RL using One Human Demonstration0
ViTa-Zero: Zero-shot Visuotactile Object 6D Pose Estimation0
HiScene: Creating Hierarchical 3D Scenes with Isometric View Generation0
Show:102550
← PrevPage 4 of 214Next →

No leaderboard results yet.