SOTAVerified

Object

Replace the cat with a British Shorthair cat of the breed with bulging yellow eyes

Papers

Showing 9511000 of 10696 papers

TitleStatusHype
One Token to Seg Them All: Language Instructed Reasoning Segmentation in VideosCode2
1st Place Solution to the 8th HANDS Workshop Challenge -- ARCTIC Track: 3DGS-based Bimanual Category-agnostic Interaction Reconstruction0
CAFF-DINO: Multi-spectral object detection transformers with cross-attention features fusion0
An Overview of Multi-Object Estimation via Labeled Random Finite Set0
Improving Visual Object Tracking through Visual PromptingCode1
A Novel Unified Architecture for Low-Shot Counting by Detection and SegmentationCode2
Query matching for spatio-temporal action detection with query-based object detector0
Search3D: Hierarchical Open-Vocabulary 3D Segmentation0
You Only Speak Once to See0
Omni6D: Large-Vocabulary 3D Object Dataset for Category-Level 6D Object Pose EstimationCode1
Search and Detect: Training-Free Long Tail Object Detection via Web-Image Retrieval0
Advancing Object Detection in Transportation with Multimodal Large Language Models (MLLMs): A Comprehensive Review and Empirical Testing0
SOAR: Self-supervision Optimized UAV Action Recognition with Efficient Object-Aware Pretraining0
Amodal Instance Segmentation with Diffusion Shape Prior Estimation0
Robot See Robot Do: Imitating Articulated Object Manipulation with Monocular 4D ReconstructionCode2
General Compression Framework for Efficient Transformer Object Tracking0
Resolving Multi-Condition Confusion for Finetuning-Free Personalized Image GenerationCode2
CAMOT: Camera Angle-aware Multi-Object Tracking0
Hand-object reconstruction via interaction-aware graph attention mechanism0
A Grasping Movement Intention Estimator for Intuitive Control of Assistive Devices0
Transient Adversarial 3D Projection Attacks on Object Detection in Autonomous Driving0
Go-SLAM: Grounded Object Segmentation and Localization with Gaussian Splatting SLAM0
Source-Free Domain Adaptation for YOLO Object DetectionCode2
Progressive Representation Learning for Real-Time UAV TrackingCode2
Generative Object Insertion in Gaussian Splatting with a Multi-View Diffusion ModelCode1
Underwater Camouflaged Object Tracking Meets Vision-Language SAM2Code5
A Versatile and Differentiable Hand-Object Interaction Representation0
UICE-MIRNet guided image enhancement for underwater object detection0
OW-Rep: Open World Object Detection with Instance Representation Learning0
Mind the Prompt: A Novel Benchmark for Prompt-based Class-Agnostic CountingCode1
LaPose: Laplacian Mixture Shape Modeling for RGB-Based Category-Level Object Pose EstimationCode1
Tiny Robotics Dataset and Benchmark for Continual Object DetectionCode0
Towards Robust Object Detection: Identifying and Removing Backdoors via Module Inconsistency Analysis0
Articulated Object Manipulation using Online Axis Estimation with SAM2-Based Tracking0
DecoupleNet: A Lightweight Backbone Network With Efficient Feature Decoupling for Remote Sensing Visual TasksCode1
SOS: Segment Object System for Open-World Instance Segmentation With Object Priors0
A Bottom-Up Approach to Class-Agnostic Image Segmentation0
Formula-Supervised Visual-Geometric Pre-training0
Learning to Play Video Games with Intuitive Physics Priors0
Interpretable Action Recognition on Hard to Classify Actions0
Frequency-Guided Spatial Adaptation for Camouflaged Object Detection0
PoTATO: A Dataset for Analyzing Polarimetric Traces of Afloat Trash ObjectsCode0
End-to-end Open-vocabulary Video Visual Relationship Detection using Multi-modal Prompting0
SIM-OFE: Structure Information Mining and Object-aware Feature Enhancement for Fine-Grained Visual Categorization0
FAST GDRNPP: Improving the Speed of State-of-the-Art 6D Object Pose Estimation0
One Map to Find Them All: Real-time Open-Vocabulary Mapping for Zero-shot Multi-Object Navigation0
Towards Global Localization using Multi-Modal Object-Instance Re-IdentificationCode0
End-to-End Probabilistic Geometry-Guided Regression for 6DoF Object Pose Estimation0
DETECLAP: Enhancing Audio-Visual Representation Learning with Object Information0
Representing Positional Information in Generative World Models for Object Manipulation0
Show:102550
← PrevPage 20 of 214Next →

No leaderboard results yet.