SOTAVerified

Object

Replace the cat with a British Shorthair cat with bulging yellow eyes

Papers

Showing 551-575 of 10696 papers

Title | Status | Hype
Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection | Code | 1
DroneMOT: Drone-based Multi-Object Tracking Considering Detection Difficulties and Simultaneous Moving of Drones and Objects | Code | 1
Visual Multi-Object Tracking with Re-Identification and Occlusion Handling using Labeled Random Finite Sets | Code | 1
SRPose: Two-view Relative Pose Estimation with Sparse Keypoints | Code | 1
ActionVOS: Actions as Prompts for Video Object Segmentation | Code | 1
Cue Point Estimation using Object Detection | Code | 1
CaRe-Ego: Contact-aware Relationship Modeling for Egocentric Interactive Hand-object Segmentation | Code | 1
Zero-shot Object Counting with Good Exemplars | Code | 1
StreamLTS: Query-based Temporal-Spatial LiDAR Fusion for Cooperative Object Detection | Code | 1
Knowledge Transfer with Simulated Inter-Image Erasing for Weakly Supervised Semantic Segmentation | Code | 1
Comics Datasets Framework: Mix of Comics datasets for detection benchmarking | Code | 1
Similarity Distance-Based Label Assignment for Tiny Object Detection | Code | 1
Learning Granularity-Aware Affordances from Human-Object Interaction for Tool-Based Functional Grasping in Dexterous Robotics | Code | 1
BiTrack: Bidirectional Offline 3D Multi-Object Tracking Using Camera-LiDAR Data | Code | 1
Uncertainty for SVBRDF Acquisition using Frequency Analysis | Code | 1
Evaluating and Analyzing Relationship Hallucinations in Large Vision-Language Models | Code | 1
MVOC: a training-free multiple video object composition method with diffusion models | Code | 1
DiPEx: Dispersing Prompt Expansion for Class-Agnostic Object Detection | Code | 1
African or European Swallow? Benchmarking Large Vision-Language Models for Fine-Grained Object Classification | Code | 1
CustAny: Customizing Anything from A Single Example | Code | 1
Composing Object Relations and Attributes for Image-Text Matching | Code | 1
MMRel: A Relation Understanding Benchmark in the MLLM Era | Code | 1
ImageNet3D: Towards General-Purpose Object-Level 3D Understanding | Code | 1
3D-AVS: LiDAR-based 3D Auto-Vocabulary Segmentation | Code | 1
LaMOT: Language-Guided Multi-Object Tracking | Code | 1

No leaderboard results yet.