SOTAVerified

Object

Replace the cat with a British Shorthair cat of the breed with bulging yellow eyes

Papers

Showing 601-650 of 10696 papers

| Title | Status | Hype |
| --- | --- | --- |
| HALLUCINOGEN: A Benchmark for Evaluating Object Hallucination in Large Visual-Language Models | Code | 0 |
| DriveEditor: A Unified 3D Information-Guided Framework for Controllable Object Editing in Driving Scenes | Code | 1 |
| Interacted Object Grounding in Spatio-Temporal Human-Object Interactions | Code | 1 |
| Future Success Prediction in Open-Vocabulary Object Manipulation Tasks Based on End-Effector Trajectories | | 0 |
| Revisiting Monocular 3D Object Detection from Scene-Level Depth Retargeting to Instance-Level Spatial Refinement | | 0 |
| Symbolic Disentangled Representations for Images | | 0 |
| EC-Diffuser: Multi-Object Manipulation via Entity-Centric Behavior Generation | | 0 |
| CGCOD: Class-Guided Camouflaged Object Detection | Code | 2 |
| Evaluating the Adversarial Robustness of Detection Transformers | | 0 |
| Distortion-Aware Adversarial Attacks on Bounding Boxes of Object Detectors | Code | 0 |
| COMO: Cross-Mamba Interaction and Offset-Guided Fusion for Multimodal Object Detection | | 0 |
| PartGen: Part-level 3D Generation and Reconstruction with Multi-View Diffusion Models | | 0 |
| Multi-Point Positional Insertion Tuning for Small Object Detection | | 0 |
| S-INF: Towards Realistic Indoor Scene Synthesis via Scene Implicit Neural Field | Code | 0 |
| Cross-View Referring Multi-Object Tracking | Code | 2 |
| OLiDM: Object-aware LiDAR Diffusion Models for Autonomous Driving | | 0 |
| Seamless Detection: Unifying Salient Object Detection and Camouflaged Object Detection | Code | 1 |
| Concept Guided Co-saliency Objection Detection | | 0 |
| Generalizable Articulated Object Perception with Superpoints | | 0 |
| Improving Object Detection for Time-Lapse Imagery Using Temporal Features in Wildlife Monitoring | Code | 0 |
| Affordance-Aware Object Insertion via Mask-Aware Dual Diffusion | Code | 1 |
| Multi-Sensor Object Anomaly Detection: Unifying Appearance, Geometry, and Internal Properties | Code | 2 |
| Arti-PG: A Toolbox for Procedurally Synthesizing Large-Scale and Diverse Articulated Objects with Rich Annotations | | 0 |
| Leveraging Color Channel Independence for Improved Unsupervised Object Detection | | 0 |
| LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis | Code | 2 |
| ManiVideo: Generating Hand-Object Manipulation Video with Dexterous and Generalizable Grasping | | 0 |
| Descriptive Caption Enhancement with Visual Specialists for Multimodal Perception | Code | 0 |
| Real Classification by Description: Extending CLIP's Limits of Part Attributes Recognition | Code | 0 |
| MMO-IG: Multi-Class and Multi-Scale Object Image Generation for Remote Sensing | | 0 |
| Temporally Consistent Object-Centric Learning by Contrasting Slots | | 0 |
| RelationField: Relate Anything in Radiance Fields | Code | 2 |
| Object Style Diffusion for Generalized Object Detection in Urban Scene | | 0 |
| PixelMan: Consistent Object Editing with Diffusion Models via Pixel Manipulation and Generation | Code | 1 |
| M^3-VOS: Multi-Phase, Multi-Transition, and Multi-Scenery Video Object Segmentation | Code | 1 |
| CREST: An Efficient Conjointly-trained Spike-driven Framework for Event-based Object Detection Exploiting Spatiotemporal Dynamics | Code | 1 |
| Efficient Oriented Object Detection with Enhanced Small Object Recognition in Aerial Images | | 0 |
| Differential Alignment for Domain Adaptive Object Detection | Code | 1 |
| Attentive Eraser: Unleashing Diffusion Model's Object Removal Potential via Self-Attention Redirection Guidance | Code | 3 |
| PromptDet: A Lightweight 3D Object Detection Framework with LiDAR Prompts | | 0 |
| Efficient Object-centric Representation Learning with Pre-trained Geometric Prior | | 0 |
| Probabilistic GOSPA: A Metric for Performance Evaluation of Multi-Object Filters with Uncertainties | Code | 0 |
| MeshArt: Generating Articulated Meshes with Structure-Guided Transformers | | 0 |
| Leveraging Retrieval-Augmented Tags for Large Vision-Language Understanding in Complex Scenes | | 0 |
| Category Level 6D Object Pose Estimation from a Single RGB Image using Diffusion | | 0 |
| Oriented Tiny Object Detection: A Dataset, Benchmark, and Dynamic Unbiased Learning | | 0 |
| MOVIS: Enhancing Multi-Object Novel View Synthesis for Indoor Scenes | | 0 |
| Redefining Normal: A Novel Object-Level Approach for Multi-Object Novelty Detection | Code | 0 |
| Exploring Enhanced Contextual Information for Video-Level Object Tracking | Code | 2 |
| DeMo: Decoupled Feature-Based Mixture of Experts for Multi-Modal Object Re-Identification | Code | 2 |
| MambaPro: Multi-Modal Object Re-Identification with Mamba Aggregation and Synergistic Prompt | Code | 2 |
Page 13 of 214

No leaderboard results yet.