SOTAVerified

Object

Replace the cat with a British Shorthair cat of the breed with bulging yellow eyes

Papers

Showing 501-550 of 10696 papers

Title | Status | Hype
Objects matter: object-centric world models improve reinforcement learning in visually complex environments | - | 0
3D Reconstruction of non-visible surfaces of objects from a Single Depth View -- Comparative Study | - | 0
Domain Adaptation from Generated Multi-Weather Images for Unsupervised Maritime Object Classification | Code | 0
Analyzing and Boosting the Power of Fine-Grained Visual Recognition for Multi-modal Large Language Models | Code | 2
Evaluating Hallucination in Large Vision-Language Models based on Context-Aware Object Similarities | - | 0
Estimation-theoretic analysis of lensless imaging | - | 0
ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations | - | 0
CSAOT: Cooperative Multi-Agent System for Active Object Tracking | - | 0
CuriousBot: Interactive Mobile Exploration via Actionable 3D Relational Object Graph | - | 0
MONA: Moving Object Detection from Videos Shot by Dynamic Camera | - | 0
Slot-BERT: Self-supervised Object Discovery in Surgical Video | - | 0
TOFFE -- Temporally-binned Object Flow from Events for High-speed and Energy-Efficient Object Detection and Tracking | - | 0
SMamba: Sparse Mamba for Event-based Object Detection | Code | 1
Green Video Camouflaged Object Detection | - | 0
Surface-SOS: Self-Supervised Object Segmentation via Neural Surface Representation | Code | 0
FLORA: Formal Language Model Enables Robust Training-free Zero-shot Object Referring Analysis | - | 0
RE-POSE: Synergizing Reinforcement Learning-Based Partitioning and Offloading for Edge Object Detection | - | 0
MonoSOWA: Scalable monocular 3D Object detector Without human Annotations | - | 0
Detecting Contextual Anomalies by Discovering Consistent Spatial Regions | - | 0
Predicting Performance of Object Detection Models in Electron Microscopy Using Random Forests | Code | 0
Bootstrapping Corner Cases: High-Resolution Inpainting for Safety Critical Detect and Avoid for Automated Flying | - | 0
Learning Motion and Temporal Cues for Unsupervised Video Object Segmentation | Code | 1
Everybody Likes to Sleep: A Computer-Assisted Comparison of Object Naming Data from 30 Languages | Code | 0
SmartEraser: Remove Anything from Images using Masked-Region Guidance | - | 0
DAViD: Modeling Dynamic Affordance of 3D Objects using Pre-trained Video Diffusion Models | - | 0
Object-Centric 2D Gaussian Splatting: Background Removal and Occlusion-Aware Pruning for Compact Object Models | - | 0
BlobGEN-Vid: Compositional Text-to-Video Generation with Blob Video Representations | - | 0
Collaborative Learning for 3D Hand-Object Reconstruction and Compositional Action Recognition from Egocentric RGB Videos Using Superquadrics | - | 0
SST-EM: Advanced Metrics for Evaluating Semantic, Spatial and Temporal Aspects in Video Editing | Code | 0
VDOR: A Video-based Dataset for Object Removal via Sequence Consistency | - | 0
VAGeo: View-specific Attention for Cross-View Object Geo-Localization | - | 0
Toward Realistic Camouflaged Object Detection: Benchmarks and Method | Code | 1
UnCommon Objects in 3D | Code | 5
Guided SAM: Label-Efficient Part Segmentation | - | 0
3DCoMPaT200: Language-Grounded Compositional Understanding of Parts and Materials of 3D Shapes | Code | 1
Mamba-MOC: A Multicategory Remote Object Counting via State Space Model | - | 0
UniQ: Unified Decoder with Task-specific Queries for Efficient Scene Graph Generation | - | 0
From Simple to Complex Skills: The Case of In-Hand Object Reorientation | - | 0
Perception-as-Control: Fine-grained Controllable Image Animation with 3D-aware Motion Representation | - | 0
Improving Skeleton-based Action Recognition with Interactive Object Information | Code | 0
UPAQ: A Framework for Real-Time and Energy-Efficient 3D Object Detection in Autonomous Vehicles | - | 0
TexHOI: Reconstructing Textures of 3D Unknown Objects in Monocular Hand-Object Interaction Scenes | Code | 0
Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos | Code | 5
Learning to Transfer Human Hand Skills for Robot Manipulations | - | 0
AuxDepthNet: Real-Time Monocular 3D Object Detection with Depth-Sensitive Features | - | 0
Strip R-CNN: Large Strip Convolution for Remote Sensing Object Detection | Code | 4
Universal Fine-grained Visual Categorization by Concept Guided Learning | Code | 0
HOGSA: Bimanual Hand-Object Interaction Understanding with 3D Gaussian Splatting Based Data Augmentation | - | 0
Through-The-Mask: Mask-based Motion Trajectories for Image-to-Video Generation | - | 0
Human Gaze Boosts Object-Centered Representation Learning | - | 0
Page 11 of 214

No leaderboard results yet.