SOTAVerified

Object

Replace the cat with a British Shorthair cat of the breed with bulging yellow eyes

Papers

Showing 101150 of 10696 papers

TitleStatusHype
NEXT: Multi-Grained Mixture of Experts via Text-Modulation for Multi-Modal Object Re-ID0
Progressive Scaling Visual Object Tracking0
MaskedManipulator: Versatile Whole-Body Control for Loco-Manipulation0
EOTNet: Deep Memory Aided Bayesian Filter for Extended Object TrackingCode0
FusionTrack: End-to-End Multi-Object Tracking in Arbitrary Multi-View Environment0
ThinkVideo: High-Quality Reasoning Video Segmentation with Chain of ThoughtsCode0
SD-OVON: A Semantics-aware Dataset and Benchmark Generation Pipeline for Open-Vocabulary Object Navigation in Dynamic Scenes0
RQR3D: Reparametrizing the regression targets for BEV-based 3D object detection0
Object-level Cross-view Geo-localization with Location Enhancement and Multi-Head Cross AttentionCode1
Adapting SAM 2 for Visual Object Tracking: 1st Place Solution for MMVPR Challenge Multi-Modal Tracking0
Sampling Strategies for Efficient Training of Deep Learning Object Detection Algorithms0
MAFE R-CNN: Selecting More Samples to Learn Category-aware Features for Small Object Detection0
Semantic Compression of 3D Objects for Open and Collaborative Virtual Worlds0
Embodied Agents Meet Personalization: Exploring Memory Utilization for Personalized Assistance0
TextureSAM: Towards a Texture Aware Foundation Model for Segmentation0
MEgoHand: Multimodal Egocentric Hand-Object Interaction Motion Generation0
Investigating Fine- and Coarse-grained Structural Correspondences Between Deep Neural Networks and Human Object Image Similarity Judgments Using Unsupervised Alignment0
PromptTAD: Object-Prompt Enhanced Traffic Anomaly DetectionCode0
gen2seg: Generative Models Enable Generalizable Instance Segmentation0
Expanding Zero-Shot Object Counting with Rich Prompts0
InstructSAM: A Training-Free Framework for Instruction-Oriented Remote Sensing Object RecognitionCode2
Multispectral Detection Transformer with Infrared-Centric Sensor FusionCode0
RAZER: Robust Accelerated Zero-Shot 3D Open-Vocabulary Panoptic Reconstruction with Spatio-Temporal Aggregation0
Object-Focus Actor for Data-efficient Robot Generalization Dexterous Manipulation0
Optimizing Retrieval Augmented Generation for Object Constraint Language0
LiDAR MOT-DETR: A LiDAR-based Two-Stage Transformer for 3D Multiple Object Tracking0
Dynamic Graph Induced Contour-aware Heat Conduction Network for Event-based Object DetectionCode2
OPA-Pack: Object-Property-Aware Robotic Bin Packing0
Emergent Active Perception and Dexterity of Simulated Humanoids from Visual Reinforcement Learning0
GTR: Gaussian Splatting Tracking and Reconstruction of Unknown Objects Based on Appearance and Geometric Complexity0
Feasibility with Language Models for Open-World Compositional Zero-Shot Learning0
PARSEC: Preference Adaptation for Robotic Object Rearrangement from Scene ContextCode0
AW-GATCN: Adaptive Weighted Graph Attention Convolutional Network for Event Camera Data Joint Denoising and Object Recognition0
RefPose: Leveraging Reference Geometric Correspondences for Accurate 6D Pose Estimation of Unseen Objects0
A High-Performance Thermal Infrared Object Detection Framework with Centralized Regulation0
StoryReasoning Dataset: Using Chain-of-Thought for Scene Understanding and Grounded Story GenerationCode1
MIRAGE: A Multi-modal Benchmark for Spatial Perception, Reasoning, and Intelligence0
ManipBench: Benchmarking Vision-Language Models for Low-Level Robot Manipulation0
MoRAL: Motion-aware Multi-Frame 4D Radar and LiDAR Fusion for Robust 3D Object Detection0
Camera-Only 3D Panoptic Scene Completion for Autonomous Driving through Differentiable Object ShapesCode0
Beyond General Prompts: Automated Prompt Refinement using Contrastive Class Alignment Scores for Disambiguating Objects in Vision-Language Models0
Robustness Analysis against Adversarial Patch Attacks in Fully Unmanned Stores0
Leveraging Multi-Modal Information to Enhance Dataset Distillation0
Towards Autonomous UAV Visual Object Search in City Space: Benchmark and Agentic Methodology0
HMPNet: A Feature Aggregation Architecture for Maritime Object Detection from a Shipborne PerspectiveCode0
Object detection in adverse weather conditions for autonomous vehicles using Instruct Pix2Pix0
Improving Unsupervised Task-driven Models of Ventral Visual Stream via Relative Position PredictivityCode0
Asynchronous Multi-Object Tracking with an Event CameraCode1
Towards Accurate State Estimation: Kalman Filter Incorporating Motion Dynamics for 3D Multi-Object Tracking0
Hybrid Spiking Vision Transformer for Object Detection with Event Cameras0
Show:102550
← PrevPage 3 of 214Next →

No leaderboard results yet.