SOTAVerified

Object

Replace the cat with a British Shorthair cat of the breed with bulging yellow eyes

Papers

Showing 101150 of 10696 papers

TitleStatusHype
Equalized Focal Loss for Dense Long-Tailed Object DetectionCode2
EGTR: Extracting Graph from Transformer for Scene Graph GenerationCode2
ESOD: Efficient Small Object Detection on High-Resolution ImagesCode2
Efficient Teacher: Semi-Supervised Object Detection for YOLOv5Code2
Efficient Image Pre-Training with Siamese Cropped Masked AutoencodersCode2
E2E-MFD: Towards End-to-End Synchronous Multimodal Fusion DetectionCode2
Efficient Video Object Segmentation via Modulated Cross-Attention MemoryCode2
EgoLifter: Open-world 3D Segmentation for Egocentric PerceptionCode2
Elysium: Exploring Object-level Perception in Videos via MLLMCode2
A Novel Unified Architecture for Low-Shot Counting by Detection and SegmentationCode2
Evaluating Object Hallucination in Large Vision-Language ModelsCode2
Exploring Enhanced Contextual Information for Video-Level Object TrackingCode2
3DGS-CD: 3D Gaussian Splatting-based Change Detection for Physical Object RearrangementCode2
Exploring Orthogonality in Open World Object DetectionCode2
EA-LSS: Edge-aware Lift-splat-shot Framework for 3D BEV Object DetectionCode2
Fast R-CNNCode2
Find Any Part in 3DCode2
Find First, Track Next: Decoupling Identification and Propagation in Referring Video Object SegmentationCode2
Aria Digital Twin: A New Benchmark Dataset for Egocentric 3D Machine PerceptionCode2
ARCTIC: A Dataset for Dexterous Bimanual Hand-Object ManipulationCode2
FocalFormer3D: Focusing on Hard Instance for 3D Object DetectionCode2
Focal Loss for Dense Object DetectionCode2
Dynamic Graph Induced Contour-aware Heat Conduction Network for Event-based Object DetectionCode2
Fully Sparse 3D Object DetectionCode2
EAGLE: Eigen Aggregation Learning for Object-Centric Unsupervised Semantic SegmentationCode2
Gaussian Grouping: Segment and Edit Anything in 3D ScenesCode2
Duoduo CLIP: Efficient 3D Understanding with Multi-View ImagesCode2
Analyzing and Boosting the Power of Fine-Grained Visual Recognition for Multi-modal Large Language ModelsCode2
DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor QueriesCode2
EasyHOI: Unleashing the Power of Large Models for Reconstructing Hand-Object Interactions in the WildCode2
DragonDiffusion: Enabling Drag-style Manipulation on Diffusion ModelsCode2
DreamMix: Decoupling Object Attributes for Enhanced Editability in Customized Image InpaintingCode2
DM-NeRF: 3D Scene Geometry Decomposition and Manipulation from 2D ImagesCode2
DQ-DETR: DETR with Dynamic Query for Tiny Object DetectionCode2
EdgeYOLO: An Edge-Real-Time Object DetectorCode2
Exploiting Unlabeled Data with Multiple Expert Teachers for Open Vocabulary Aerial Object Detection and Its Orientation AdaptationCode2
Generative Region-Language Pretraining for Open-Ended Object DetectionCode2
DEYOLO: Dual-Feature-Enhancement YOLO for Cross-Modality Object DetectionCode2
Aligning and Prompting Everything All at Once for Universal Visual PerceptionCode2
Diff9D: Diffusion-Based Domain-Generalized Category-Level 9-DoF Object Pose EstimationCode2
DetZero: Rethinking Offboard 3D Object Detection with Long-term Sequential Point CloudsCode2
ALBench: A Framework for Evaluating Active Learning in Object DetectionCode2
DexGraspNet: A Large-Scale Robotic Dexterous Grasp Dataset for General Objects Based on SimulationCode2
Visible-Thermal Multiple Object Tracking: Large-scale Video Dataset and Progressive Fusion ApproachCode2
DetGPT: Detect What You Need via ReasoningCode2
Detect Everything with Few ExamplesCode2
AGLA: Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local AttentionCode2
DetectoRS: Detecting Objects with Recursive Feature Pyramid and Switchable Atrous ConvolutionCode2
Deep Snake for Real-Time Instance SegmentationCode2
DeMo: Decoupled Feature-Based Mixture of Experts for Multi-Modal Object Re-IdentificationCode2
Show:102550
← PrevPage 3 of 214Next →

No leaderboard results yet.