SOTAVerified

Object

Replace the cat with a British Shorthair cat of the breed with bulging yellow eyes

Papers

Showing 201250 of 10696 papers

TitleStatusHype
4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language ModelsCode2
HazyDet: Open-source Benchmark for Drone-view Object Detection with Depth-cues in Hazy ScenesCode2
HOISDF: Constraining 3D Hand-Object Pose Estimation with Global Signed Distance FieldsCode2
HOLD: Category-agnostic 3D Reconstruction of Interacting Hands and Objects from VideoCode2
Improving Text-guided Object Inpainting with Semantic Pre-inpaintingCode2
In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic SegmentationCode2
Gaussian Grouping: Segment and Edit Anything in 3D ScenesCode2
ALBench: A Framework for Evaluating Active Learning in Object DetectionCode2
DetGPT: Detect What You Need via ReasoningCode2
InterDiff: Generating 3D Human-Object Interactions with Physics-Informed DiffusionCode2
Interpreting Object-level Foundation Models via Visual Precision SearchCode2
Is CLIP the main roadblock for fine-grained open-world perception?Code2
Knowledge Distillation in YOLOX-ViT for Side-Scan Sonar Object DetectionCode2
Autoregressive Visual TrackingCode2
LayoutDiffusion: Controllable Diffusion Model for Layout-to-image GenerationCode2
Learning Embeddings with Centroid Triplet Loss for Object Identification in Robotic GraspingCode2
LeviTor: 3D Trajectory Oriented Image-to-Video SynthesisCode2
LeYOLO, New Scalable and Efficient CNN Architecture for Object DetectionCode2
Localization Distillation for Object DetectionCode2
Detect Everything with Few ExamplesCode2
DenseWorld-1M: Towards Detailed Dense Grounded Caption in the Real WorldCode2
Magic Tokens: Select Diverse Tokens for Multi-modal Object Re-IdentificationCode2
MegaPose: 6D Pose Estimation of Novel Objects via Render & CompareCode2
MeMOTR: Long-Term Memory-Augmented Transformer for Multi-Object TrackingCode2
DetectoRS: Detecting Objects with Recursive Feature Pyramid and Switchable Atrous ConvolutionCode2
MeViS: A Large-scale Benchmark for Video Segmentation with Motion ExpressionsCode2
3DGS-CD: 3D Gaussian Splatting-based Change Detection for Physical Object RearrangementCode2
Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive DecodingCode2
MonoCD: Monocular 3D Object Detection with Complementary DepthsCode2
Monocular 3D Object Detection with Depth from MotionCode2
Beyond MOT: Semantic Multi-Object TrackingCode2
AGLA: Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local AttentionCode2
Deep Snake for Real-Time Instance SegmentationCode2
DeMo: Decoupled Feature-Based Mixture of Experts for Multi-Modal Object Re-IdentificationCode2
Aligning and Prompting Everything All at Once for Universal Visual PerceptionCode2
Dense Distinct Query for End-to-End Object DetectionCode2
Multi-Class Road User Detection With 3+1D Radar in the View-of-Delft DatasetCode2
Multi-Grained Angle Representation for Remote Sensing Object DetectionCode2
DetZero: Rethinking Offboard 3D Object Detection with Long-term Sequential Point CloudsCode2
DAVE -- A Detect-and-Verify Paradigm for Low-Shot CountingCode2
NetTrack: Tracking Highly Dynamic Objects with a NetCode2
NOPE: Novel Object Pose Estimation from a Single ImageCode2
Objaverse++: Curated 3D Object Dataset with Quality AnnotationsCode2
Segmentation Transformer: Object-Contextual Representations for Semantic SegmentationCode2
BOP Challenge 2020 on 6D Object LocalizationCode2
AccDiffusion: An Accurate Method for Higher-Resolution Image GenerationCode2
OCNet: Object Context Network for Scene ParsingCode2
DAMSDet: Dynamic Adaptive Multispectral Detection Transformer with Competitive Query Selection and Adaptive Feature FusionCode2
Decoupling Features in Hierarchical Propagation for Video Object SegmentationCode2
Cross-View Referring Multi-Object TrackingCode2
Show:102550
← PrevPage 5 of 214Next →

No leaderboard results yet.