SOTAVerified

Object

Replace the cat with a British Shorthair cat of the breed with bulging yellow eyes

Papers

Showing 51100 of 10696 papers

TitleStatusHype
Deep Learning-Based Object Pose Estimation: A Comprehensive SurveyCode3
DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular VideosCode3
Moving Object Segmentation: All You Need Is SAM (and Flow)Code3
ZeST: Zero-Shot Material Transfer from a Single ImageCode3
RCBEVDet: Radar-camera Fusion in Bird's Eye View for 3D Object DetectionCode3
Benchmarks and Challenges in Pose Estimation for Egocentric Hand Interactions with ObjectsCode3
Multiple Object Tracking as ID PredictionCode3
MTP: Advancing Remote Sensing Foundation Model via Multi-Task PretrainingCode3
ShapeLLM: Universal 3D Object Understanding for Embodied InteractionCode3
SGS-SLAM: Semantic Gaussian Splatting For Neural Dense SLAMCode3
General Object Foundation Model for Images and Videos at ScaleCode3
MotionCtrl: A Unified and Flexible Motion Controller for Video GenerationCode3
Putting the Object Back into Video Object SegmentationCode3
MagicDrive: Street View Generation with Diverse 3D Geometry ControlCode3
Segment Anything Meets Point TrackingCode3
SAM Fails to Segment Anything? -- SAM-Adapter: Adapting SAM in Underperformed Scenes: Camouflage, Shadow, Medical Image Segmentation, and MoreCode3
Geometric-aware Pretraining for Vision-centric 3D Object DetectionCode3
BundleSDF: Neural 6-DoF Tracking and 3D Reconstruction of Unknown ObjectsCode3
Universal Instance Perception as Object Discovery and RetrievalCode3
Deep OC-SORT: Multi-Pedestrian Tracking by Adaptive Re-IdentificationCode3
OmDet: Large-scale vision-language multi-dataset pre-training with multimodal detection networkCode3
BoT-SORT: Robust Associations Multi-Pedestrian TrackingCode3
PETRv2: A Unified Framework for 3D Perception from Multi-Camera ImagesCode3
Observation-Centric SORT: Rethinking SORT for Robust Multi-Object TrackingCode3
PETR: Position Embedding Transformation for Multi-View 3D Object DetectionCode3
DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation ModelsCode3
NeROIC: Neural Rendering of Objects from Online Image CollectionsCode3
Motion Representations for Articulated AnimationCode3
Robust and Accurate Object Detection via Adversarial LearningCode3
A Comparative Analysis of Object Detection Metrics with a Companion Open-Source ToolkitCode3
A Survey on Performance Metrics for Object-Detection AlgorithmsCode3
YOLOv4: Optimal Speed and Accuracy of Object DetectionCode3
First Order Motion Model for Image AnimationCode3
Bridging the Gap Between Anchor-based and Anchor-free Detection via Adaptive Training Sample SelectionCode3
EfficientDet: Scalable and Efficient Object DetectionCode3
DenseWorld-1M: Towards Detailed Dense Grounded Caption in the Real WorldCode2
RGBTrack: Fast, Robust Depth-Free 6D Pose Estimation and TrackingCode2
InstructSAM: A Training-Free Framework for Instruction-Oriented Remote Sensing Object RecognitionCode2
Dynamic Graph Induced Contour-aware Heat Conduction Network for Event-based Object DetectionCode2
NTIRE 2025 Challenge on Cross-Domain Few-Shot Object Detection: Methods and ResultsCode2
Objaverse++: Curated 3D Object Dataset with Quality AnnotationsCode2
Caption Anything in Video: Fine-grained Object-centric Captioning via Spatiotemporal Multimodal PromptingCode2
InteractVLM: 3D Interaction Reasoning from 2D Foundational ModelsCode2
SAM2MOT: A Novel Paradigm of Multi-Object Tracking by SegmentationCode2
COB-GS: Clear Object Boundaries in 3DGS Segmentation Based on Boundary-Adaptive Gaussian SplittingCode2
Rethinking End-to-End 2D to 3D Scene Segmentation in Gaussian SplattingCode2
4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language ModelsCode2
Omnidirectional Multi-Object TrackingCode2
Find First, Track Next: Decoupling Identification and Propagation in Referring Video Object SegmentationCode2
Diff9D: Diffusion-Based Domain-Generalized Category-Level 9-DoF Object Pose EstimationCode2
Show:102550
← PrevPage 2 of 214Next →

No leaderboard results yet.