SOTAVerified

Object

Replace the cat with a British Shorthair cat of the breed with bulging yellow eyes

Papers

Showing 701750 of 10696 papers

TitleStatusHype
Efficient Object Detection in Autonomous Driving using Spiking Neural Networks: Performance, Energy Consumption Analysis, and Insights into Open-set Object DiscoveryCode1
MedYOLO: A Medical Image Object Detection FrameworkCode1
GenHowTo: Learning to Generate Actions and State Transformations from Instructional VideosCode1
InteractDiffusion: Interaction Control in Text-to-Image Diffusion ModelsCode1
Correcting Diffusion Generation through ResamplingCode1
Towards Enhanced Image Inpainting: Mitigating Unwanted Object Insertion and Preserving Color ConsistencyCode1
3D Copy-Paste: Physically Plausible Object Insertion for Monocular 3D DetectionCode1
Benchmarking and Analysis of Unsupervised Object Segmentation from Real-world Single ImagesCode1
Mitigating Open-Vocabulary Caption HallucinationsCode1
High Pileup Particle Tracking with Object CondensationCode1
TokenCompose: Text-to-Image Diffusion with Token-level SupervisionCode1
Boosting Segment Anything Model Towards Open-Vocabulary LearningCode1
DreamComposer: Controllable 3D Object Generation via Multi-View ConditionsCode1
Diffusion-SS3D: Diffusion Model for Semi-supervised 3D Object DetectionCode1
Mitigating Fine-Grained Hallucination by Fine-Tuning Large Vision-Language Models with Caption RewritesCode1
BEVNeXt: Reviving Dense BEV Frameworks for 3D Object DetectionCode1
Object Recognition as Next Token PredictionCode1
Toward Improving Robustness of Object Detectors Against Domain ShiftCode1
SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive TransformersCode1
Is Underwater Image Enhancement All Object Detectors Need?Code1
A Simple Video Segmenter by Tracking Objects Along Axial TrajectoriesCode1
Lasagna: Layered Score Distillation for Disentangled Object RelightingCode1
The devil is in the fine-grained details: Evaluating open-vocabulary object detectors for fine-grained understandingCode1
RQFormer: Rotated Query Transformer for End-to-End Oriented Object DetectionCode1
Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object SegmentationCode1
UGG: Unified Generative GraspingCode1
Segment Every Out-of-Distribution ObjectCode1
Single-Model and Any-Modality for Video Object TrackingCode1
Visual Programming for Zero-shot Open-Vocabulary 3D Visual GroundingCode1
PointOBB: Learning Oriented Object Detection via Single Point SupervisionCode1
Point2RBox: Combine Knowledge from Synthetic Visual Patterns for End-to-end Oriented Object Detection with Single Point SupervisionCode1
Physical Reasoning and Object Planning for Household Embodied AgentsCode1
Point, Segment and Count: A Generalized Framework for Object CountingCode1
Toward Open Vocabulary Aerial Object Detection with CLIP-Activated Student-Teacher LearningCode1
Enhancing Novel Object Detection via Cooperative Foundational ModelsCode1
Expanding Scene Graph Boundaries: Fully Open-vocabulary Scene Graph Generation via Visual-Concept Alignment and RetentionCode1
SecondPose: SE(3)-Consistent Dual-Stream Feature Fusion for Category-Level Pose EstimationCode1
ShapeMatcher: Self-Supervised Joint Shape Canonicalization, Segmentation, Retrieval and DeformationCode1
Closely-Spaced Object Classification Using MuyGPySCode1
Neural-Logic Human-Object Interaction DetectionCode1
Identifying Linear Relational Concepts in Large Language ModelsCode1
AMBER: An LLM-free Multi-dimensional Benchmark for MLLMs Hallucination EvaluationCode1
Which One? Leveraging Context Between Objects and Multiple Views for Language GroundingCode1
Language-guided Robot Grasping: CLIP-based Referring Grasp Synthesis in ClutterCode1
Kinematic-aware Prompting for Generalizable Articulated Object Manipulation with LLMsCode1
Rotation Invariant Transformer for Recognizing Object in UAVsCode1
Proposal-Level Unsupervised Domain Adaptation for Open World Unbiased DetectorCode1
VQPy: An Object-Oriented Approach to Modern Video AnalyticsCode1
Patch-based Selection and Refinement for Early Object DetectionCode1
Re-Scoring Using Image-Language Similarity for Few-Shot Object DetectionCode1
Show:102550
← PrevPage 15 of 214Next →

No leaderboard results yet.