SOTAVerified

Object

Replace the cat with a British Shorthair cat of the breed with bulging yellow eyes

Papers

Showing 2801-2850 of 10696 papers

Title | Status | Hype
Bring Remote Sensing Object Detect Into Nature Language Model: Using SFT Method | | 0
Hierarchical Contact-Rich Trajectory Optimization for Multi-Modal Manipulation using Tight Convex Relaxations | | 0
OmniPaint: Mastering Object-Oriented Editing via Disentangled Insertion-Removal Inpainting | | 0
Embodied Crowd Counting | | 0
Large Language Model Guided Progressive Feature Alignment for Multimodal UAV Object Detection | | 0
Recovering Partially Corrupted Major Objects through Tri-modality Based Image Completion | | 0
Large model enhanced computational ghost imaging | Code | 0
Erase Diffusion: Empowering Object Removal Through Calibrating Diffusion Pathways | | 0
Multi-Modal 3D Mesh Reconstruction from Images and Text | | 0
Find your Needle: Small Object Image Retrieval via Multi-Object Attention Optimization | | 0
Hierarchical Cross-Modal Alignment for Open-Vocabulary 3D Object Detection | | 0
EAZY: Eliminating Hallucinations in LVLMs by Zeroing out Hallucinatory Image Tokens | | 0
Aligning Instance-Semantic Sparse Representation towards Unsupervised Object Segmentation and Shape Abstraction with Repeatable Primitives | | 0
A Light Perspective for 3D Object Detection | | 0
OV-SCAN: Semantically Consistent Alignment for Novel Object Discovery in Open-Vocabulary 3D Object Detection | | 0
AxisPose: Model-Free Matching-Free Single-Shot 6D Object Pose Estimation via Axis Generation | | 0
D3DR: Lighting-Aware Object Insertion in Gaussian Splatting | | 0
OpenRSD: Towards Open-prompts for Object Detection in Remote Sensing Images | | 0
Object-Centric World Model for Language-Guided Manipulation | | 0
Accurate and Efficient Two-Stage Gun Detection in Video | | 0
OSCAR: Object Status and Contextual Awareness for Recipes to Support Non-Visual Cooking | | 0
2D Object Detection: A Survey | | 0
DecoupledGaussian: Object-Scene Decoupling for Physics-Based Interaction | | 0
Teach YOLO to Remember: A Self-Distillation Approach for Continual Object Detection | | 0
Energy-Guided Optimization for Personalized Image Editing with Pretrained Text-to-Image Diffusion Models | | 0
High-Precision Transformer-Based Visual Servoing for Humanoid Robots in Aligning Tiny Objects | | 0
ReynoldsFlow: Exquisite Flow Estimation via Reynolds Transport Theorem | Code | 0
Learning Object Placement Programs for Indoor Scene Synthesis with Iterative Self Training | | 0
Shaken, Not Stirred: A Novel Dataset for Visual Understanding of Glasses in Human-Robot Bartending Tasks | | 0
Fine-Tuning Florence2 for Enhanced Object Detection in Un-constructed Environments: Vision-Language Model Approach | | 0
Afford-X: Generalizable and Slim Affordance Reasoning for Task-oriented Manipulation | | 0
L2RDaS: Synthesizing 4D Radar Tensors for Model Generalization via Dataset Expansion | | 0
Simulation-Based Performance Evaluation of 3D Object Detection Methods with Deep Learning for a LiDAR Point Cloud Dataset in a SOTIF-related Use Case | Code | 0
Active 6D Pose Estimation for Textureless Objects using Multi-View RGB Frames | | 0
Towards Visual Discrimination and Reasoning of Real-World Physical Dynamics: Physics-Grounded Anomaly Detection | | 0
BEVMOSNet: Multimodal Fusion for BEV Moving Object Segmentation | | 0
A dataset-free approach for self-supervised learning of 3D reflectional symmetries | | 0
MonoLite3D: Lightweight 3D Object Properties Estimation | | 0
ClipGrader: Leveraging Vision-Language Models for Robust Label Quality Assessment in Object Detection | | 0
Category-level Meta-learned NeRF Priors for Efficient Object Mapping | | 0
VideoHandles: Editing 3D Object Compositions in Videos Using Video Generative Priors | | 0
Object-Aware Video Matting with Cross-Frame Guidance | | 0
Language-Guided Object Search in Agricultural Environments | | 0
AirRoom: Objects Matter in Room Reidentification | | 0
AI-Driven Relocation Tracking in Dynamic Kitchen Environments | Code | 0
EigenActor: Variant Body-Object Interaction Generation Evolved from Invariant Action Basis Reasoning | | 0
Taming Large Multimodal Agents for Ultra-low Bitrate Semantically Disentangled Image Compression | Code | 0
Enhancing deep neural networks through complex-valued representations and Kuramoto synchronization dynamics | | 0
Towards Semantic 3D Hand-Object Interaction Generation via Functional Text Guidance | | 0
QORT-Former: Query-optimized Real-time Transformer for Understanding Two Hands Manipulating Objects | | 0
Page 57 of 214

No leaderboard results yet.