SOTAVerified

Object

Replace the cat with a British Shorthair cat of the breed with bulging yellow eyes

Papers

Showing 22512300 of 10696 papers

TitleStatusHype
I'M HOI: Inertia-aware Monocular Capture of 3D Human-Object Interactions0
The Quest for an Integrated Set of Neural Mechanisms Underlying Object Recognition in Primates0
Correcting Diffusion Generation through ResamplingCode1
Open World Object Detection in the Era of Foundation Models0
InteractDiffusion: Interaction Control in Text-to-Image Diffusion ModelsCode1
3D Copy-Paste: Physically Plausible Object Insertion for Monocular 3D DetectionCode1
Towards Enhanced Image Inpainting: Mitigating Unwanted Object Insertion and Preserving Color ConsistencyCode1
SmartMask: Context Aware High-Fidelity Mask Generation for Fine-grained Object Insertion and Layout Control0
Benchmarking and Analysis of Unsupervised Object Segmentation from Real-world Single ImagesCode1
Correspondences of the Third Kind: Camera Pose Estimation from Object Reflection0
PhysHOI: Physics-Based Imitation of Dynamic Human-Object Interaction0
Natural-language-driven Simulation Benchmark and Copilot for Efficient Production of Object Interactions in Virtual Road Scenes0
Gen2Det: Generate to Detect0
TeMO: Towards Text-Driven 3D Stylization for Multi-Object Meshes0
High Pileup Particle Tracking with Object CondensationCode1
Controllable Human-Object Interaction Synthesis0
SurfaceAug: Closing the Gap in Multimodal Ground Truth Sampling0
Inpaint3D: 3D Scene Content Generation using 2D Inpainting Diffusion0
Automated Multimodal Data Annotation via Calibration With Indoor Positioning System0
TokenCompose: Text-to-Image Diffusion with Token-level SupervisionCode1
A Task is Worth One Word: Learning with Task Prompts for High-Quality Versatile Image InpaintingCode0
Texture-Semantic Collaboration Network for ORSI Salient Object DetectionCode0
MotionCtrl: A Unified and Flexible Motion Controller for Video GenerationCode3
Low-shot Object Learning with Mutual Exclusivity BiasCode0
DreamComposer: Controllable 3D Object Generation via Multi-View ConditionsCode1
Mitigating Open-Vocabulary Caption HallucinationsCode1
Building Category Graphs Representation with Spatial and Temporal Attention for Visual Navigation0
Boosting Segment Anything Model Towards Open-Vocabulary LearningCode1
DiffusionAtlas: High-Fidelity Consistent Diffusion Video Editing0
ScAR: Scaling Adversarial Robustness for LiDAR Object DetectionCode0
ZeroReg: Zero-Shot Point Cloud Registration with Foundation Models0
Are Vision Transformers More Data Hungry Than Newborn Visual Systems?Code0
SAM-Assisted Remote Sensing Imagery Semantic Segmentation with Object and Boundary ConstraintsCode2
Diffusion-SS3D: Diffusion Model for Semi-supervised 3D Object DetectionCode1
RotaTR: Detection Transformer for Dense and Rotated Object0
MANUS: Markerless Grasp Capture using Articulated 3D Gaussians0
Mitigating Fine-Grained Hallucination by Fine-Tuning Large Vision-Language Models with Caption RewritesCode1
Object Recognition as Next Token PredictionCode1
Light Field Imaging in the Restrictive Object Space based on Flexible Angular Plane0
Adaptive Confidence Threshold for ByteTrack in Multi-Object TrackingCode0
BEVNeXt: Reviving Dense BEV Frameworks for 3D Object DetectionCode1
Aligning and Prompting Everything All at Once for Universal Visual PerceptionCode2
SANeRF-HQ: Segment Anything for NeRF in High Quality0
SAGE: Bridging Semantic and Actionable Parts for GEneralizable Manipulation of Articulated Objects0
ViVid-1-to-3: Novel View Synthesis with Video Diffusion Models0
Toward Improving Robustness of Object Detectors Against Domain ShiftCode1
Neural Parametric Gaussians for Monocular Non-Rigid Object Reconstruction0
Diffusion Handles: Enabling 3D Edits for Diffusion Models by Lifting Activations to 3D0
ImageDream: Image-Prompt Multi-view Diffusion for 3D GenerationCode2
FreeZe: Training-free zero-shot 6D pose estimation with geometric and vision foundation models0
Show:102550
← PrevPage 46 of 214Next →

No leaderboard results yet.