| Fashion Object Detection for Tops & Bottoms | May 29, 2023 | Object, object-detection | Unverified | 0 |
| PaLI-X: On Scaling up a Multilingual Vision and Language Model | May 29, 2023 | Chart Question Answering, document understanding | Code Available | 1 |
| ZeroPose: CAD-Prompted Zero-shot Object 6D Pose Estimation in Cluttered Scenes | May 29, 2023 | 6D Pose Estimation, Instance Segmentation | Unverified | 0 |
| Contextual Object Detection with Multimodal Large Language Models | May 29, 2023 | Cloze Test, Decoder | Code Available | 2 |
| VCVW-3D: A Virtual Construction Vehicles and Workers Dataset with 3D Annotations | May 29, 2023 | 3D Object Detection, Monocular 3D Object Detection | Code Available | 0 |
| Pix2Repair: Implicit Shape Restoration from Images | May 29, 2023 | Object | Unverified | 0 |
| CamoDiffusion: Camouflaged Object Detection via Conditional Diffusion Models | May 29, 2023 | Denoising, Object | Code Available | 1 |
| Z-GMOT: Zero-shot Generic Multiple Object Tracking | May 28, 2023 | Multi-Object Tracking, Multiple Object Tracking | Unverified | 0 |
| Counter-Hypothetical Particle Filters for Single Object Pose Tracking | May 28, 2023 | 6D Pose Estimation, Object | Unverified | 0 |
| Lighting and Rotation Invariant Real-time Vehicle Wheel Detector based on YOLOv5 | May 28, 2023 | Object, object-detection | Code Available | 1 |
| NeurOCS: Neural NOCS Supervision for Monocular 3D Object Localization | May 28, 2023 | Monocular 3D Object Localization, Object | Unverified | 0 |
| Real-time Object Detection: YOLOv1 Re-Implementation in PyTorch | May 28, 2023 | Object, object-detection | Unverified | 0 |
| NeRO: Neural Geometry and BRDF Reconstruction of Reflective Objects from Multiview Images | May 27, 2023 | Neural Rendering, Object | Code Available | 2 |
| Self-Supervised Learning of Action Affordances as Interaction Modes | May 27, 2023 | Object, Self-Supervised Learning | Unverified | 0 |
| On the Importance of Backbone to the Adversarial Robustness of Object Detectors | May 27, 2023 | Adversarial Robustness, Autonomous Driving | Code Available | 0 |
| DeepSeaNet: Improving Underwater Object Detection using EfficientDet | May 26, 2023 | Object, object-detection | Unverified | 0 |
| Generalizable Pose Estimation Using Implicit Scene Representations | May 26, 2023 | Density Estimation, Object | Code Available | 0 |
| FSD: Fully-Specialized Detector via Neural Architecture Search | May 26, 2023 | Lesion Detection, Neural Architecture Search | Unverified | 0 |
| How To Not Train Your Dragon: Training-free Embodied Object Goal Navigation with Semantic Frontiers | May 26, 2023 | Imitation Learning, Navigate | Unverified | 0 |
| Linear Object Detection in Document Images using Multiple Object Tracking | May 26, 2023 | Instance Segmentation, Multiple Object Tracking | Unverified | 0 |
| Structured Latent Variable Models for Articulated Object Interaction | May 26, 2023 | Object | Unverified | 0 |
| SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation | May 26, 2023 | cross-modal alignment, Object | Code Available | 1 |
| Are Deep Neural Networks Adequate Behavioural Models of Human Visual Perception? | May 26, 2023 | Object, Object Recognition | Unverified | 0 |
| Guided Attention for Next Active Object @ EGO4D STA Challenge | May 25, 2023 | Object, Short-term Object Interaction Anticipation | Code Available | 0 |
| Multimodal Relation Extraction with Cross-Modal Retrieval and Synthesis | May 25, 2023 | Cross-Modal Retrieval, Object | Unverified | 0 |
| Confronting Ambiguity in 6D Object Pose Estimation via Score-Based Diffusion on SE(3) | May 25, 2023 | 6D Pose Estimation using RGB, Computational Efficiency | Code Available | 1 |
| CommonScenes: Generating Commonsense 3D Indoor Scenes with Scene Graph Diffusion | May 25, 2023 | Diversity, Object | Code Available | 1 |
| Learning Occupancy for Monocular 3D Object Detection | May 25, 2023 | 3D Object Detection, Monocular 3D Object Detection | Code Available | 1 |
| Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation | May 25, 2023 | Object, Referring Expression Segmentation | Code Available | 1 |
| POPE: 6-DoF Promptable Pose Estimation of Any Object, in Any Scene, with One Reference | May 25, 2023 | 3D geometry, Object | Code Available | 1 |
| Camera-Incremental Object Re-Identification with Identity Knowledge Evolution | May 25, 2023 | Knowledge Distillation, Object | Code Available | 0 |
| Look Ma, No Hands! Agent-Environment Factorization of Egocentric Videos | May 25, 2023 | 3D Reconstruction, Object | Unverified | 0 |
| NAP: Neural 3D Articulation Prior | May 25, 2023 | 3D Generation, Denoising | Code Available | 1 |
| Leveraging object detection for the identification of lung cancer | May 25, 2023 | Computational Efficiency, Medical Image Analysis | Unverified | 0 |
| Contrastive Training of Complex-Valued Autoencoders for Object Discovery | May 24, 2023 | Contrastive Learning, Object | Code Available | 0 |
| Realistically distributing object placements in synthetic training data improves the performance of vision-based object detection models | May 24, 2023 | Object, object-detection | Code Available | 0 |
| Streaming Object Detection on Fisheye Cameras for Automatic Parking | May 24, 2023 | Instance Segmentation, Object | Unverified | 0 |
| Semi-Supervised and Long-Tailed Object Detection with CascadeMatch | May 24, 2023 | Long-tailed Object Detection, Object | Code Available | 0 |
| Text encoders bottleneck compositionality in contrastive vision-language models | May 24, 2023 | Attribute, Image Captioning | Code Available | 1 |
| Learning high-level visual representations from a child's perspective without strong inductive biases | May 24, 2023 | Object, Object Localization | Code Available | 1 |
| GRILL: Grounded Vision-language Pre-training via Aligning Text and Image Regions | May 24, 2023 | Object, Question Answering | Unverified | 0 |
| NOVUM: Neural Object Volumes for Robust Object Classification | May 24, 2023 | Classification, image-classification | Code Available | 0 |
| DC-Net: Divide-and-Conquer for Salient Object Detection | May 24, 2023 | Decoder, Object | Code Available | 1 |
| A Study on Deep CNN Structures for Defect Detection From Laser Ultrasonic Visualization Testing Images | May 23, 2023 | Defect Detection, Object | Unverified | 0 |
| Integrated Object Deformation and Contact Patch Estimation from Visuo-Tactile Feedback | May 23, 2023 | Object | Unverified | 0 |
| Siamese Masked Autoencoders | May 23, 2023 | Data Augmentation, Decoder | Unverified | 0 |
| Grounding and Distinguishing Conceptual Vocabulary Through Similarity Learning in Embodied Simulations | May 23, 2023 | Attribute, Object | Unverified | 0 |
| Provably Learning Object-Centric Representations | May 23, 2023 | Object, Representation Learning | Unverified | 0 |
| Achieving Efficient and Realistic Full-Radar Simulations and Automatic Data Annotation by exploiting Ray Meta Data of a Radar Ray Tracing Simulator | May 23, 2023 | Object | Unverified | 0 |
| Learning Remote Sensing Object Detection with Single Point Supervision | May 23, 2023 | Object, object-detection | Code Available | 1 |