| Structured Object Language Modeling (SoLM): Native Structured Objects Generation Conforming to Complex Schemas with Self-Supervised Denoising | Nov 28, 2024 | Denoising, Language Modeling | Unverified | 0 |
| Detailed Object Description with Controllable Dimensions | Nov 28, 2024 | Object | Code Available | 0 |
| Semi-Supervised Neural Processes for Articulated Object Interactions | Nov 28, 2024 | Object | Unverified | 0 |
| ObjectRelator: Enabling Cross-View Object Relation Understanding in Ego-Centric and Exo-Centric Videos | Nov 28, 2024 | Object, Object Localization | Unverified | 0 |
| VLM-HOI: Vision Language Models for Interpretable Human-Object Interaction Analysis | Nov 27, 2024 | Human-Object Interaction Detection, Image-text matching | Unverified | 0 |
| Optimizing Multispectral Object Detection: A Bag of Tricks and Comprehensive Benchmarks | Nov 27, 2024 | Multispectral Object Detection, Object | Unverified | 0 |
| G3Flow: Generative 3D Semantic Flow for Pose-aware and Generalizable Object Manipulation | Nov 27, 2024 | Imitation Learning, Object | Code Available | 0 |
| OPCap: Object-aware Prompting Captioning | Nov 27, 2024 | Attribute, Decoder | Unverified | 0 |
| A comparison of extended object tracking with multi-modal sensors in indoor environment | Nov 27, 2024 | Object, Object Tracking | Unverified | 0 |
| Exploring Aleatoric Uncertainty in Object Detection via Vision Foundation Models | Nov 26, 2024 | Object, object-detection | Unverified | 0 |
| Adversarial Bounding Boxes Generation (ABBG) Attack against Visual Object Trackers | Nov 26, 2024 | Object | Code Available | 0 |
| Distilling Spectral Graph for Object-Context Aware Open-Vocabulary Semantic Segmentation | Nov 26, 2024 | Object, Open Vocabulary Semantic Segmentation | Unverified | 0 |
| Box for Mask and Mask for Box: weak losses for multi-task partially supervised learning | Nov 26, 2024 | Object, object-detection | Code Available | 0 |
| AnchorCrafter: Animate CyberAnchors Saling Your Products via Human-Object Interacting Video Generation | Nov 26, 2024 | Human-Object Interaction Detection, Object | Unverified | 0 |
| GMFlow: Global Motion-Guided Recurrent Flow for 6D Object Pose Estimation | Nov 26, 2024 | 6D Pose Estimation using RGB, Computational Efficiency | Unverified | 0 |
| On-Road Object Importance Estimation: A New Dataset and A Model with Multi-Fold Top-Down Guidance | Nov 26, 2024 | Object | Unverified | 0 |
| Object-centric proto-symbolic behavioural reasoning from pixels | Nov 26, 2024 | continuous-control, Continuous Control | Code Available | 0 |
| Online Episodic Memory Visual Query Localization with Egocentric Streaming Object Memory | Nov 25, 2024 | Object, object-detection | Unverified | 0 |
| Leverage Task Context for Object Affordance Ranking | Nov 25, 2024 | Object, object-detection | Unverified | 0 |
| Hyperspectral Image Cross-Domain Object Detection Method based on Spectral-Spatial Feature Alignment | Nov 25, 2024 | Object, object-detection | Unverified | 0 |
| CIA: Controllable Image Augmentation Framework Based on Stable Diffusion | Nov 25, 2024 | Image Augmentation, Object | Code Available | 0 |
| Leveraging Foundation Models To learn the shape of semi-fluid deformable objects | Nov 25, 2024 | Knowledge Distillation, Object | Unverified | 0 |
| UNOPose: Unseen Object Pose Estimation with an Unposed RGB-D Reference Image | Nov 25, 2024 | Object, Pose Estimation | Unverified | 0 |
| Diffusion Features for Zero-Shot 6DoF Object Pose Estimation | Nov 25, 2024 | Object, Pose Estimation | Code Available | 0 |
| Open-Vocabulary Octree-Graph for 3D Scene Understanding | Nov 25, 2024 | Object, Scene Understanding | Unverified | 0 |
| Boosting 3D Object Generation through PBR Materials | Nov 25, 2024 | Object | Unverified | 0 |
| VideoOrion: Tokenizing Object Dynamics in Videos | Nov 25, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| FastTrackTr: Towards Fast Multi-Object Tracking with Transformers | Nov 24, 2024 | Multi-Object Tracking, Object | Unverified | 0 |
| Twin Trigger Generative Networks for Backdoor Attacks against Object Detection | Nov 23, 2024 | image-classification, Image Classification | Unverified | 0 |
| Fine-Grained Open-Vocabulary Object Recognition via User-Guided Segmentation | Nov 23, 2024 | Object, object-detection | Unverified | 0 |
| ICT: Image-Object Cross-Level Trusted Intervention for Mitigating Object Hallucination in Large Vision-Language Models | Nov 22, 2024 | Hallucination, Object | Unverified | 0 |
| Instance-Aware Generalized Referring Expression Segmentation | Nov 22, 2024 | Generalized Referring Expression Segmentation, Object | Unverified | 0 |
| A Real-Time DETR Approach to Bangladesh Road Object Detection for Autonomous Vehicles | Nov 22, 2024 | Autonomous Vehicles, Object | Unverified | 0 |
| SEMPose: A Single End-to-end Network for Multi-object Pose Estimation | Nov 21, 2024 | Object, Pose Estimation | Unverified | 0 |
| Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation and Reconstruction | Nov 21, 2024 | 3D Generation, GPU | Unverified | 0 |
| Enhancing Thermal MOT: A Novel Box Association Method Leveraging Thermal Identity and Motion Similarity | Nov 20, 2024 | Multiple Object Tracking, Object | Code Available | 0 |
| ClickTrack: Towards Real-time Interactive Single Object Tracking | Nov 20, 2024 | Object, Object Tracking | Unverified | 0 |
| YCB-LUMA: YCB Object Dataset with Luminance Keying for Object Localization | Nov 20, 2024 | 2D Object Detection, Autonomous Driving | Code Available | 0 |
| Text-guided Zero-Shot Object Localization | Nov 18, 2024 | Object, Object Localization | Unverified | 0 |
| Radio Frequency Ray Tracing with Neural Object Representation | Nov 16, 2024 | Object | Unverified | 0 |
| Generating Compositional Scenes via Text-to-image RGBA Instance Generation | Nov 16, 2024 | Object, Prompt Engineering | Unverified | 0 |
| Real-Time AI-Driven People Tracking and Counting Using Overhead Cameras | Nov 15, 2024 | energy management, Management | Unverified | 0 |
| ColorEdit: Training-free Image-Guided Color editing with diffusion model | Nov 15, 2024 | Attribute, Denoising | Unverified | 0 |
| Structure Tensor Representation for Robust Oriented Object Detection | Nov 15, 2024 | Object, object-detection | Unverified | 0 |
| Visual-Linguistic Agent: Towards Collaborative Contextual Object Reasoning | Nov 15, 2024 | Descriptive, Object | Unverified | 0 |
| Dynamic Reconstruction of Hand-Object Interaction with Distributed Force-aware Contact Representation | Nov 14, 2024 | Dynamic Reconstruction, Object | Unverified | 0 |
| Long-Tailed Object Detection Pre-training: Dynamic Rebalancing Contrastive Learning with Dual Reconstruction | Nov 14, 2024 | Contrastive Learning, Long-tailed Object Detection | Unverified | 0 |
| LEAP:D - A Novel Prompt-based Approach for Domain-Generalized Aerial Object Detection | Nov 14, 2024 | Object, object-detection | Unverified | 0 |
| Image Processing for Motion Magnification | Nov 14, 2024 | Motion Magnification, Object | Unverified | 0 |
| Multimodal Object Detection using Depth and Image Data for Manufacturing Parts | Nov 13, 2024 | Object, object-detection | Unverified | 0 |