Human-Object Interaction Detection

Human-Object Interaction (HOI) detection is a task of identifying "a set of interactions" in an image, which involves the i) localization of the subject (i.e., humans) and target (i.e., objects) of interaction, and ii) the classification of the interaction labels.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 401–449 of 449 papers

Title	Date	Tasks	Status
EgoGaussian: Dynamic Scene Understanding from Egocentric Video with 3D Gaussian Splatting	Jun 28, 2024	Human-Object Interaction DetectionObject	—Unverified
EigenActor: Variant Body-Object Interaction Generation Evolved from Invariant Action Basis Reasoning	Mar 1, 2025	Human-Object Interaction DetectionObject	—Unverified
Simultaneous Joint and Object Trajectory Templates for Human Activity Recognition from 3-D Data	Nov 5, 2017	Activity RecognitionHuman Activity Recognition	—Unverified
End-to-End HOI Reconstruction Transformer with Graph-based Encoding	Mar 8, 2025	Human-Object Interaction Detection	—Unverified
Skeleton-Based Action Recognition with Spatial Reasoning and Temporal Stack Learning	May 7, 2018	Action RecognitionGraph Neural Network	—Unverified
Enhancing Human-Centered Dynamic Scene Understanding via Multiple LLMs Collaborated Reasoning	Mar 15, 2024	Autonomous DrivingHuman-Object Interaction Detection	—Unverified
ENIGMA-51: Towards a Fine-Grained Understanding of Human-Object Interactions in Industrial Scenarios	Sep 26, 2023	Action DetectionHuman-Object Interaction Detection	—Unverified
Estimating 3D Motion and Forces of Human-Object Interactions from Internet Videos	Nov 2, 2021	Human-Object Interaction DetectionObject	—Unverified
Expansion-Squeeze-Excitation Fusion Network for Elderly Activity Recognition	Dec 21, 2021	Action RecognitionActivity Recognition	—Unverified
TMHOI: Translational Model for Human-Object Interaction Detection	Mar 7, 2023	Computational EfficiencyHuman-Object Interaction Detection	—Unverified
A Comprehensive Methodological Survey of Human Activity Recognition Across Divers Data Modalities	Sep 15, 2024	Action DetectionActivity Detection	—Unverified
Exploring Interactive Semantic Alignment for Efficient HOI Detection with Vision-language Model	Apr 19, 2024	Human-Object Interaction DetectionLanguage Modeling	—Unverified
Exploring Pose-Aware Human-Object Interaction via Hybrid Learning	Jan 1, 2024	Human-Object Interaction DetectionObject	—Unverified
3D Human Interaction Generation: A Survey	Mar 17, 2025	Human-Object Interaction DetectionMotion Generation	—Unverified
Exploring Self- and Cross-Triplet Correlations for Human-Object Interaction Detection	Jan 11, 2024	Human-Object Interaction DetectionKnowledge Distillation	—Unverified
Exploring State Change Capture of Heterogeneous Backbones @ Ego4D Hands and Objects Challenge 2022	Nov 16, 2022	Human-Object Interaction DetectionObject	—Unverified
Spatial Parsing and Dynamic Temporal Pooling networks for Human-Object Interaction detection	Jun 7, 2022	Human-Object Interaction DetectionObject	—Unverified
Spatial Priming for Detecting Human-Object Interactions	Apr 9, 2020	Human-Object Interaction DetectionObject	—Unverified
Eye Gaze as a Signal for Conveying User Attention in Contextual AI Systems	Jan 23, 2025	FrictionHuman-Object Interaction Detection	—Unverified
Few-Shot Learning from Augmented Label-Uncertain Queries in Bongard-HOI	Dec 17, 2023	DiversityFew-Shot Learning	—Unverified
Spatio-Temporal Interaction Graph Parsing Networks for Human-Object Interaction Recognition	Aug 19, 2021	Human-Object Interaction DetectionObject	—Unverified
F-HOI: Toward Fine-grained Semantic-Aligned 3D Human-Object Interactions	Jul 17, 2024	Human-Object Interaction DetectionLanguage Modelling	—Unverified
Fine-grained Event Learning of Human-Object Interaction with LSTM-CRF	Sep 30, 2017	General ClassificationHuman-Object Interaction Detection	—Unverified
First Person Action-Object Detection with EgoNet	Mar 15, 2016	Human-Object Interaction DetectionObject	—Unverified
Interaction Replica: Tracking Human-Object Interaction and Scene Changes From Human Motion	May 5, 2022	Human-Object Interaction DetectionObject	—Unverified
FORCE: Physics-aware Human-object Interaction	Mar 17, 2024	DiversityFriction	—Unverified
ForcePose: A Deep Learning Approach for Force Calculation Based on Action Recognition Using MediaPipe Pose Estimation Combined with Object Detection	Mar 28, 2025	Action RecognitionHuman-Object Interaction Detection	—Unverified
FreeA: Human-object Interaction Detection using Free Annotation Labels	Mar 4, 2024	Human-Object Interaction DetectionObject	—Unverified
From Category to Scenery: An End-to-End Framework for Multi-Person Human-Object Interaction Recognition in Videos	Jul 1, 2024	Human-Object Interaction Detection	—Unverified
From Infants to AI: Incorporating Infant-like Learning in Models Boosts Efficiency and Generalization in Learning Social Prediction Tasks	Mar 5, 2025	Human-Object Interaction Detection	—Unverified
Visual Object Tracking in First Person Vision	Sep 27, 2022	Human-Object Interaction DetectionObject	—Unverified
Functional 3D Scene Synthesis through Human-Scene Optimization	Feb 5, 2025	Human-Object Interaction DetectionScene Generation	—Unverified
GASPACHO: Gaussian Splatting for Controllable Humans and Objects	Mar 12, 2025	Human-Object Interaction DetectionObject	—Unverified
Generalized Visual Relation Detection with Diffusion Models	Apr 16, 2025	Graph GenerationHuman-Object Interaction Detection	—Unverified
Generating Human-Centric Visual Cues for Human-Object Interaction Detection via Large Vision-Language Models	Nov 26, 2023	Human-Object Interaction Detection	—Unverified
Generating Human Interaction Motions in Scenes with Text Control	Apr 16, 2024	DenoisingHuman-Object Interaction Detection	—Unverified
What to look at and where: Semantic and Spatial Refined Transformer for detecting human-object interactions	Apr 2, 2022	DecoderHuman-Object Interaction Detection	—Unverified
VLM-HOI: Vision Language Models for Interpretable Human-Object Interaction Analysis	Nov 27, 2024	Human-Object Interaction DetectionImage-text matching	—Unverified
Geometric Visual Fusion Graph Neural Networks for Multi-Person Human-Object Interaction Recognition in Videos	Jun 3, 2025	Graph LearningGraph Neural Network	—Unverified
GID-Net: Detecting Human-Object Interaction with Global and Instance Dependency	Mar 11, 2020	Human-Object Interaction DetectionObject	—Unverified
3DArticCyclists: Generating Synthetic Articulated 8D Pose-Controllable Cyclist Data for Computer Vision Applications	Oct 14, 2024	3DGS3D Reconstruction	—Unverified
SyncDiff: Synchronized Motion Diffusion for Multi-Body Human-Object Interaction Synthesis	Dec 28, 2024	Human AnimationHuman-Object Interaction Detection	—Unverified
GLOVER: Generalizable Open-Vocabulary Affordance Reasoning for Task-Oriented Grasping	Nov 19, 2024	Common Sense ReasoningHuman-Object Interaction Detection	—Unverified
Graphing the Future: Activity and Next Active Object Prediction using Graph-based Activity Representations	Sep 12, 2022	Graph MatchingHuman-Object Interaction Detection	—Unverified
GraspDiffusion: Synthesizing Realistic Whole-body Hand-Object Interaction	Oct 17, 2024	Human-Object Interaction DetectionImage Generation	—Unverified
Gravity-Aware Monocular 3D Human-Object Reconstruction	Aug 19, 2021	Human-Object Interaction DetectionObject	—Unverified
Synthesizing Diverse Human Motions in 3D Indoor Scenes	May 21, 2023	Collision AvoidanceHuman-Object Interaction Detection	—Unverified
Grounded Human-Object Interaction Hotspots from Video (Extended Abstract)	Jun 3, 2019	Human-Object Interaction DetectionObject	—Unverified
Group Activity Recognition via Dynamic Composition and Interaction	May 9, 2023	Activity RecognitionGroup Activity Recognition	—Unverified

Show:10 25 50

← PrevPage 9 of 9Next →

All datasets HICO-DET V-COCO HICO VidHOI Ambiguious-HOI MECCANO

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Ours (PViC+)	mAP	46.49	—	Unverified
2	RLIPv2 (Swin-L)	mAP	45.09	—	Unverified
3	PViC-SwinL	mAP	44.32	—	Unverified
4	SOV-STG (Swin-L)	mAP	43.35	—	Unverified
5	DiffHOI	mAP	41.5	—	Unverified
6	ViPLO	mAP	37.22	—	Unverified
7	FGAHOI	mAP	37.18	—	Unverified
8	ERNet	mAP	36.89	—	Unverified
9	CQL+GEN-VLKT-L	mAP	36.03	—	Unverified
10	QAHOI (Swin-L)	mAP	35.78	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	RLIPv2	AP(S1)	72.1	—	Unverified
2	MUREN	AP(S1)	68.8	—	Unverified
3	STIP	AP(S1)	66	—	Unverified
4	DiffHOI	AP(S1)	65.7	—	Unverified
5	OCN (ResNet101)	AP(S1)	65.3	—	Unverified
6	OCN (ResNet50)	AP(S1)	64.2	—	Unverified
7	CDN (ResNet101)	AP(S1)	63.91	—	Unverified
8	HOICLIP	AP(S1)	63.5	—	Unverified
9	QPIC + CPC	MAP	63.1	—	Unverified
10	Body Part Interactiveness	AP(S1)	63	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DEFR	mAP	65.6	—	Unverified
2	HAKE	mAP	47.1	—	Unverified
3	PaStaNet	mAP	46.3	—	Unverified
4	RelViT	mAP	43.98	—	Unverified
5	Pairwise-Part	mAP	39.9	—	Unverified
6	Mallya & Lazebnik	mAP	36.1	—	Unverified
7	Girdhar & Ramanan	mAP	34.6	—	Unverified
8	R*CNN	mAP	28.5	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	HOI4ABOT	Detection: Full ([email protected])	11.12	—	Unverified
2	ST-GAZE	Detection: Full ([email protected])	10.4	—	Unverified
3	STTRAN	Detection: Full ([email protected])	7.61	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DJ-RN	mAP	10.37	—	Unverified
2	iCAN	mAP	8.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SlowFast + FasterRCNN	[email protected] role	25.93	—	Unverified