Human-Object Interaction Detection

Human-Object Interaction (HOI) detection is the task of identifying the set of interactions in an image. It involves (i) localizing the subject (a human) and the target (an object) of each interaction, and (ii) classifying the interaction label.
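As a rough illustration of the two parts of the task, the sketch below represents one interaction as a human box, an object box, and an interaction label, and checks a prediction against a ground-truth instance. The class and function names, the (x1, y1, x2, y2) box convention, and the matching rule (same labels, with both boxes overlapping the ground truth at IoU of at least 0.5) are assumptions chosen for illustration. They follow a common convention for this kind of evaluation but are not taken from any specific benchmark's code.

```python
from dataclasses import dataclass
from typing import Tuple

# Assumed box convention: (x1, y1, x2, y2) in pixels.
Box = Tuple[float, float, float, float]


@dataclass(frozen=True)
class HOIInstance:
    """One interaction: where the human and object are, and what they do."""
    human_box: Box
    object_box: Box
    object_label: str   # e.g. "bicycle"
    interaction: str    # e.g. "ride"


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def matches(pred: HOIInstance, gt: HOIInstance, thresh: float = 0.5) -> bool:
    """Assumed rule: labels agree and both boxes reach the IoU threshold."""
    return (
        pred.interaction == gt.interaction
        and pred.object_label == gt.object_label
        and iou(pred.human_box, gt.human_box) >= thresh
        and iou(pred.object_box, gt.object_box) >= thresh
    )


# Hypothetical example: a person riding a bicycle.
gt = HOIInstance((10, 10, 110, 210), (60, 120, 200, 220), "bicycle", "ride")
pred = HOIInstance((12, 8, 108, 205), (65, 118, 198, 225), "bicycle", "ride")
print(matches(pred, gt))
```

Requiring both boxes to overlap, rather than only one, reflects that a correct interaction label attached to the wrong person or the wrong object is still a wrong detection.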

Papers

Showing 101–150 of 449 papers

Title	Date	Tasks	Status	Hype	Score
Mining Cross-Person Cues for Body-Part Interactiveness Learning in HOI Detection	Jul 28, 2022	Human-Object Interaction Detection	Code Available	1	5
Affordance Transfer Learning for Human-Object Interaction Detection	Apr 7, 2021	Affordance Detection, Affordance Recognition	Code Available	1	5
Polysemy Deciphering Network for Robust Human-Object Interaction Detection	Aug 7, 2020	Human-Object Interaction Detection, Object	Code Available	1	5
Perceiving 3D Human-Object Spatial Arrangements from a Single Image in the Wild	Jul 30, 2020	3D Human Pose Estimation, 3D Human Reconstruction	Code Available	1	5
Polysemy Deciphering Network for Human-Object Interaction Detection	Aug 1, 2020	Human-Object Interaction Detection, Object	Code Available	1	5
Person in Place: Generating Associative Skeleton-Guidance Maps for Human-Object Interaction Image Editing	Jan 1, 2024	Human-Object Interaction Detection, Object	Code Available	1	5
BEHAVE: Dataset and Method for Tracking Human Object Interactions	Apr 14, 2022	3D Human Reconstruction, 3D Object Reconstruction	Code Available	1	5
DRG: Dual Relation Graph for Human-Object Interaction Detection	Aug 26, 2020	Human-Object Interaction Detection, Object	Code Available	1	5
Pose-based Modular Network for Human-Object Interaction Detection	Aug 5, 2020	Human-Object Interaction Detection, Object	Code Available	1	5
PoseGPT: Quantization-based 3D Human Motion Generation and Forecasting	Oct 19, 2022	Human-Object Interaction Detection, Motion Generation	Code Available	1	5
Relational Context Learning for Human-Object Interaction Detection	Apr 11, 2023	Decoder, Human-Object Interaction Detection	Code Available	1	5
Dynamics-Regulated Kinematic Policy for Egocentric Pose Estimation	Jun 10, 2021	Egocentric Pose Estimation, Human-Object Interaction Detection	Code Available	1	5
Transferable Interactiveness Knowledge for Human-Object Interaction Detection	Nov 20, 2018	Human-Object Interaction Detection, Object	Code Available	1	5
Efficient Adaptive Human-Object Interaction Detection with Concept-guided Memory	Sep 7, 2023	Human-Object Interaction Detection, Retrieval	Code Available	1	5
ConsNet: Learning Consistency Graph for Zero-Shot Human-Object Interaction Detection	Aug 14, 2020	Human-Object Interaction Detection, Object	Code Available	1	5
RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning	Apr 24, 2022	Human-Object Interaction Detection, Object	Code Available	1	5
Efficient Two-Stage Detection of Human-Object Interactions with a Novel Unary-Pairwise Transformer	Dec 3, 2021	GPU, Human-Object Interaction Detection	Code Available	1	5
RLIP: Relational Language-Image Pre-training for Human-Object Interaction Detection	Sep 5, 2022	Human-Object Interaction Detection, Relation	Code Available	1	5
Visual Compositional Learning for Human-Object Interaction Detection	Jul 24, 2020	Affordance Recognition, Human-Object Interaction Detection	Code Available	1	5
In Defense of Scene Graphs for Image Captioning	Feb 9, 2021	Human-Object Interaction Detection, Image Captioning	Code Available	1	5
Weakly-Supervised Affordance Grounding Guided by Part-Level Semantic Priors	May 30, 2025	Human-Object Interaction Detection, Semantic Segmentation	Code Available	1	5
EgoPlan-Bench: Benchmarking Multimodal Large Language Models for Human-Level Planning	Dec 11, 2023	Benchmarking, Human-Object Interaction Detection	Code Available	1	5
FGAHOI: Fine-Grained Anchors for Human-Object Interaction Detection	Jan 8, 2023	Human-Object Interaction Detection, Object	Code Available	1	5
Spatially Conditioned Graphs for Detecting Human-Object Interactions	Dec 11, 2020	Human-Object Interaction Detection, Object	Code Available	1	5
End-to-End Human Object Interaction Detection with HOI Transformer	Mar 8, 2021	Human-Object Interaction Detection, object-detection	Code Available	1	5
Bilateral Collaboration with Large Vision-Language Models for Open Vocabulary Human-Object Interaction Detection	Jul 9, 2025	Human-Object Interaction Detection, Large Language Model	Code Available	0	5
SHD360: A Benchmark Dataset for Salient Human Detection in 360° Videos	May 24, 2021	Human Detection, Human-Object Interaction Detection	Code Available	0	5
Contextual Action Recognition with R*CNN	May 5, 2015	Action Recognition, Attribute	Code Available	0	5
Egocentric Human-Object Interaction Detection Exploiting Synthetic Data	Apr 14, 2022	Human-Object Interaction Detection, Object	Code Available	0	5
Efficient Explicit Joint-level Interaction Modeling with Mamba for Text-guided HOI Generation	Mar 29, 2025	Human-Object Interaction Detection, Mamba	Code Available	0	5
Consistency Learning via Decoding Path Augmentation for Transformers in Human Object Interaction Detection	Apr 11, 2022	Human-Object Interaction Detection, object-detection	Code Available	0	5
Rb-PaStaNet: A Few-Shot Human-Object Interaction Detection Based on Rules and Part States	Aug 14, 2020	Human-Object Interaction Detection	Code Available	0	5
Recurrent Space-time Graph Neural Networks	Apr 11, 2019	Action Recognition, Human-Object Interaction Detection	Code Available	0	5
RoHOI: Robustness Benchmark for Human-Object Interaction Detection	Jul 12, 2025	Human-Object Interaction Detection, Object	Code Available	0	5
PPDM: Parallel Point Detection and Matching for Real-time Human-Object Interaction Detection	Dec 30, 2019	GPU, Human-Object Interaction Detection	Code Available	0	5
Pose-aware Multi-level Feature Network for Human Object Interaction Detection	Sep 18, 2019	Human-Object Interaction Detection, Object	Code Available	0	5
PersonaHOI: Effortlessly Improving Face Personalization in Human-Object Interaction Generation	Jan 1, 2025	Human-Object Interaction Detection, Human-Object Interaction Generation	Code Available	0	5
Comparative Analysis of CNN-based Spatiotemporal Reasoning in Videos	Sep 11, 2019	Action Recognition, Human-Object Interaction Detection	Code Available	0	5
PersonaHOI: Effortlessly Improving Personalized Face with Human-Object Interaction Generation	Jan 10, 2025	Human-Object Interaction Detection, Human-Object Interaction Generation	Code Available	0	5
Distance Matters in Human-Object Interaction Detection	Jul 5, 2022	Human-Object Interaction Detection, Object	Code Available	0	5
Object-centric Video Representation for Long-term Action Anticipation	Oct 31, 2023	Action Anticipation, Human-Object Interaction Detection	Code Available	0	5
No-Frills Human-Object Interaction Detection: Factorization, Layout Encodings, and Training Techniques	Nov 14, 2018	Human-Object Interaction Detection, Object	Code Available	0	5
Pairwise Body-Part Attention for Recognizing Human-Object Interactions	Jul 28, 2018	feature selection, Human-Object Interaction Detection	Code Available	0	5
Multiverse Through Deepfakes: The MultiFakeVerse Dataset of Person-Centric Visual and Conceptual Manipulations	Jun 1, 2025	DeepFake Detection, Face Swapping	Code Available	0	5
TED-Net: Dispersal Attention for Perceiving Interaction Region in Indirectly-Contact HOI Detection	Jan 26, 2024	Human-Object Interaction Detection, object-detection	Code Available	0	5
Attentional Pooling for Action Recognition	Nov 4, 2017	Action Recognition, Human-Object Interaction Detection	Code Available	0	5
Learning Human-Object Interactions by Graph Parsing Neural Networks	Aug 23, 2018	Human-Object Interaction Detection, Object	Code Available	0	5
Leverage Interactive Affinity for Affordance Learning	Jan 1, 2023	Human-Object Interaction Detection, Object	Code Available	0	5
A Skeleton-aware Graph Convolutional Network for Human-Object Interaction Detection	Jul 11, 2022	Human-Object Interaction Detection, Object	Code Available	0	5
Interactiveness Field in Human-Object Interactions	Apr 16, 2022	Human-Object Interaction Detection, Object	Code Available	0	5

Datasets: HICO-DET, V-COCO, HICO, VidHOI, Ambiguous-HOI, MECCANO

Benchmark Results

HICO-DET
#	Model	Metric	Claimed	Verified	Status
1	Ours (PViC+)	mAP	46.49	—	Unverified
2	RLIPv2 (Swin-L)	mAP	45.09	—	Unverified
3	PViC-SwinL	mAP	44.32	—	Unverified
4	SOV-STG (Swin-L)	mAP	43.35	—	Unverified
5	DiffHOI	mAP	41.5	—	Unverified
6	ViPLO	mAP	37.22	—	Unverified
7	FGAHOI	mAP	37.18	—	Unverified
8	ERNet	mAP	36.89	—	Unverified
9	CQL+GEN-VLKT-L	mAP	36.03	—	Unverified
10	QAHOI (Swin-L)	mAP	35.78	—	Unverified

V-COCO
#	Model	Metric	Claimed	Verified	Status
1	RLIPv2	AP(S1)	72.1	—	Unverified
2	MUREN	AP(S1)	68.8	—	Unverified
3	STIP	AP(S1)	66	—	Unverified
4	DiffHOI	AP(S1)	65.7	—	Unverified
5	OCN (ResNet101)	AP(S1)	65.3	—	Unverified
6	OCN (ResNet50)	AP(S1)	64.2	—	Unverified
7	CDN (ResNet101)	AP(S1)	63.91	—	Unverified
8	HOICLIP	AP(S1)	63.5	—	Unverified
9	QPIC + CPC	mAP	63.1	—	Unverified
10	Body Part Interactiveness	AP(S1)	63	—	Unverified

HICO
#	Model	Metric	Claimed	Verified	Status
1	DEFR	mAP	65.6	—	Unverified
2	HAKE	mAP	47.1	—	Unverified
3	PaStaNet	mAP	46.3	—	Unverified
4	RelViT	mAP	43.98	—	Unverified
5	Pairwise-Part	mAP	39.9	—	Unverified
6	Mallya & Lazebnik	mAP	36.1	—	Unverified
7	Girdhar & Ramanan	mAP	34.6	—	Unverified
8	R*CNN	mAP	28.5	—	Unverified

VidHOI
#	Model	Metric	Claimed	Verified	Status
1	HOI4ABOT	Detection: Full (mAP@0.5)	11.12	—	Unverified
2	ST-GAZE	Detection: Full (mAP@0.5)	10.4	—	Unverified
3	STTRAN	Detection: Full (mAP@0.5)	7.61	—	Unverified

Ambiguous-HOI
#	Model	Metric	Claimed	Verified	Status
1	DJ-RN	mAP	10.37	—	Unverified
2	iCAN	mAP	8.14	—	Unverified

MECCANO
#	Model	Metric	Claimed	Verified	Status
1	SlowFast + FasterRCNN	mAP@0.5 role	25.93	—	Unverified