Human-Object Interaction Detection

Human-Object Interaction (HOI) detection is a task of identifying "a set of interactions" in an image, which involves the i) localization of the subject (i.e., humans) and target (i.e., objects) of interaction, and ii) the classification of the interaction labels.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 276–300 of 449 papers

Title	Date	Tasks	Status	Hype
THORN: Temporal Human-Object Relation Network for Action Recognition	Apr 20, 2022	Action RecognitionHuman-Object Interaction Detection	—Unverified	0
Interactiveness Field in Human-Object Interactions	Apr 16, 2022	Human-Object Interaction DetectionObject	CodeCode Available	0
Egocentric Human-Object Interaction Detection Exploiting Synthetic Data	Apr 14, 2022	Human-Object Interaction DetectionObject	CodeCode Available	0
BEHAVE: Dataset and Method for Tracking Human Object Interactions	Apr 14, 2022	3D Human Reconstruction3D Object Reconstruction	CodeCode Available	1
Consistency Learning via Decoding Path Augmentation for Transformers in Human Object Interaction Detection	Apr 11, 2022	Human-Object Interaction Detectionobject-detection	CodeCode Available	0
Category-Aware Transformer Network for Better Human-Object Interaction Detection	Apr 11, 2022	Human-Object Interaction DetectionObject	—Unverified	0
What to look at and where: Semantic and Spatial Refined Transformer for detecting human-object interactions	Apr 2, 2022	DecoderHuman-Object Interaction Detection	—Unverified	0
End-to-End Zero-Shot HOI Detection via Vision and Language Knowledge Distillation	Apr 1, 2022	Human-Object Interaction DetectionKnowledge Distillation	CodeCode Available	1
MSTR: Multi-Scale Transformer for End-to-End Human-Object Interaction Detection	Mar 28, 2022	DecoderHuman-Object Interaction Detection	—Unverified	0
Discovering Human-Object Interaction Concepts via Self-Compositional Learning	Mar 27, 2022	Affordance RecognitionHuman-Object Interaction Concept Discovery	CodeCode Available	1
GEN-VLKT: Simplify Association and Enhance Interaction Understanding for HOI Detection	Mar 26, 2022	DecoderHuman-Object Interaction Detection	CodeCode Available	1
Iwin: Human-Object Interaction Detection via Transformer with Irregular Windows	Mar 20, 2022	Human-Object Interaction DetectionObject	—Unverified	0
Learning Affordance Grounding from Exocentric Images	Mar 18, 2022	DiversityHuman-Object Interaction Detection	CodeCode Available	1
The Overlooked Classifier in Human-Object Interaction Recognition	Mar 10, 2022	ClassificationHuman-Object Interaction Detection	—Unverified	0
HOI4D: A 4D Egocentric Dataset for Category-Level Human-Object Interaction	Mar 3, 2022	Action SegmentationBenchmarking	CodeCode Available	1
NeuralHOFusion: Neural Volumetric Rendering under Human-object Interactions	Feb 25, 2022	Human-Object Interaction DetectionMulti-Task Learning	—Unverified	0
Effective Actor-centric Human-object Interaction Detection	Feb 24, 2022	Human-Object Interaction DetectionObject	—Unverified	0
Highlighting Object Category Immunity for the Generalization of Human-Object Interaction Detection	Feb 19, 2022	Human-Object Interaction DetectionObject	CodeCode Available	0
HAKE: A Knowledge Engine Foundation for Human Activity Understanding	Feb 14, 2022	Action RecognitionHuman-Object Interaction Detection	CodeCode Available	2
Webly Supervised Concept Expansion for General Purpose Vision Models	Feb 4, 2022	Human-Object Interaction DetectionImage Retrieval	—Unverified	0
Multi-Stage Deep Transfer Learning for EmIoT-enabled Human-Computer Interaction	Feb 3, 2022	Human-Object Interaction Detectiontext-to-speech	—Unverified	0
DexVIP: Learning Dexterous Grasping with Human Hand Pose Priors from Video	Feb 1, 2022	Deep Reinforcement LearningHuman-Object Interaction Detection	—Unverified	0
Detecting Human-Object Interactions with Object-Guided Cross-Modal Calibrated Semantics	Feb 1, 2022	Human-Object Interaction DetectionObject	CodeCode Available	1
Complex Video Action Reasoning via Learnable Markov Logic Network	Jan 1, 2022	Action RecognitionHuman-Object Interaction Detection	—Unverified	0
Learning Transferable Human-Object Interaction Detector With Natural Language Supervision	Jan 1, 2022	Human-Object Interaction Detection	CodeCode Available	1

Show:10 25 50

← PrevPage 12 of 18Next →

All datasets HICO-DET V-COCO HICO VidHOI Ambiguious-HOI MECCANO

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Ours (PViC+)	mAP	46.49	—	Unverified
2	RLIPv2 (Swin-L)	mAP	45.09	—	Unverified
3	PViC-SwinL	mAP	44.32	—	Unverified
4	SOV-STG (Swin-L)	mAP	43.35	—	Unverified
5	DiffHOI	mAP	41.5	—	Unverified
6	ViPLO	mAP	37.22	—	Unverified
7	FGAHOI	mAP	37.18	—	Unverified
8	ERNet	mAP	36.89	—	Unverified
9	CQL+GEN-VLKT-L	mAP	36.03	—	Unverified
10	QAHOI (Swin-L)	mAP	35.78	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	RLIPv2	AP(S1)	72.1	—	Unverified
2	MUREN	AP(S1)	68.8	—	Unverified
3	STIP	AP(S1)	66	—	Unverified
4	DiffHOI	AP(S1)	65.7	—	Unverified
5	OCN (ResNet101)	AP(S1)	65.3	—	Unverified
6	OCN (ResNet50)	AP(S1)	64.2	—	Unverified
7	CDN (ResNet101)	AP(S1)	63.91	—	Unverified
8	HOICLIP	AP(S1)	63.5	—	Unverified
9	QPIC + CPC	MAP	63.1	—	Unverified
10	Body Part Interactiveness	AP(S1)	63	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DEFR	mAP	65.6	—	Unverified
2	HAKE	mAP	47.1	—	Unverified
3	PaStaNet	mAP	46.3	—	Unverified
4	RelViT	mAP	43.98	—	Unverified
5	Pairwise-Part	mAP	39.9	—	Unverified
6	Mallya & Lazebnik	mAP	36.1	—	Unverified
7	Girdhar & Ramanan	mAP	34.6	—	Unverified
8	R*CNN	mAP	28.5	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	HOI4ABOT	Detection: Full (mAP@0.5)	11.12	—	Unverified
2	ST-GAZE	Detection: Full (mAP@0.5)	10.4	—	Unverified
3	STTRAN	Detection: Full (mAP@0.5)	7.61	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DJ-RN	mAP	10.37	—	Unverified
2	iCAN	mAP	8.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SlowFast + FasterRCNN	mAP@0.5 role	25.93	—	Unverified