Human-Object Interaction Detection

Human-Object Interaction (HOI) detection is the task of identifying the set of interactions in an image. It involves (i) localizing the subject (the human) and the target (the object) of each interaction, and (ii) classifying the interaction label.
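As a rough illustration of the definition above, the sketch below represents one interaction as a human box, an object box, an object category, and an interaction label, so that the output for an image is a set of such instances. The names `HOIInstance`, `iou`, and `is_match` are hypothetical, and the matching rule (both boxes must overlap the ground truth with IoU of at least 0.5 and the labels must agree) is an assumed convention for illustration, not a rule stated on this page.

```python
from dataclasses import dataclass
from typing import Tuple

# Axis-aligned box as (x1, y1, x2, y2) in pixels (assumed layout).
Box = Tuple[float, float, float, float]


@dataclass(frozen=True)
class HOIInstance:
    """One interaction: who (human box), with what (object), doing what."""
    human_box: Box
    object_box: Box
    object_label: str
    interaction: str


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two boxes; 0.0 when they do not overlap."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def is_match(pred: HOIInstance, gt: HOIInstance, iou_threshold: float = 0.5) -> bool:
    """Assumed criterion: labels agree and both boxes overlap enough."""
    return (
        pred.interaction == gt.interaction
        and pred.object_label == gt.object_label
        and iou(pred.human_box, gt.human_box) >= iou_threshold
        and iou(pred.object_box, gt.object_box) >= iou_threshold
    )


# Made-up example values: a person riding a bicycle.
gt = HOIInstance((10, 10, 110, 210), (90, 120, 190, 220), "bicycle", "ride")
pred = HOIInstance((12, 8, 108, 205), (95, 125, 185, 215), "bicycle", "ride")
wrong_verb = HOIInstance(pred.human_box, pred.object_box, "bicycle", "hold")

# The detector output for an image is a set of interaction instances.
predictions = {pred, wrong_verb}
matched = [p for p in predictions if is_match(p, gt)]
```

Here `pred` matches the ground truth, while `wrong_verb` does not, even though its boxes are identical to those of `pred`: localization alone is not enough when the interaction label is wrong.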

Papers

Showing 76–100 of 449 papers

Title	Date	Tasks	Status	Hype	Score
End-to-End Zero-Shot HOI Detection via Vision and Language Knowledge Distillation	Apr 1, 2022	Human-Object Interaction Detection, Knowledge Distillation	Code Available	1	5
Detailed 2D-3D Joint Representation for Human-Object Interaction	Apr 17, 2020	Action Understanding, Human-Object Interaction Detection	Code Available	1	5
Category Query Learning for Human-Object Interaction Classification	Mar 24, 2023	Classification, Decoder	Code Available	1	5
Detecting Any Human-Object Interaction Relationship: Universal HOI Detector with Spatial Prompt Learning on Foundation Models	Nov 7, 2023	Decoder, Human-Object Interaction Detection	Code Available	1	5
Exploiting Scene Graphs for Human-Object Interaction Detection	Aug 19, 2021	Human-Object Interaction Detection, Object	Code Available	1	5
Detecting Human-Object Interactions with Action Co-occurrence Priors	Jul 17, 2020	Human-Object Interaction Detection	Code Available	1	5
Detecting Human-Object Interactions with Action Co-occurrence Priors	Aug 1, 2020	Human-Object Interaction Detection	Code Available	1	5
Detecting Human-Object Interactions with Object-Guided Cross-Modal Calibrated Semantics	Feb 1, 2022	Human-Object Interaction Detection, Object	Code Available	1	5
Detecting Human-Object Interaction via Fabricated Compositional Learning	Mar 15, 2021	Affordance Recognition, Human-Object Interaction Detection	Code Available	1	5
Full-Body Articulated Human-Object Interaction	Dec 20, 2022	Action Recognition, Human-Object Interaction Detection	Code Available	1	5
EZ-HOI: VLM Adaptation via Guided Prompt Learning for Zero-Shot HOI Detection	Oct 31, 2024	Human-Object Interaction Detection, Large Language Model	Code Available	1	5
Bongard-HOI: Benchmarking Few-Shot Visual Reasoning for Human-Object Interactions	May 27, 2022	Benchmarking, Few-Shot Image Classification	Code Available	1	5
First Place Solution to the CVPR'2023 AQTC Challenge: A Function-Interaction Centric Approach with Spatiotemporal Visual-Language Alignment	Jun 23, 2023	Human-Object Interaction Detection	Code Available	1	5
Diagnosing Human-object Interaction Detectors	Aug 16, 2023	Classification, Human-Object Interaction Detection	Code Available	1	5
Glance and Gaze: Inferring Action-aware Points for One-Stage Human-Object Interaction Detection	Apr 12, 2021	Human-Object Interaction Detection	Code Available	1	5
GEN-VLKT: Simplify Association and Enhance Interaction Understanding for HOI Detection	Mar 26, 2022	Decoder, Human-Object Interaction Detection	Code Available	1	5
DIRV: Dense Interaction Region Voting for End-to-End Human-Object Interaction Detection	Oct 2, 2020	Human-Object Interaction Detection	Code Available	1	5
Discovering A Variety of Objects in Spatio-Temporal Human-Object Interactions	Nov 14, 2022	Human-Object Interaction Detection, Object	Code Available	1	5
Geometric Features Informed Multi-person Human-object Interaction Recognition in Videos	Jul 19, 2022	Human-Object Interaction Detection	Code Available	1	5
ACP++: Action Co-occurrence Priors for Human-Object Interaction Detection	Sep 9, 2021	Human-Object Interaction Detection	Code Available	1	5
Discovering Human Interactions With Novel Objects via Zero-Shot Learning	Jun 1, 2020	Human-Object Interaction Detection, Object	Code Available	1	5
Discovering Human-Object Interaction Concepts via Self-Compositional Learning	Mar 27, 2022	Affordance Recognition, Human-Object Interaction Concept Discovery	Code Available	1	5
Exploring Conditional Multi-Modal Prompts for Zero-shot HOI Detection	Aug 5, 2024	Human-Object Interaction Detection, Prompt Learning	Code Available	1	5
Auto-Regressive Diffusion for Generating 3D Human-Object Interactions	Mar 21, 2025	Human-Object Interaction Detection, Mamba	Code Available	1	5
Learning Granularity-Aware Affordances from Human-Object Interaction for Tool-Based Functional Grasping in Dexterous Robotics	Jun 30, 2024	Human-Object Interaction Detection, Object	Code Available	1	5

Datasets: HICO-DET, V-COCO, HICO, VidHOI, Ambiguous-HOI, MECCANO

Benchmark Results

HICO-DET
#	Model	Metric	Claimed	Verified	Status
1	Ours (PViC+)	mAP	46.49	—	Unverified
2	RLIPv2 (Swin-L)	mAP	45.09	—	Unverified
3	PViC-SwinL	mAP	44.32	—	Unverified
4	SOV-STG (Swin-L)	mAP	43.35	—	Unverified
5	DiffHOI	mAP	41.5	—	Unverified
6	ViPLO	mAP	37.22	—	Unverified
7	FGAHOI	mAP	37.18	—	Unverified
8	ERNet	mAP	36.89	—	Unverified
9	CQL+GEN-VLKT-L	mAP	36.03	—	Unverified
10	QAHOI (Swin-L)	mAP	35.78	—	Unverified

V-COCO
#	Model	Metric	Claimed	Verified	Status
1	RLIPv2	AP(S1)	72.1	—	Unverified
2	MUREN	AP(S1)	68.8	—	Unverified
3	STIP	AP(S1)	66	—	Unverified
4	DiffHOI	AP(S1)	65.7	—	Unverified
5	OCN (ResNet101)	AP(S1)	65.3	—	Unverified
6	OCN (ResNet50)	AP(S1)	64.2	—	Unverified
7	CDN (ResNet101)	AP(S1)	63.91	—	Unverified
8	HOICLIP	AP(S1)	63.5	—	Unverified
9	QPIC + CPC	mAP	63.1	—	Unverified
10	Body Part Interactiveness	AP(S1)	63	—	Unverified

HICO
#	Model	Metric	Claimed	Verified	Status
1	DEFR	mAP	65.6	—	Unverified
2	HAKE	mAP	47.1	—	Unverified
3	PaStaNet	mAP	46.3	—	Unverified
4	RelViT	mAP	43.98	—	Unverified
5	Pairwise-Part	mAP	39.9	—	Unverified
6	Mallya & Lazebnik	mAP	36.1	—	Unverified
7	Girdhar & Ramanan	mAP	34.6	—	Unverified
8	R*CNN	mAP	28.5	—	Unverified

VidHOI
#	Model	Metric	Claimed	Verified	Status
1	HOI4ABOT	Detection: Full (mAP@0.5)	11.12	—	Unverified
2	ST-GAZE	Detection: Full (mAP@0.5)	10.4	—	Unverified
3	STTRAN	Detection: Full (mAP@0.5)	7.61	—	Unverified

Ambiguous-HOI
#	Model	Metric	Claimed	Verified	Status
1	DJ-RN	mAP	10.37	—	Unverified
2	iCAN	mAP	8.14	—	Unverified

MECCANO
#	Model	Metric	Claimed	Verified	Status
1	SlowFast + FasterRCNN	mAP@0.5 role	25.93	—	Unverified