Human-Object Interaction Detection

Human-Object Interaction (HOI) detection is a task of identifying "a set of interactions" in an image, which involves the i) localization of the subject (i.e., humans) and target (i.e., objects) of interaction, and ii) the classification of the interaction labels.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 201–225 of 449 papers

Title	Date	Tasks	Status
An Abstract Specification of VoxML as an Annotation Language	May 22, 2023	Human Agent CollaborationHuman-Object Interaction Detection	—Unverified
Human Motion Prediction, Reconstruction, and Generation	Feb 21, 2025	Human motion predictionHuman-Object Interaction Detection	—Unverified
Human-Object Interaction Detection Collaborated with Large Relation-driven Diffusion Models	Oct 26, 2024	Contrastive LearningHuman-Object Interaction Detection	—Unverified
Test-time Distribution Learning Adapter for Cross-modal Visual Reasoning	Mar 10, 2024	Human-Object Interaction DetectionPrediction	—Unverified
Human Object Interaction Detection using Two-Direction Spatial Enhancement and Exclusive Object Prior	May 7, 2021	Human-Object Interaction DetectionObject	—Unverified
Human-Object Interaction Detection via Disentangled Transformer	Apr 20, 2022	DecoderHuman-Object Interaction Detection	—Unverified
Human-Object Interaction Detection via Weak Supervision	Dec 1, 2021	Human-Object Interaction DetectionObject	—Unverified
Human-Object Interaction from Human-Level Instructions	Jun 25, 2024	Common Sense ReasoningHuman-Object Interaction Detection	—Unverified
ZeroHSI: Zero-Shot 4D Human-Scene Interaction by Video Generation	Dec 24, 2024	Human-Object Interaction DetectionVideo Generation	—Unverified
Human-Object Interaction with Vision-Language Model Guided Relative Movement Dynamics	Mar 24, 2025	Human-Object Interaction DetectionLanguage Modeling	—Unverified
Hummingbird: High Fidelity Image Generation via Multimodal Context Alignment	Feb 7, 2025	DiversityHuman-Object Interaction Detection	—Unverified
HUMOTO: A 4D Dataset of Mocap Human Object Interactions	Apr 14, 2025	Human-Object Interaction DetectionMotion Generation	—Unverified
HunyuanVideo-HOMA: Generic Human-Object Interaction in Multimodal Driven Human Animation	Jun 10, 2025	Human AnimationHuman-Object Interaction Detection	—Unverified
The Overlooked Classifier in Human-Object Interaction Recognition	Mar 10, 2022	ClassificationHuman-Object Interaction Detection	—Unverified
iMapper: Interaction-guided Joint Scene and Human Motion Mapping from Monocular Videos	Jun 20, 2018	Human-Object Interaction DetectionObject	—Unverified
I'M HOI: Inertia-aware Monocular Capture of 3D Human-Object Interactions	Dec 10, 2023	Human-Object Interaction DetectionObject	—Unverified
THORN: Temporal Human-Object Relation Network for Action Recognition	Apr 20, 2022	Action RecognitionHuman-Object Interaction Detection	—Unverified
Improving Human-Object Interaction Detection via Phrase Learning and Label Composition	Dec 14, 2021	Human-Object Interaction DetectionScene Understanding	—Unverified
Improving Human-Object Interaction Detection via Virtual Image Learning	Aug 4, 2023	Human-Object Interaction DetectionObject	—Unverified
Amplifying Key Cues for Human-Object-Interaction Detection	Aug 1, 2020	Human-Object Interaction DetectionObject	—Unverified
THOR: Text to Human-Object Interaction Diffusion via Relation Intervention	Mar 17, 2024	DiversityHuman-Object Interaction Detection	—Unverified
Instant-NVR: Instant Neural Volumetric Rendering for Human-object Interactions from Monocular RGBD Stream	Apr 6, 2023	Human-Object Interaction DetectionNovel View Synthesis	—Unverified
InterAct: Advancing Large-Scale Versatile 3D Human-Object Interaction Generation	Jan 1, 2025	BenchmarkingHuman-Object Interaction Detection	—Unverified
InteractAnything: Zero-shot Human Object Interaction Synthesis via LLM Feedback and Object Affordance Parsing	May 30, 2025	Human-Object Interaction DetectionObject	—Unverified
Interact as You Intend: Intention-Driven Human-Object Interaction Detection	Aug 29, 2018	Human-Object Interaction Detection	—Unverified

Show:10 25 50

← PrevPage 9 of 18Next →

All datasets HICO-DET V-COCO HICO VidHOI Ambiguious-HOI MECCANO

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Ours (PViC+)	mAP	46.49	—	Unverified
2	RLIPv2 (Swin-L)	mAP	45.09	—	Unverified
3	PViC-SwinL	mAP	44.32	—	Unverified
4	SOV-STG (Swin-L)	mAP	43.35	—	Unverified
5	DiffHOI	mAP	41.5	—	Unverified
6	ViPLO	mAP	37.22	—	Unverified
7	FGAHOI	mAP	37.18	—	Unverified
8	ERNet	mAP	36.89	—	Unverified
9	CQL+GEN-VLKT-L	mAP	36.03	—	Unverified
10	QAHOI (Swin-L)	mAP	35.78	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	RLIPv2	AP(S1)	72.1	—	Unverified
2	MUREN	AP(S1)	68.8	—	Unverified
3	STIP	AP(S1)	66	—	Unverified
4	DiffHOI	AP(S1)	65.7	—	Unverified
5	OCN (ResNet101)	AP(S1)	65.3	—	Unverified
6	OCN (ResNet50)	AP(S1)	64.2	—	Unverified
7	CDN (ResNet101)	AP(S1)	63.91	—	Unverified
8	HOICLIP	AP(S1)	63.5	—	Unverified
9	QPIC + CPC	MAP	63.1	—	Unverified
10	Body Part Interactiveness	AP(S1)	63	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DEFR	mAP	65.6	—	Unverified
2	HAKE	mAP	47.1	—	Unverified
3	PaStaNet	mAP	46.3	—	Unverified
4	RelViT	mAP	43.98	—	Unverified
5	Pairwise-Part	mAP	39.9	—	Unverified
6	Mallya & Lazebnik	mAP	36.1	—	Unverified
7	Girdhar & Ramanan	mAP	34.6	—	Unverified
8	R*CNN	mAP	28.5	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	HOI4ABOT	Detection: Full (mAP@0.5)	11.12	—	Unverified
2	ST-GAZE	Detection: Full (mAP@0.5)	10.4	—	Unverified
3	STTRAN	Detection: Full (mAP@0.5)	7.61	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DJ-RN	mAP	10.37	—	Unverified
2	iCAN	mAP	8.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SlowFast + FasterRCNN	mAP@0.5 role	25.93	—	Unverified