Human-Object Interaction Detection

Human-Object Interaction (HOI) detection is the task of identifying the set of interactions in an image. It involves (i) localizing the subject (i.e., the human) and the target (i.e., the object) of each interaction, and (ii) classifying the interaction label.
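
As a rough illustration of the two sub-tasks, the sketch below represents one interaction as a human box, an object box, an object label, and an interaction label, and checks a prediction against a ground-truth instance. The names `HOI`, `iou`, and `matches` are hypothetical, and the 0.5 IoU threshold on both boxes is an assumption borrowed from common HOI benchmark practice, not something this page specifies.

```python
from dataclasses import dataclass
from typing import Tuple

# Axis-aligned box as (x1, y1, x2, y2); this layout is an assumption.
Box = Tuple[float, float, float, float]


@dataclass(frozen=True)
class HOI:
    """One interaction instance: who, what, and which interaction."""
    human_box: Box
    object_box: Box
    object_label: str
    interaction: str


def iou(a: Box, b: Box) -> float:
    """Intersection-over-union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def matches(pred: HOI, gt: HOI, thresh: float = 0.5) -> bool:
    """True if labels agree and both boxes overlap the ground truth enough."""
    return (
        pred.interaction == gt.interaction
        and pred.object_label == gt.object_label
        and iou(pred.human_box, gt.human_box) >= thresh
        and iou(pred.object_box, gt.object_box) >= thresh
    )


# Usage: a slightly shifted prediction still matches; a wrong verb does not.
gt = HOI((0, 0, 10, 10), (5, 5, 15, 15), "bicycle", "ride")
good = HOI((0, 0, 10, 9), (5, 5, 15, 14), "bicycle", "ride")
wrong_verb = HOI((0, 0, 10, 9), (5, 5, 15, 14), "bicycle", "hold")
print(matches(good, gt), matches(wrong_verb, gt))
```

Requiring both boxes to pass the threshold reflects that localization and classification are scored jointly: a correct interaction label attached to the wrong person or object would not count.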

Papers

Showing 151–200 of 449 papers

Title	Date	Tasks	Status	Score
PersonaHOI: Effortlessly Improving Face Personalization in Human-Object Interaction Generation	Jan 1, 2025	Human-Object Interaction Detection, Human-Object Interaction Generation	Code Available	5
Recurrent Space-time Graph Neural Networks	Apr 11, 2019	Action Recognition, Human-Object Interaction Detection	Code Available	5
Toward Open-Set Human Object Interaction Detection	Mar 24, 2024	Contrastive Learning, Human-Object Interaction Detection	Code Available	5
Forecasting Human-Object Interaction: Joint Prediction of Motor Attention and Actions in First Person Video	Nov 25, 2019	Action Anticipation, Human-Object Interaction Detection	Code Available	5
Multiverse Through Deepfakes: The MultiFakeVerse Dataset of Person-Centric Visual and Conceptual Manipulations	Jun 1, 2025	DeepFake Detection, Face Swapping	Code Available	5
Descriptive Caption Enhancement with Visual Specialists for Multimodal Perception	Dec 18, 2024	Descriptive, Human-Object Interaction Detection	Code Available	5
Focusing on what to decode and what to train: SOV Decoding with Specific Target Guided DeNoising and Vision Language Advisor	Jul 5, 2023	Decoder, Denoising	Code Available	5
Leverage Interactive Affinity for Affordance Learning	Jan 1, 2023	Human-Object Interaction Detection, Object	Code Available	5
Fine-Grained Multi-View Hand Reconstruction Using Inverse Rendering	Jul 8, 2024	Human-Object Interaction Detection, Inverse Rendering	Code Available	5
Learning Human-Object Interactions by Graph Parsing Neural Networks	Aug 23, 2018	Human-Object Interaction Detection, Object	Code Available	5
Fine-grained Affordance Annotation for Egocentric Hand-Object Interaction Videos	Feb 7, 2023	Action Anticipation, Action Recognition	Code Available	5
No-Frills Human-Object Interaction Detection: Factorization, Layout Encodings, and Training Techniques	Nov 14, 2018	Human-Object Interaction Detection, Object	Code Available	5
Interaction Region Visual Transformer for Egocentric Action Anticipation	Nov 25, 2022	Action Anticipation, Human-Object Interaction Detection	Code Available	5
B-RIGHT: Benchmark Re-evaluation for Integrity in Generalized Human-Object Interaction Testing	Jan 28, 2025	Human-Object Interaction Detection	Code Available	5
Interactiveness Field in Human-Object Interactions	Apr 16, 2022	Human-Object Interaction Detection, Object	Code Available	5
Boosting Zero-Shot Human-Object Interaction Detection with Vision-Language Transfer	Mar 18, 2024	Human-Object Interaction Detection, Language Modeling	Code Available	5
A Plug-and-Play Method for Rare Human-Object Interactions Detection by Bridging Domain Gap	Jul 31, 2024	Human-Object Interaction Detection, Image Reconstruction	Code Available	5
Exploiting Multimodal Synthetic Data for Egocentric Human-Object Interaction Detection in an Industrial Scenario	Jun 21, 2023	Human-Object Interaction Detection	Code Available	5
Exploiting CLIP for Zero-shot HOI Detection Requires Knowledge Distillation at Multiple Levels	Sep 10, 2023	Human-Object Interaction Detection, Knowledge Distillation	Code Available	5
ERNet: Efficient and Reliable Human-Object Interaction Detection	Jan 26, 2023	Human-Object Interaction Detection, Object	Code Available	5
Object-centric Video Representation for Long-term Action Anticipation	Oct 31, 2023	Action Anticipation, Human-Object Interaction Detection	Code Available	5
Controllable Human-Object Interaction Synthesis	Dec 6, 2023	Human-Object Interaction Detection, Object	Unverified	0
End-to-End HOI Reconstruction Transformer with Graph-based Encoding	Mar 8, 2025	Human-Object Interaction Detection	Unverified	0
EigenActor: Variant Body-Object Interaction Generation Evolved from Invariant Action Basis Reasoning	Mar 1, 2025	Human-Object Interaction Detection, Object	Unverified	0
Contextual Heterogeneous Graph Network for Human-Object Interaction Detection	Oct 20, 2020	Graph Attention, Human-Object Interaction Detection	Unverified	0
EgoGaussian: Dynamic Scene Understanding from Egocentric Video with 3D Gaussian Splatting	Jun 28, 2024	Human-Object Interaction Detection, Object	Unverified	0
Contextual Guided Segmentation Framework for Semi-supervised Video Instance Segmentation	Jun 7, 2021	Human-Object Interaction Detection, Instance Segmentation	Unverified	0
EgoChoir: Capturing 3D Human-Object Interaction Regions from Egocentric Views	May 22, 2024	Human-Object Interaction Detection, Object	Unverified	0
Egocentric Human-Object Interaction Detection: A New Benchmark and Method	Jun 17, 2025	Benchmarking, Human-Object Interaction Detection	Unverified	0
Bilateral Adaptation for Human-Object Interaction Detection with Occlusion-Robustness	Jan 1, 2024	Human-Object Interaction Detection, object-detection	Unverified	0
AnchorCrafter: Animate CyberAnchors Saling Your Products via Human-Object Interacting Video Generation	Nov 26, 2024	Human-Object Interaction Detection, Object	Unverified	0
ContextHOI: Spatial Context Learning for Human-Object Interaction Detection	Dec 12, 2024	Human-Object Interaction Detection, Object	Unverified	0
Efficient Human-Object-Interaction (EHOI) Detection via Interaction Label Coding and Conditional Decision	Aug 13, 2024	Decision Making, Human-Object Interaction Detection	Unverified	0
Bi-Causal: Group Activity Recognition via Bidirectional Causality	Jan 1, 2024	Activity Recognition, Group Activity Recognition	Unverified	0
Effective Actor-centric Human-object Interaction Detection	Feb 24, 2022	Human-Object Interaction Detection, Object	Unverified	0
Dynamic Scene Understanding from Vision-Language Representations	Jan 20, 2025	Grounded Situation Recognition, Human-Human Interaction Recognition	Unverified	0
Compositional Learning in Transformer-Based Human-Object Interaction Detection	Aug 11, 2023	Human-Object Interaction Detection, Object	Unverified	0
Beyond Holistic Object Recognition: Enriching Image Understanding with Part States	Dec 15, 2016	Human-Object Interaction Detection, Image Captioning	Unverified	0
An analysis of HOI: using a training-free method with multimodal visual foundation models when only the test set is available, without the training set	Aug 11, 2024	Human-Object Interaction Detection	Unverified	0
A Deep Learning Approach to Object Affordance Segmentation	Apr 18, 2020	Deep Learning, Human-Object Interaction Detection	Unverified	0
DropKey for Vision Transformer	Jan 1, 2023	Human-Object Interaction Detection, image-classification	Unverified	0
DropKey	Aug 4, 2022	Human-Object Interaction Detection, image-classification	Unverified	0
Compositional Learning for Human Object Interaction	Sep 1, 2018	Human-Object Interaction Detection, Object	Unverified	0
DreamHOI: Subject-Driven Generation of 3D Human-Object Interactions with Diffusion Priors	Sep 12, 2024	Human-Object Interaction Detection, NeRF	Unverified	0
Compositional 3D Human-Object Neural Animation	Apr 27, 2023	Human-Object Interaction Detection, NeRF	Unverified	0
DPMix: Mixture of Depth and Point Cloud Video Experts for 4D Action Segmentation	Jul 31, 2023	Action Segmentation, Human-Object Interaction Detection	Unverified	0
Do Deep Neural Networks Model Nonlinear Compositionality in the Neural Representation of Human-Object Interactions?	Mar 31, 2019	Human-Object Interaction Detection, Object	Unverified	0
Complex Video Action Reasoning via Learnable Markov Logic Network	Jan 1, 2022	Action Recognition, Human-Object Interaction Detection	Unverified	0
Distillation Using Oracle Queries for Transformer-Based Human-Object Interaction Detection	Jan 1, 2022	Data Augmentation, Decoder	Unverified	0
Distillation of Human-Object Interaction Contexts for Action Recognition	Dec 17, 2021	Action Recognition, Graph Attention	Unverified	0

Datasets: HICO-DET, V-COCO, HICO, VidHOI, Ambiguous-HOI, MECCANO

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Ours (PViC+)	mAP	46.49	—	Unverified
2	RLIPv2 (Swin-L)	mAP	45.09	—	Unverified
3	PViC-SwinL	mAP	44.32	—	Unverified
4	SOV-STG (Swin-L)	mAP	43.35	—	Unverified
5	DiffHOI	mAP	41.5	—	Unverified
6	ViPLO	mAP	37.22	—	Unverified
7	FGAHOI	mAP	37.18	—	Unverified
8	ERNet	mAP	36.89	—	Unverified
9	CQL+GEN-VLKT-L	mAP	36.03	—	Unverified
10	QAHOI (Swin-L)	mAP	35.78	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	RLIPv2	AP(S1)	72.1	—	Unverified
2	MUREN	AP(S1)	68.8	—	Unverified
3	STIP	AP(S1)	66	—	Unverified
4	DiffHOI	AP(S1)	65.7	—	Unverified
5	OCN (ResNet101)	AP(S1)	65.3	—	Unverified
6	OCN (ResNet50)	AP(S1)	64.2	—	Unverified
7	CDN (ResNet101)	AP(S1)	63.91	—	Unverified
8	HOICLIP	AP(S1)	63.5	—	Unverified
9	QPIC + CPC	MAP	63.1	—	Unverified
10	Body Part Interactiveness	AP(S1)	63	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DEFR	mAP	65.6	—	Unverified
2	HAKE	mAP	47.1	—	Unverified
3	PaStaNet	mAP	46.3	—	Unverified
4	RelViT	mAP	43.98	—	Unverified
5	Pairwise-Part	mAP	39.9	—	Unverified
6	Mallya & Lazebnik	mAP	36.1	—	Unverified
7	Girdhar & Ramanan	mAP	34.6	—	Unverified
8	R*CNN	mAP	28.5	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	HOI4ABOT	Detection: Full (mAP@0.5)	11.12	—	Unverified
2	ST-GAZE	Detection: Full (mAP@0.5)	10.4	—	Unverified
3	STTRAN	Detection: Full (mAP@0.5)	7.61	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DJ-RN	mAP	10.37	—	Unverified
2	iCAN	mAP	8.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SlowFast + FasterRCNN	mAP@0.5 role	25.93	—	Unverified