Human-Object Interaction Detection

Human-Object Interaction (HOI) detection is a task of identifying "a set of interactions" in an image, which involves the i) localization of the subject (i.e., humans) and target (i.e., objects) of interaction, and ii) the classification of the interaction labels.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 51–100 of 449 papers

Title	Date	Tasks	Status	Hype
Full-Body Articulated Human-Object Interaction	Dec 20, 2022	Action RecognitionHuman-Object Interaction Detection	CodeCode Available	1
IMos: Intent-Driven Full-Body Motion Synthesis for Human-Object Interactions	Dec 14, 2022	Human-Object Interaction DetectionMotion Synthesis	CodeCode Available	1
Discovering A Variety of Objects in Spatio-Temporal Human-Object Interactions	Nov 14, 2022	Human-Object Interaction DetectionObject	CodeCode Available	1
PoseGPT: Quantization-based 3D Human Motion Generation and Forecasting	Oct 19, 2022	Human-Object Interaction DetectionMotion Generation	CodeCode Available	1
Articulated 3D Human-Object Interactions from RGB Videos: An Empirical Analysis of Approaches and Challenges	Sep 12, 2022	3D ReconstructionHuman-Object Interaction Detection	CodeCode Available	1
RLIP: Relational Language-Image Pre-training for Human-Object Interaction Detection	Sep 5, 2022	Human-Object Interaction DetectionRelation	CodeCode Available	1
SR-GNN: Spatial Relation-aware Graph Neural Network for Fine-Grained Image Categorization	Sep 5, 2022	Fine-Grained Image ClassificationGraph Neural Network	CodeCode Available	1
Grounded Affordance from Exocentric View	Aug 28, 2022	DiversityHuman-Object Interaction Detection	CodeCode Available	1
Distance-Aware Occlusion Detection with Focused Attention	Aug 23, 2022	DecoderHuman-Object Interaction Detection	CodeCode Available	1
Mining Cross-Person Cues for Body-Part Interactiveness Learning in HOI Detection	Jul 28, 2022	Human-Object Interaction Detection	CodeCode Available	1
Geometric Features Informed Multi-person Human-object Interaction Recognition in Videos	Jul 19, 2022	Human-Object Interaction Detection	CodeCode Available	1
Towards Hard-Positive Query Mining for DETR-based Human-Object Interaction Detection	Jul 12, 2022	Human-Object Interaction DetectionObject	CodeCode Available	1
Learning Continuous Grasping Function with a Dexterous Hand from Human Demonstrations	Jul 11, 2022	Human-Object Interaction Detectionmotion retargeting	CodeCode Available	1
Exploring Structure-aware Transformer over Interaction Proposals for Human-Object Interaction Detection	Jun 13, 2022	Human-Object Interaction DetectionObject	CodeCode Available	1
Bongard-HOI: Benchmarking Few-Shot Visual Reasoning for Human-Object Interactions	May 27, 2022	BenchmarkingFew-Shot Image Classification	CodeCode Available	1
RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning	Apr 24, 2022	Human-Object Interaction DetectionObject	CodeCode Available	1
BEHAVE: Dataset and Method for Tracking Human Object Interactions	Apr 14, 2022	3D Human Reconstruction3D Object Reconstruction	CodeCode Available	1
End-to-End Zero-Shot HOI Detection via Vision and Language Knowledge Distillation	Apr 1, 2022	Human-Object Interaction DetectionKnowledge Distillation	CodeCode Available	1
Discovering Human-Object Interaction Concepts via Self-Compositional Learning	Mar 27, 2022	Affordance RecognitionHuman-Object Interaction Concept Discovery	CodeCode Available	1
GEN-VLKT: Simplify Association and Enhance Interaction Understanding for HOI Detection	Mar 26, 2022	DecoderHuman-Object Interaction Detection	CodeCode Available	1
Learning Affordance Grounding from Exocentric Images	Mar 18, 2022	DiversityHuman-Object Interaction Detection	CodeCode Available	1
HOI4D: A 4D Egocentric Dataset for Category-Level Human-Object Interaction	Mar 3, 2022	Action SegmentationBenchmarking	CodeCode Available	1
Detecting Human-Object Interactions with Object-Guided Cross-Modal Calibrated Semantics	Feb 1, 2022	Human-Object Interaction DetectionObject	CodeCode Available	1
Learning Transferable Human-Object Interaction Detector With Natural Language Supervision	Jan 1, 2022	Human-Object Interaction Detection	CodeCode Available	1
QAHOI: Query-Based Anchors for Human-Object Interaction Detection	Dec 16, 2021	Human-Object Interaction DetectionObject	CodeCode Available	1
Efficient Two-Stage Detection of Human-Object Interactions with a Novel Unary-Pairwise Transformer	Dec 3, 2021	GPUHuman-Object Interaction Detection	CodeCode Available	1
Virtual Multi-Modality Self-Supervised Foreground Matting for Human-Object Interaction	Oct 7, 2021	DecoderHuman-Object Interaction Detection	CodeCode Available	1
Weakly Supervised Human-Object Interaction Detection in Video via Contrastive Spatiotemporal Regions	Oct 7, 2021	Human-Object Interaction DetectionObject	CodeCode Available	1
ACP++: Action Co-occurrence Priors for Human-Object Interaction Detection	Sep 9, 2021	Human-Object Interaction Detection	CodeCode Available	1
Exploiting Scene Graphs for Human-Object Interaction Detection	Aug 19, 2021	Human-Object Interaction DetectionObject	CodeCode Available	1
D3D-HOI: Dynamic 3D Human-Object Interactions from Videos	Aug 19, 2021	Human-Object Interaction DetectionObject	CodeCode Available	1
Mining the Benefits of Two-stage and One-stage HOI Detection	Aug 11, 2021	ClassificationHuman-Object Interaction Detection	CodeCode Available	1
GTNet:Guided Transformer Network for Detecting Human-Object Interactions	Aug 2, 2021	Human-Object Interaction DetectionObject	CodeCode Available	1
Spot What Matters: Learning Context Using Graph Convolutional Networks for Weakly-Supervised Action Detection	Jul 28, 2021	Action DetectionHuman-Object Interaction Detection	CodeCode Available	1
Dynamics-Regulated Kinematic Policy for Egocentric Pose Estimation	Jun 10, 2021	Egocentric Pose EstimationHuman-Object Interaction Detection	CodeCode Available	1
ST-HOI: A Spatial-Temporal Baseline for Human-Object Interaction Detection in Videos	May 25, 2021	Action DetectionHuman-Object Interaction Anticipation	CodeCode Available	1
HOTR: End-to-End Human-Object Interaction Detection with Transformers	Apr 28, 2021	DecoderHuman-Object Interaction Detection	CodeCode Available	1
Glance and Gaze: Inferring Action-aware Points for One-Stage Human-Object Interaction Detection	Apr 12, 2021	Human-Object Interaction Detection	CodeCode Available	1
Affordance Transfer Learning for Human-Object Interaction Detection	Apr 7, 2021	Affordance DetectionAffordance Recognition	CodeCode Available	1
Detecting Human-Object Interaction via Fabricated Compositional Learning	Mar 15, 2021	Affordance RecognitionHuman-Object Interaction Detection	CodeCode Available	1
Reformulating HOI Detection as Adaptive Set Prediction	Mar 10, 2021	Human-Object Interaction DetectionPrediction	CodeCode Available	1
QPIC: Query-Based Pairwise Human-Object Interaction Detection with Image-Wide Contextual Information	Mar 9, 2021	Human-Object Interaction Concept DiscoveryHuman-Object Interaction Detection	CodeCode Available	1
End-to-End Human Object Interaction Detection with HOI Transformer	Mar 8, 2021	Human-Object Interaction Detectionobject-detection	CodeCode Available	1
In Defense of Scene Graphs for Image Captioning	Feb 9, 2021	Human-Object Interaction DetectionImage Captioning	CodeCode Available	1
Transferable Interactiveness Knowledge for Human-Object Interaction Detection	Jan 25, 2021	Human-Object Interaction DetectionObject	CodeCode Available	1
Interaction Compass: Multi-Label Zero-Shot Learning of Human-Object Interactions via Spatial Relations	Jan 1, 2021	Human-Object Interaction DetectionMulti-label zero-shot learning	CodeCode Available	1
LIGHTEN: Learning Interactions with Graph and Hierarchical TEmporal Networks for HOI in videos	Dec 17, 2020	Human-Object Interaction DetectionRelationship Detection	CodeCode Available	1
Spatially Conditioned Graphs for Detecting Human-Object Interactions	Dec 11, 2020	Human-Object Interaction DetectionObject	CodeCode Available	1
HOI Analysis: Integrating and Decomposing Human-Object Interaction	Oct 30, 2020	Human-Object Interaction DetectionObject	CodeCode Available	1
The MECCANO Dataset: Understanding Human-Object Interactions from Egocentric Videos in an Industrial-like Domain	Oct 12, 2020	Action RecognitionActive Object Detection	CodeCode Available	1

Show:10 25 50

← PrevPage 2 of 9Next →

All datasets HICO-DET V-COCO HICO VidHOI Ambiguious-HOI MECCANO

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Ours (PViC+)	mAP	46.49	—	Unverified
2	RLIPv2 (Swin-L)	mAP	45.09	—	Unverified
3	PViC-SwinL	mAP	44.32	—	Unverified
4	SOV-STG (Swin-L)	mAP	43.35	—	Unverified
5	DiffHOI	mAP	41.5	—	Unverified
6	ViPLO	mAP	37.22	—	Unverified
7	FGAHOI	mAP	37.18	—	Unverified
8	ERNet	mAP	36.89	—	Unverified
9	CQL+GEN-VLKT-L	mAP	36.03	—	Unverified
10	QAHOI (Swin-L)	mAP	35.78	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	RLIPv2	AP(S1)	72.1	—	Unverified
2	MUREN	AP(S1)	68.8	—	Unverified
3	STIP	AP(S1)	66	—	Unverified
4	DiffHOI	AP(S1)	65.7	—	Unverified
5	OCN (ResNet101)	AP(S1)	65.3	—	Unverified
6	OCN (ResNet50)	AP(S1)	64.2	—	Unverified
7	CDN (ResNet101)	AP(S1)	63.91	—	Unverified
8	HOICLIP	AP(S1)	63.5	—	Unverified
9	QPIC + CPC	MAP	63.1	—	Unverified
10	Body Part Interactiveness	AP(S1)	63	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DEFR	mAP	65.6	—	Unverified
2	HAKE	mAP	47.1	—	Unverified
3	PaStaNet	mAP	46.3	—	Unverified
4	RelViT	mAP	43.98	—	Unverified
5	Pairwise-Part	mAP	39.9	—	Unverified
6	Mallya & Lazebnik	mAP	36.1	—	Unverified
7	Girdhar & Ramanan	mAP	34.6	—	Unverified
8	R*CNN	mAP	28.5	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	HOI4ABOT	Detection: Full ([email protected])	11.12	—	Unverified
2	ST-GAZE	Detection: Full ([email protected])	10.4	—	Unverified
3	STTRAN	Detection: Full ([email protected])	7.61	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DJ-RN	mAP	10.37	—	Unverified
2	iCAN	mAP	8.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SlowFast + FasterRCNN	[email protected] role	25.93	—	Unverified