Human-Object Interaction Detection

Human-Object Interaction (HOI) detection is the task of identifying the set of interactions in an image. It involves (i) localizing the subject (a human) and the target (an object) of each interaction, and (ii) classifying the interaction label.
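One way to picture the output described above is as a list of scored records, each pairing a human box and an object box with an interaction label. The sketch below is only illustrative: the `HOIInstance` class, its field names, the `(x1, y1, x2, y2)` box convention, the helper `interactions_for_image`, and the score threshold are all assumptions made for this example, not the format of any particular dataset or model listed on this page.

```python
from dataclasses import dataclass
from typing import List, Tuple

# Assumed box convention for this sketch: (x1, y1, x2, y2) in pixels.
Box = Tuple[float, float, float, float]


@dataclass(frozen=True)
class HOIInstance:
    """One detected interaction (hypothetical structure)."""
    human_box: Box       # localization of the subject
    object_box: Box      # localization of the target
    object_label: str    # category of the target object
    interaction: str     # classified interaction label (verb)
    score: float = 1.0   # confidence of the whole record


def interactions_for_image(
    instances: List[HOIInstance], min_score: float = 0.5
) -> List[Tuple[str, str]]:
    """Return (interaction, object_label) pairs whose score meets min_score."""
    return [
        (inst.interaction, inst.object_label)
        for inst in instances
        if inst.score >= min_score
    ]


# Made-up predictions for a single image.
predictions = [
    HOIInstance((10, 20, 110, 220), (90, 150, 200, 230), "bicycle", "ride", 0.91),
    HOIInstance((10, 20, 110, 220), (300, 40, 340, 90), "cup", "hold", 0.12),
]

print(interactions_for_image(predictions))  # [('ride', 'bicycle')]
```

The same human box appears in both records, which reflects that one person can take part in several interactions, so the output is a set of pairs rather than one label per person.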

Papers

Showing 301–350 of 449 papers

Title	Date	Tasks	Status
Task-Oriented Human-Object Interactions Generation with Implicit Neural Representations	Mar 23, 2023	Human-Object Interaction Detection, Motion Estimation	Unverified
Unified Visual Relationship Detection with Vision and Language Models	Mar 16, 2023	Human-Object Interaction Detection, Relationship Detection	Unverified
Weakly-Supervised HOI Detection from Interaction Labels Only and Language/Vision-Language Priors	Mar 9, 2023	Human-Object Interaction Detection, Language Modeling	Unverified
TMHOI: Translational Model for Human-Object Interaction Detection	Mar 7, 2023	Computational Efficiency, Human-Object Interaction Detection	Unverified
Weakly-supervised HOI Detection via Prior-guided Bi-level Representation Learning	Mar 2, 2023	Human-Object Interaction Detection, Knowledge Distillation	Unverified
Fine-grained Affordance Annotation for Egocentric Hand-Object Interaction Videos	Feb 7, 2023	Action Anticipation, Action Recognition	Code Available
ERNet: Efficient and Reliable Human-Object Interaction Detection	Jan 26, 2023	Human-Object Interaction Detection, Object	Code Available
Parallel Reasoning Network for Human-Object Interaction Detection	Jan 9, 2023	Human-Object Interaction Detection, Object	Unverified
Leverage Interactive Affinity for Affordance Learning	Jan 1, 2023	Human-Object Interaction Detection, Object	Code Available
CHORUS: Learning Canonicalized 3D Human-Object Spatial Relations from Unbounded Synthesized Images	Jan 1, 2023	Common Sense Reasoning, Diversity	Unverified
Open Set Video HOI detection from Action-Centric Chain-of-Look Prompting	Jan 1, 2023	Human-Object Interaction Detection, Language Modelling	Unverified
DropKey for Vision Transformer	Jan 1, 2023	Human-Object Interaction Detection, image-classification	Unverified
Open-Category Human-Object Interaction Pre-Training via Language Modeling Framework	Jan 1, 2023	Human-Object Interaction Detection, Language Modeling	Unverified
NeuralDome: A Neural Modeling Pipeline on Multi-View Human-Object Interactions	Dec 15, 2022	Human-Object Interaction Detection	Unverified
Interaction Region Visual Transformer for Egocentric Action Anticipation	Nov 25, 2022	Action Anticipation, Human-Object Interaction Detection	Code Available
Exploring State Change Capture of Heterogeneous Backbones @ Ego4D Hands and Objects Challenge 2022	Nov 16, 2022	Human-Object Interaction Detection, Object	Unverified
Visual Object Tracking in First Person Vision	Sep 27, 2022	Human-Object Interaction Detection, Object	Unverified
MECCANO: A Multimodal Egocentric Dataset for Humans Behavior Understanding in the Industrial-like Domain	Sep 19, 2022	Action Anticipation, Action Recognition	Unverified
Graphing the Future: Activity and Next Active Object Prediction using Graph-based Activity Representations	Sep 12, 2022	Graph Matching, Human-Object Interaction Detection	Unverified
Reconstructing Action-Conditioned Human-Object Interactions Using Commonsense Knowledge Priors	Sep 6, 2022	Human-Object Interaction Detection, Object	Unverified
DropKey	Aug 4, 2022	Human-Object Interaction Detection, image-classification	Unverified
SAVCHOI: Detecting Suspicious Activities using Dense Video Captioning with Human Object Interactions	Jul 24, 2022	Dense Captioning, Dense Video Captioning	Unverified
Knowledge Guided Bidirectional Attention Network for Human-Object Interaction Detection	Jul 16, 2022	Decoder, Human-Object Interaction Detection	Unverified
A Skeleton-aware Graph Convolutional Network for Human-Object Interaction Detection	Jul 11, 2022	Human-Object Interaction Detection, Object	Code Available
Learning Structured Representations of Visual Scenes	Jul 9, 2022	Human-Object Interaction Detection, Representation Learning	Unverified
Chairs Can be Stood on: Overcoming Object Bias in Human-Object Interaction Detection	Jul 6, 2022	Human-Object Interaction Detection, Object	Code Available
Distance Matters in Human-Object Interaction Detection	Jul 5, 2022	Human-Object Interaction Detection, Object	Code Available
Learn to Predict How Humans Manipulate Large-sized Objects from Interactive Motions	Jun 25, 2022	Graph Neural Network, Human-Object Interaction Detection	Unverified
Precise Affordance Annotation for Egocentric Action Video Datasets	Jun 11, 2022	Action Anticipation, Affordance Recognition	Unverified
Spatial Parsing and Dynamic Temporal Pooling networks for Human-Object Interaction detection	Jun 7, 2022	Human-Object Interaction Detection, Object	Unverified
Video-based Human-Object Interaction Detection from Tubelet Tokens	Jun 4, 2022	Human-Object Interaction Detection	Unverified
Interaction Replica: Tracking Human-Object Interaction and Scene Changes From Human Motion	May 5, 2022	Human-Object Interaction Detection, Object	Unverified
COUCH: Towards Controllable Human-Chair Interactions	May 1, 2022	Human-Object Interaction Detection, Object	Unverified
Persistent-Transient Duality in Human Behavior Modeling	Apr 21, 2022	Human-Object Interaction Detection, motion prediction	Unverified
Human-Object Interaction Detection via Disentangled Transformer	Apr 20, 2022	Decoder, Human-Object Interaction Detection	Unverified
THORN: Temporal Human-Object Relation Network for Action Recognition	Apr 20, 2022	Action Recognition, Human-Object Interaction Detection	Unverified
Interactiveness Field in Human-Object Interactions	Apr 16, 2022	Human-Object Interaction Detection, Object	Code Available
Egocentric Human-Object Interaction Detection Exploiting Synthetic Data	Apr 14, 2022	Human-Object Interaction Detection, Object	Code Available
Consistency Learning via Decoding Path Augmentation for Transformers in Human Object Interaction Detection	Apr 11, 2022	Human-Object Interaction Detection, object-detection	Code Available
Category-Aware Transformer Network for Better Human-Object Interaction Detection	Apr 11, 2022	Human-Object Interaction Detection, Object	Unverified
What to look at and where: Semantic and Spatial Refined Transformer for detecting human-object interactions	Apr 2, 2022	Decoder, Human-Object Interaction Detection	Unverified
MSTR: Multi-Scale Transformer for End-to-End Human-Object Interaction Detection	Mar 28, 2022	Decoder, Human-Object Interaction Detection	Unverified
Iwin: Human-Object Interaction Detection via Transformer with Irregular Windows	Mar 20, 2022	Human-Object Interaction Detection, Object	Unverified
The Overlooked Classifier in Human-Object Interaction Recognition	Mar 10, 2022	Classification, Human-Object Interaction Detection	Unverified
NeuralHOFusion: Neural Volumetric Rendering under Human-object Interactions	Feb 25, 2022	Human-Object Interaction Detection, Multi-Task Learning	Unverified
Effective Actor-centric Human-object Interaction Detection	Feb 24, 2022	Human-Object Interaction Detection, Object	Unverified
Highlighting Object Category Immunity for the Generalization of Human-Object Interaction Detection	Feb 19, 2022	Human-Object Interaction Detection, Object	Code Available
Webly Supervised Concept Expansion for General Purpose Vision Models	Feb 4, 2022	Human-Object Interaction Detection, Image Retrieval	Unverified
Multi-Stage Deep Transfer Learning for EmIoT-enabled Human-Computer Interaction	Feb 3, 2022	Human-Object Interaction Detection, text-to-speech	Unverified
DexVIP: Learning Dexterous Grasping with Human Hand Pose Priors from Video	Feb 1, 2022	Deep Reinforcement Learning, Human-Object Interaction Detection	Unverified

Benchmark Results

Datasets: HICO-DET, V-COCO, HICO, VidHOI, Ambiguous-HOI, MECCANO

HICO-DET

#	Model	Metric	Claimed	Verified	Status
1	Ours (PViC+)	mAP	46.49	—	Unverified
2	RLIPv2 (Swin-L)	mAP	45.09	—	Unverified
3	PViC-SwinL	mAP	44.32	—	Unverified
4	SOV-STG (Swin-L)	mAP	43.35	—	Unverified
5	DiffHOI	mAP	41.5	—	Unverified
6	ViPLO	mAP	37.22	—	Unverified
7	FGAHOI	mAP	37.18	—	Unverified
8	ERNet	mAP	36.89	—	Unverified
9	CQL+GEN-VLKT-L	mAP	36.03	—	Unverified
10	QAHOI (Swin-L)	mAP	35.78	—	Unverified

V-COCO

#	Model	Metric	Claimed	Verified	Status
1	RLIPv2	AP(S1)	72.1	—	Unverified
2	MUREN	AP(S1)	68.8	—	Unverified
3	STIP	AP(S1)	66	—	Unverified
4	DiffHOI	AP(S1)	65.7	—	Unverified
5	OCN (ResNet101)	AP(S1)	65.3	—	Unverified
6	OCN (ResNet50)	AP(S1)	64.2	—	Unverified
7	CDN (ResNet101)	AP(S1)	63.91	—	Unverified
8	HOICLIP	AP(S1)	63.5	—	Unverified
9	QPIC + CPC	mAP	63.1	—	Unverified
10	Body Part Interactiveness	AP(S1)	63	—	Unverified

HICO

#	Model	Metric	Claimed	Verified	Status
1	DEFR	mAP	65.6	—	Unverified
2	HAKE	mAP	47.1	—	Unverified
3	PaStaNet	mAP	46.3	—	Unverified
4	RelViT	mAP	43.98	—	Unverified
5	Pairwise-Part	mAP	39.9	—	Unverified
6	Mallya & Lazebnik	mAP	36.1	—	Unverified
7	Girdhar & Ramanan	mAP	34.6	—	Unverified
8	R*CNN	mAP	28.5	—	Unverified

VidHOI

#	Model	Metric	Claimed	Verified	Status
1	HOI4ABOT	Detection: Full (mAP@0.5)	11.12	—	Unverified
2	ST-GAZE	Detection: Full (mAP@0.5)	10.4	—	Unverified
3	STTRAN	Detection: Full (mAP@0.5)	7.61	—	Unverified

Ambiguous-HOI

#	Model	Metric	Claimed	Verified	Status
1	DJ-RN	mAP	10.37	—	Unverified
2	iCAN	mAP	8.14	—	Unverified

MECCANO

#	Model	Metric	Claimed	Verified	Status
1	SlowFast + FasterRCNN	mAP@0.5 role	25.93	—	Unverified