Human-Object Interaction Detection

Human-Object Interaction (HOI) detection is the task of identifying the set of interactions in an image. It involves (i) localizing the subject (the human) and the target (the object) of each interaction, and (ii) classifying the interaction label.
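As a rough illustration of the definition above, an HOI prediction can be represented as a human box, an object box, and an interaction label, and then compared against an annotation. The sketch below is only an assumption-laden example: the names `HOIInstance`, `iou`, and `is_match` are hypothetical, and the 0.5 IoU threshold on both boxes is an assumed matching rule rather than something this page specifies.

```python
from dataclasses import dataclass
from typing import Tuple

# Axis-aligned box as (x1, y1, x2, y2) in pixels (assumed convention).
Box = Tuple[float, float, float, float]


@dataclass(frozen=True)
class HOIInstance:
    """One interaction: who (human box), what (object box and class), how (verb)."""
    human_box: Box
    object_box: Box
    object_label: str
    interaction: str


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def is_match(pred: HOIInstance, gt: HOIInstance, thr: float = 0.5) -> bool:
    """Assumed rule: labels agree and BOTH boxes overlap the annotation by >= thr."""
    return (
        pred.interaction == gt.interaction
        and pred.object_label == gt.object_label
        and iou(pred.human_box, gt.human_box) >= thr
        and iou(pred.object_box, gt.object_box) >= thr
    )


# Made-up example: a person riding a bicycle.
gt = HOIInstance((10, 10, 110, 210), (90, 120, 190, 220), "bicycle", "ride")
pred = HOIInstance((12, 8, 108, 205), (95, 125, 195, 225), "bicycle", "ride")
print(is_match(pred, gt))
```

Requiring both boxes to overlap means a prediction with the right verb but a poorly localized human or object does not count, which reflects the two parts of the task: localization and classification.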

Papers

Showing 201–250 of 449 papers

Title	Date	Tasks	Status	Hype
Mining Conditional Part Semantics with Occluded Extrapolation for Human-Object Interaction Detection	Jul 19, 2023	Human-Object Interaction Detection, Object	Unverified	0
Focusing on what to decode and what to train: SOV Decoding with Specific Target Guided DeNoising and Vision Language Advisor	Jul 5, 2023	Decoder, Denoising	Code Available	0
HOKEM: Human and Object Keypoint-based Extension Module for Human-Object Interaction Detection	Jun 25, 2023	Human-Object Interaction Detection, Object	Unverified	0
First Place Solution to the CVPR'2023 AQTC Challenge: A Function-Interaction Centric Approach with Spatiotemporal Visual-Language Alignment	Jun 23, 2023	Human-Object Interaction Detection	Code Available	1
Exploiting Multimodal Synthetic Data for Egocentric Human-Object Interaction Detection in an Industrial Scenario	Jun 21, 2023	Human-Object Interaction Detection	Code Available	0
Human-Object Interaction Prediction in Videos through Gaze Following	Jun 6, 2023	Human-Object Interaction Anticipation, Human-Object Interaction Detection	Code Available	1
An Abstract Specification of VoxML as an Annotation Language	May 22, 2023	Human Agent Collaboration, Human-Object Interaction Detection	Unverified	0
Synthesizing Diverse Human Motions in 3D Indoor Scenes	May 21, 2023	Collision Avoidance, Human-Object Interaction Detection	Unverified	0
Boosting Human-Object Interaction Detection with Text-to-Image Diffusion Model	May 20, 2023	Diversity, Human-Object Interaction Detection	Code Available	1
HICO-DET-SG and V-COCO-SG: New Data Splits for Evaluating the Systematic Generalization Performance of Human-Object Interaction Detection Models	May 17, 2023	Human-Object Interaction Detection, Systematic Generalization	Code Available	0
Group Activity Recognition via Dynamic Composition and Interaction	May 9, 2023	Activity Recognition, Group Activity Recognition	Unverified	0
Modelling Spatio-Temporal Interactions for Compositional Action Recognition	May 4, 2023	Action Recognition, Human-Object Interaction Detection	Unverified	0
Compositional 3D Human-Object Neural Animation	Apr 27, 2023	Human-Object Interaction Detection, NeRF	Unverified	0
What Happened 3 Seconds Ago? Inferring the Past with Thermal Imaging	Apr 26, 2023	Human-Object Interaction Detection, Pose Estimation	Code Available	1
HOSNeRF: Dynamic Human-Object-Scene Neural Radiance Fields from a Single Video	Apr 24, 2023	Human-Object Interaction Detection, Object	Unverified	0
Video-based Contrastive Learning on Decision Trees: from Action Recognition to Autism Diagnosis	Apr 20, 2023	Action Recognition, Binary Classification	Unverified	0
ViPLO: Vision Transformer based Pose-Conditioned Self-Loop Graph for Human-Object Interaction Detection	Apr 17, 2023	Human-Object Interaction Detection, Quantization	Code Available	1
Relational Context Learning for Human-Object Interaction Detection	Apr 11, 2023	Decoder, Human-Object Interaction Detection	Code Available	1
StillFast: An End-to-End Approach for Short-Term Object Interaction Anticipation	Apr 8, 2023	Human-Object Interaction Detection, Object	Code Available	1
Instant-NVR: Instant Neural Volumetric Rendering for Human-object Interactions from Monocular RGBD Stream	Apr 6, 2023	Human-Object Interaction Detection, Novel View Synthesis	Unverified	0
Visibility Aware Human-Object Interaction Tracking from Single RGB Camera	Mar 29, 2023	3D Human Reconstruction, 3D Object Reconstruction	Unverified	0
HOICLIP: Efficient Knowledge Transfer for HOI Detection with Vision-Language Models	Mar 28, 2023	Decoder, Human-Object Interaction Detection	Code Available	1
Category Query Learning for Human-Object Interaction Classification	Mar 24, 2023	Classification, Decoder	Code Available	1
Task-Oriented Human-Object Interactions Generation with Implicit Neural Representations	Mar 23, 2023	Human-Object Interaction Detection, Motion Estimation	Unverified	0
Unified Visual Relationship Detection with Vision and Language Models	Mar 16, 2023	Human-Object Interaction Detection, Relationship Detection	Code Available	0
Weakly-Supervised HOI Detection from Interaction Labels Only and Language/Vision-Language Priors	Mar 9, 2023	Human-Object Interaction Detection, Language Modeling	Unverified	0
TMHOI: Translational Model for Human-Object Interaction Detection	Mar 7, 2023	Computational Efficiency, Human-Object Interaction Detection	Unverified	0
Weakly-supervised HOI Detection via Prior-guided Bi-level Representation Learning	Mar 2, 2023	Human-Object Interaction Detection, Knowledge Distillation	Unverified	0
Fine-grained Affordance Annotation for Egocentric Hand-Object Interaction Videos	Feb 7, 2023	Action Anticipation, Action Recognition	Code Available	0
ERNet: Efficient and Reliable Human-Object Interaction Detection	Jan 26, 2023	Human-Object Interaction Detection, Object	Code Available	0
Parallel Reasoning Network for Human-Object Interaction Detection	Jan 9, 2023	Human-Object Interaction Detection, Object	Unverified	0
FGAHOI: Fine-Grained Anchors for Human-Object Interaction Detection	Jan 8, 2023	Human-Object Interaction Detection, Object	Code Available	1
Open Set Video HOI detection from Action-Centric Chain-of-Look Prompting	Jan 1, 2023	Human-Object Interaction Detection, Language Modelling	Unverified	0
CHORUS: Learning Canonicalized 3D Human-Object Spatial Relations from Unbounded Synthesized Images	Jan 1, 2023	Common Sense Reasoning, Diversity	Unverified	0
Open-Category Human-Object Interaction Pre-Training via Language Modeling Framework	Jan 1, 2023	Human-Object Interaction Detection, Language Modeling	Unverified	0
Leverage Interactive Affinity for Affordance Learning	Jan 1, 2023	Human-Object Interaction Detection, Object	Code Available	0
DropKey for Vision Transformer	Jan 1, 2023	Human-Object Interaction Detection, image-classification	Unverified	0
Full-Body Articulated Human-Object Interaction	Dec 20, 2022	Action Recognition, Human-Object Interaction Detection	Code Available	1
NeuralDome: A Neural Modeling Pipeline on Multi-View Human-Object Interactions	Dec 15, 2022	Human-Object Interaction Detection	Unverified	0
IMos: Intent-Driven Full-Body Motion Synthesis for Human-Object Interactions	Dec 14, 2022	Human-Object Interaction Detection, Motion Synthesis	Code Available	1
Interaction Region Visual Transformer for Egocentric Action Anticipation	Nov 25, 2022	Action Anticipation, Human-Object Interaction Detection	Code Available	0
Exploring State Change Capture of Heterogeneous Backbones @ Ego4D Hands and Objects Challenge 2022	Nov 16, 2022	Human-Object Interaction Detection, Object	Unverified	0
Discovering A Variety of Objects in Spatio-Temporal Human-Object Interactions	Nov 14, 2022	Human-Object Interaction Detection, Object	Code Available	1
PoseGPT: Quantization-based 3D Human Motion Generation and Forecasting	Oct 19, 2022	Human-Object Interaction Detection, Motion Generation	Code Available	1
Visual Object Tracking in First Person Vision	Sep 27, 2022	Human-Object Interaction Detection, Object	Unverified	0
MECCANO: A Multimodal Egocentric Dataset for Humans Behavior Understanding in the Industrial-like Domain	Sep 19, 2022	Action Anticipation, Action Recognition	Unverified	0
Articulated 3D Human-Object Interactions from RGB Videos: An Empirical Analysis of Approaches and Challenges	Sep 12, 2022	3D Reconstruction, Human-Object Interaction Detection	Code Available	1
Graphing the Future: Activity and Next Active Object Prediction using Graph-based Activity Representations	Sep 12, 2022	Graph Matching, Human-Object Interaction Detection	Unverified	0
Reconstructing Action-Conditioned Human-Object Interactions Using Commonsense Knowledge Priors	Sep 6, 2022	Human-Object Interaction Detection, Object	Unverified	0
SR-GNN: Spatial Relation-aware Graph Neural Network for Fine-Grained Image Categorization	Sep 5, 2022	Fine-Grained Image Classification, Graph Neural Network	Code Available	1


Datasets (the benchmark tables appear in this order): HICO-DET, V-COCO, HICO, VidHOI, Ambiguous-HOI, MECCANO

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Ours (PViC+)	mAP	46.49	—	Unverified
2	RLIPv2 (Swin-L)	mAP	45.09	—	Unverified
3	PViC-SwinL	mAP	44.32	—	Unverified
4	SOV-STG (Swin-L)	mAP	43.35	—	Unverified
5	DiffHOI	mAP	41.5	—	Unverified
6	ViPLO	mAP	37.22	—	Unverified
7	FGAHOI	mAP	37.18	—	Unverified
8	ERNet	mAP	36.89	—	Unverified
9	CQL+GEN-VLKT-L	mAP	36.03	—	Unverified
10	QAHOI (Swin-L)	mAP	35.78	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	RLIPv2	AP(S1)	72.1	—	Unverified
2	MUREN	AP(S1)	68.8	—	Unverified
3	STIP	AP(S1)	66	—	Unverified
4	DiffHOI	AP(S1)	65.7	—	Unverified
5	OCN (ResNet101)	AP(S1)	65.3	—	Unverified
6	OCN (ResNet50)	AP(S1)	64.2	—	Unverified
7	CDN (ResNet101)	AP(S1)	63.91	—	Unverified
8	HOICLIP	AP(S1)	63.5	—	Unverified
9	QPIC + CPC	mAP	63.1	—	Unverified
10	Body Part Interactiveness	AP(S1)	63	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DEFR	mAP	65.6	—	Unverified
2	HAKE	mAP	47.1	—	Unverified
3	PaStaNet	mAP	46.3	—	Unverified
4	RelViT	mAP	43.98	—	Unverified
5	Pairwise-Part	mAP	39.9	—	Unverified
6	Mallya & Lazebnik	mAP	36.1	—	Unverified
7	Girdhar & Ramanan	mAP	34.6	—	Unverified
8	R*CNN	mAP	28.5	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	HOI4ABOT	Detection: Full (mAP@0.5)	11.12	—	Unverified
2	ST-GAZE	Detection: Full (mAP@0.5)	10.4	—	Unverified
3	STTRAN	Detection: Full (mAP@0.5)	7.61	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DJ-RN	mAP	10.37	—	Unverified
2	iCAN	mAP	8.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SlowFast + FasterRCNN	mAP@0.5 role	25.93	—	Unverified