Human-Object Interaction Detection

Human-Object Interaction (HOI) detection is a task of identifying "a set of interactions" in an image, which involves the i) localization of the subject (i.e., humans) and target (i.e., objects) of interaction, and ii) the classification of the interaction labels.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 51–75 of 449 papers

Title	Date	Tasks	Status	Hype
Eye Gaze as a Signal for Conveying User Attention in Contextual AI Systems	Jan 23, 2025	FrictionHuman-Object Interaction Detection	—Unverified	0
Dynamic Scene Understanding from Vision-Language Representations	Jan 20, 2025	Grounded Situation RecognitionHuman-Human Interaction Recognition	—Unverified	0
DAViD: Modeling Dynamic Affordance of 3D Objects using Pre-trained Video Diffusion Models	Jan 14, 2025	Human-Object Interaction DetectionObject	—Unverified	0
From My View to Yours: Ego-Augmented Learning in Large Vision Language Models for Understanding Exocentric Daily Living Activities	Jan 10, 2025	Human-Object Interaction DetectionKnowledge Distillation	CodeCode Available	1
PersonaHOI: Effortlessly Improving Personalized Face with Human-Object Interaction Generation	Jan 10, 2025	Human-Object Interaction DetectionHuman-Object Interaction Generation	CodeCode Available	0
Vision-Guided Action: Enhancing 3D Human Motion Prediction with Gaze-informed Affordance in 3D Scenes	Jan 1, 2025	Human motion predictionHuman-Object Interaction Detection	—Unverified	0
PersonaHOI: Effortlessly Improving Face Personalization in Human-Object Interaction Generation	Jan 1, 2025	Human-Object Interaction DetectionHuman-Object Interaction Generation	CodeCode Available	0
Reasoning Mamba: Hypergraph-Guided Region Relation Calculating for Weakly Supervised Affordance Grounding	Jan 1, 2025	Human-Object Interaction DetectionMamba	—Unverified	0
HORP: Human-Object Relation Priors Guided HOI Detection	Jan 1, 2025	Human-Object Interaction DetectionObject	—Unverified	0
InterAct: Advancing Large-Scale Versatile 3D Human-Object Interaction Generation	Jan 1, 2025	BenchmarkingHuman-Object Interaction Detection	—Unverified	0
ChatHuman: Chatting about 3D Humans with Tools	Jan 1, 2025	Human-Object Interaction DetectionIn-Context Learning	—Unverified	0
ChainHOI: Joint-based Kinematic Chain Modeling for Human-Object Interaction Generation	Jan 1, 2025	Human-Object Interaction DetectionHuman-Object Interaction Generation	—Unverified	0
PICO: Reconstructing 3D People In Contact with Objects	Jan 1, 2025	Human-Object Interaction DetectionObject	—Unverified	0
Diffgrasp: Whole-Body Grasping Synthesis Guided by Object Motion Using a Diffusion Model	Dec 30, 2024	Human-Object Interaction Detection	—Unverified	0
SyncDiff: Synchronized Motion Diffusion for Multi-Body Human-Object Interaction Synthesis	Dec 28, 2024	Human AnimationHuman-Object Interaction Detection	—Unverified	0
Interacted Object Grounding in Spatio-Temporal Human-Object Interactions	Dec 27, 2024	Human-Object Interaction DetectionObject	CodeCode Available	1
ZeroHSI: Zero-Shot 4D Human-Scene Interaction by Video Generation	Dec 24, 2024	Human-Object Interaction DetectionVideo Generation	—Unverified	0
Human-Humanoid Robots Cross-Embodiment Behavior-Skill Transfer Using Decomposed Adversarial Learning from Demonstration	Dec 19, 2024	Human-Object Interaction Detectionmotion retargeting	CodeCode Available	4
Descriptive Caption Enhancement with Visual Specialists for Multimodal Perception	Dec 18, 2024	DescriptiveHuman-Object Interaction Detection	CodeCode Available	0
ContextHOI: Spatial Context Learning for Human-Object Interaction Detection	Dec 12, 2024	Human-Object Interaction DetectionObject	—Unverified	0
Orchestrating the Symphony of Prompt Distribution Learning for Human-Object Interaction Detection	Dec 11, 2024	Human-Object Interaction Detection	—Unverified	0
TriDi: Trilateral Diffusion of 3D Humans, Objects, and Interactions	Dec 9, 2024	Human-Object Interaction DetectionMixed Reality	—Unverified	0
Lifting Motion to the 3D World via 2D Diffusion	Nov 27, 2024	Human-Object Interaction Detection	—Unverified	0
OOD-HOI: Text-Driven 3D Whole-Body Human-Object Interactions Generation Beyond Training Domains	Nov 27, 2024	Human-Object Interaction Detection	—Unverified	0
VLM-HOI: Vision Language Models for Interpretable Human-Object Interaction Analysis	Nov 27, 2024	Human-Object Interaction DetectionImage-text matching	—Unverified	0

Show:10 25 50

← PrevPage 3 of 18Next →

All datasets HICO-DET V-COCO HICO VidHOI Ambiguious-HOI MECCANO

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Ours (PViC+)	mAP	46.49	—	Unverified
2	RLIPv2 (Swin-L)	mAP	45.09	—	Unverified
3	PViC-SwinL	mAP	44.32	—	Unverified
4	SOV-STG (Swin-L)	mAP	43.35	—	Unverified
5	DiffHOI	mAP	41.5	—	Unverified
6	ViPLO	mAP	37.22	—	Unverified
7	FGAHOI	mAP	37.18	—	Unverified
8	ERNet	mAP	36.89	—	Unverified
9	CQL+GEN-VLKT-L	mAP	36.03	—	Unverified
10	QAHOI (Swin-L)	mAP	35.78	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	RLIPv2	AP(S1)	72.1	—	Unverified
2	MUREN	AP(S1)	68.8	—	Unverified
3	STIP	AP(S1)	66	—	Unverified
4	DiffHOI	AP(S1)	65.7	—	Unverified
5	OCN (ResNet101)	AP(S1)	65.3	—	Unverified
6	OCN (ResNet50)	AP(S1)	64.2	—	Unverified
7	CDN (ResNet101)	AP(S1)	63.91	—	Unverified
8	HOICLIP	AP(S1)	63.5	—	Unverified
9	QPIC + CPC	MAP	63.1	—	Unverified
10	Body Part Interactiveness	AP(S1)	63	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DEFR	mAP	65.6	—	Unverified
2	HAKE	mAP	47.1	—	Unverified
3	PaStaNet	mAP	46.3	—	Unverified
4	RelViT	mAP	43.98	—	Unverified
5	Pairwise-Part	mAP	39.9	—	Unverified
6	Mallya & Lazebnik	mAP	36.1	—	Unverified
7	Girdhar & Ramanan	mAP	34.6	—	Unverified
8	R*CNN	mAP	28.5	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	HOI4ABOT	Detection: Full (mAP@0.5)	11.12	—	Unverified
2	ST-GAZE	Detection: Full (mAP@0.5)	10.4	—	Unverified
3	STTRAN	Detection: Full (mAP@0.5)	7.61	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DJ-RN	mAP	10.37	—	Unverified
2	iCAN	mAP	8.14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SlowFast + FasterRCNN	mAP@0.5 role	25.93	—	Unverified