SOTAVerified

Multimodal Datasets and Benchmarks for Reasoning about Dynamic Spatio-Temporality in Everyday Environments

2024-08-21Unverified0· sign in to hype

Takanori Ugai, Kensho Hara, Shusaku Egami, Ken Fukuda

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

We used a 3D simulator to create artificial video data with standardized annotations, aiming to aid in the development of Embodied AI. Our question answering (QA) dataset measures the extent to which a robot can understand human behavior and the environment in a home setting. Preliminary experiments suggest our dataset is useful in measuring AI's comprehension of daily life. abstract

Tasks

Reproductions