Fingerspelling PoseNet: Enhancing Fingerspelling Translation with Pose-Based Transformer Models

2023-11-20Code Available0· sign in to hype

Pooya Fayyazsanavi, Negar Nejatishahidin, Jana Kosecka

Code Available — Be the first to reproduce this paper.

Code

github.com/pooyafayyaz/fingerspelling-posenet
OfficialIn paperpytorch★ 5

Abstract

We address the task of American Sign Language fingerspelling translation using videos in the wild. We exploit advances in more accurate hand pose estimation and propose a novel architecture that leverages the transformer based encoder-decoder model enabling seamless contextual word translation. The translation model is augmented by a novel loss term that accurately predicts the length of the finger-spelled word, benefiting both training and inference. We also propose a novel two-stage inference approach that re-ranks the hypotheses using the language model capabilities of the decoder. Through extensive experiments, we demonstrate that our proposed method outperforms the state-of-the-art models on ChicagoFSWild and ChicagoFSWild+ achieving more than 10% relative improvement in performance. Our findings highlight the effectiveness of our approach and its potential to advance fingerspelling recognition in sign language translation. Code is also available at https://github.com/pooyafayyaz/Fingerspelling-PoseNet.

Tasks

Decoder Hand Pose Estimation Language Modeling Language Modelling Pose Estimation Sign Language Translation Translation Word Translation

Fingerspelling PoseNet: Enhancing Fingerspelling Translation with Pose-Based Transformer Models

Code

Abstract

Tasks

Reproductions