SOTAVerified

DisCoRD: Discrete Tokens to Continuous Motion via Rectified Flow Decoding

2024-11-29Code Available0· sign in to hype

Jungbin Cho, Junwan Kim, Jisoo Kim, Minseo Kim, Mingu Kang, Sungeun Hong, Tae-Hyun Oh, Youngjae Yu

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

Human motion is inherently continuous and dynamic, posing significant challenges for generative models. While discrete generation methods are widely used, they suffer from limited expressiveness and frame-wise noise artifacts. In contrast, continuous approaches produce smoother, more natural motion but often struggle to adhere to conditioning signals due to high-dimensional complexity and limited training data. To resolve this discord between discrete and continuous representations, we introduce DisCoRD: Discrete Tokens to Continuous Motion via Rectified Flow Decoding, a novel method that leverages rectified flow to decode discrete motion tokens in the continuous, raw motion space. Our core idea is to frame token decoding as a conditional generation task, ensuring that DisCoRD captures fine-grained dynamics and achieves smoother, more natural motions. Compatible with any discrete-based framework, our method enhances naturalness without compromising faithfulness to the conditioning signals on diverse settings. Extensive evaluations Our project page is available at: https://whwjdqls.github.io/discord.github.io/.

Tasks

Benchmark Results

DatasetModelMetricClaimedVerifiedStatus
HumanML3DDisCoRD (+MoMask)FID0.03Unverified
KIT Motion-LanguageDisCoRD (+MoMask)FID0.17Unverified

Reproductions