DisCoRD: Discrete Tokens to Continuous Motion via Rectified Flow Decoding

2024-11-29

Jungbin Cho, Junwan Kim, Jisoo Kim, Minseo Kim, Mingu Kang, Sungeun Hong, Tae-Hyun Oh, Youngjae Yu

Code

github.com/whwjdqls/discord
Official, ★ 51

Abstract

Human motion is inherently continuous and dynamic, posing significant challenges for generative models. While discrete generation methods are widely used, they suffer from limited expressiveness and frame-wise noise artifacts. In contrast, continuous approaches produce smoother, more natural motion but often struggle to adhere to conditioning signals due to high-dimensional complexity and limited training data. To resolve this discord between discrete and continuous representations, we introduce DisCoRD: Discrete Tokens to Continuous Motion via Rectified Flow Decoding, a novel method that leverages rectified flow to decode discrete motion tokens in the continuous, raw motion space. Our core idea is to frame token decoding as a conditional generation task, ensuring that DisCoRD captures fine-grained dynamics and achieves smoother, more natural motions. Compatible with any discrete-based framework, our method enhances naturalness without compromising faithfulness to the conditioning signals across diverse settings. Extensive evaluations on HumanML3D and KIT Motion-Language are summarized under Benchmark Results. Our project page is available at: https://whwjdqls.github.io/discord.github.io/.
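
To make the idea of rectified flow decoding concrete, the following is a minimal NumPy sketch of the generic rectified flow recipe, not the authors' implementation. It assumes a straight-line path between Gaussian noise and the target motion, and decodes by Euler integration of a velocity field conditioned on token features. The names `rectified_flow_pair`, `euler_decode`, and `toy_velocity` are hypothetical, and `cond` is only a stand-in for features derived from discrete motion tokens; in practice the velocity field would be a trained network.

```python
import numpy as np


def rectified_flow_pair(x1, rng):
    """Build one training example on the straight path from noise x0 to data x1.

    Returns the interpolated point x_t, the sampled time t in [0, 1),
    and the constant velocity target x1 - x0.
    """
    x0 = rng.standard_normal(x1.shape)   # noise endpoint
    t = rng.uniform()                    # time sampled uniformly in [0, 1)
    x_t = (1.0 - t) * x0 + t * x1        # linear interpolation
    v_target = x1 - x0                   # velocity of the straight path
    return x_t, t, v_target


def euler_decode(velocity_fn, cond, shape, num_steps=16, rng=None):
    """Integrate dx/dt = velocity_fn(x, t, cond) from t=0 to t=1 with Euler steps."""
    rng = np.random.default_rng() if rng is None else rng
    x = rng.standard_normal(shape)       # start from Gaussian noise
    dt = 1.0 / num_steps
    for i in range(num_steps):
        t = i * dt                       # t never reaches 1, so 1 - t stays positive
        x = x + dt * velocity_fn(x, t, cond)
    return x


def toy_velocity(x, t, cond):
    """Hypothetical stand-in for a learned network.

    Assumes the target equals `cond` exactly, in which case the ideal
    straight-path velocity at (x, t) is (cond - x) / (1 - t).
    """
    return (cond - x) / (1.0 - t)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    cond = np.ones((4, 3))               # placeholder for token-derived features
    motion = euler_decode(toy_velocity, cond, cond.shape, num_steps=8, rng=rng)
    print(np.allclose(motion, cond))
```

In this toy setting the decoded sample lands on `cond` because the velocity field is exact; with a learned field, the number of Euler steps trades decoding cost against integration error.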

Tasks

Motion Synthesis, Quantization

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
HumanML3D	DisCoRD (+MoMask)	FID	0.03	—	Unverified
KIT Motion-Language	DisCoRD (+MoMask)	FID	0.17	—	Unverified
