| PoseTalk: Text-and-Audio-based Pose Control and Motion Refinement for One-Shot Talking Head Generation | Sep 4, 2024 | Pose PredictionRhythm | —Unverified | 0 |
| C^2RL: Content and Context Representation Learning for Gloss-free Sign Language Translation and Retrieval | Aug 19, 2024 | Gloss-free Sign Language TranslationRepresentation Learning | —Unverified | 0 |
| Multiple Notch ligands in the synchronization of the segmentation clock | Aug 7, 2024 | Rhythm | —Unverified | 0 |
| Re-ENACT: Reinforcement Learning for Emotional Speech Generation using Actor-Critic Strategy | Aug 4, 2024 | reinforcement-learningReinforcement Learning | CodeCode Available | 0 |
| Analyzing and reducing the synthetic-to-real transfer gap in Music Information Retrieval: the task of automatic drum transcription | Jul 29, 2024 | Drum TranscriptionInformation Retrieval | —Unverified | 0 |
| Towards Realistic Emotional Voice Conversion using Controllable Emotional Intensity | Jul 20, 2024 | DiversityRhythm | —Unverified | 0 |
| Universal Facial Encoding of Codec Avatars from VR Headsets | Jul 17, 2024 | RhythmSelf-Supervised Learning | —Unverified | 0 |
| MuDiT & MuSiT: Alignment with Colloquial Expression in Description-to-Song Generation | Jul 3, 2024 | DescriptiveRhythm | —Unverified | 0 |
| Subtractive Training for Music Stem Insertion using Latent Diffusion Models | Jun 27, 2024 | Rhythm | —Unverified | 0 |
| A Dynamic Systems Approach to Modelling Human-Machine Rhythm Interaction | Jun 26, 2024 | Rhythm | —Unverified | 0 |