| C^2RL: Content and Context Representation Learning for Gloss-free Sign Language Translation and Retrieval | Aug 19, 2024 | Gloss-free Sign Language TranslationRepresentation Learning | —Unverified | 0 |
| Multiple Notch ligands in the synchronization of the segmentation clock | Aug 7, 2024 | Rhythm | —Unverified | 0 |
| Stem-JEPA: A Joint-Embedding Predictive Architecture for Musical Stem Compatibility Estimation | Aug 5, 2024 | RhythmSelf-Supervised Learning | CodeCode Available | 2 |
| Re-ENACT: Reinforcement Learning for Emotional Speech Generation using Actor-Critic Strategy | Aug 4, 2024 | reinforcement-learningReinforcement Learning | CodeCode Available | 0 |
| Analyzing and reducing the synthetic-to-real transfer gap in Music Information Retrieval: the task of automatic drum transcription | Jul 29, 2024 | Drum TranscriptionInformation Retrieval | —Unverified | 0 |
| MusiConGen: Rhythm and Chord Control for Transformer-Based Text-to-Music Generation | Jul 21, 2024 | DiversityMusic Generation | CodeCode Available | 2 |
| Towards Realistic Emotional Voice Conversion using Controllable Emotional Intensity | Jul 20, 2024 | DiversityRhythm | —Unverified | 0 |
| Universal Facial Encoding of Codec Avatars from VR Headsets | Jul 17, 2024 | RhythmSelf-Supervised Learning | —Unverified | 0 |
| MuDiT & MuSiT: Alignment with Colloquial Expression in Description-to-Song Generation | Jul 3, 2024 | DescriptiveRhythm | —Unverified | 0 |
| Subtractive Training for Music Stem Insertion using Latent Diffusion Models | Jun 27, 2024 | Rhythm | —Unverified | 0 |