| Analyzing long-term rhythm variations in Mising and Assamese using frequency domain correlates | Oct 26, 2024 | Rhythm | —Unverified | 0 |
| MuVi: Video-to-Music Generation with Semantic Alignment and Rhythmic Synchronization | Oct 16, 2024 | In-Context LearningMusic Generation | —Unverified | 0 |
| Swin-BERT: A Feature Fusion System designed for Speech-based Alzheimer's Dementia Detection | Oct 9, 2024 | Rhythm | —Unverified | 0 |
| Assessing the Circadian Rhythm of Cats Living in a Group using Accelerometers | Oct 9, 2024 | Rhythm | —Unverified | 0 |
| Spectral and Rhythm Features for Audio Classification with Deep Convolutional Neural Networks | Oct 9, 2024 | Audio ClassificationRhythm | —Unverified | 0 |
| Closed-Loop phase selection in EEG-TMS using Bayesian Optimization | Oct 8, 2024 | Bayesian OptimizationEEG | CodeCode Available | 0 |
| Exploring rhythm formant analysis for Indic language classification | Oct 8, 2024 | ClassificationRhythm | —Unverified | 0 |
| An Electrocardiogram Foundation Model Built on over 10 Million Recordings with External Evaluation across Multiple Domains | Oct 5, 2024 | DiagnosticEvent Detection | CodeCode Available | 2 |
| TCSinger: Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control | Sep 24, 2024 | ClusteringLanguage Modelling | CodeCode Available | 3 |
| Physics-Informed Neural Networks can accurately model cardiac electrophysiology in 3D geometries and fibrillatory conditions | Sep 18, 2024 | Rhythm | —Unverified | 0 |
| LHQ-SVC: Lightweight and High Quality Singing Voice Conversion Modeling | Sep 13, 2024 | CPURhythm | —Unverified | 0 |
| Robust Dual Gaussian Splatting for Immersive Human-centric Volumetric Videos | Sep 12, 2024 | Disentanglementmotion prediction | —Unverified | 0 |
| Hierarchical Symbolic Pop Music Generation with Graph Neural Networks | Sep 12, 2024 | Music GenerationRhythm | —Unverified | 0 |
| AS-Speech: Adaptive Style For Speech Synthesis | Sep 9, 2024 | RhythmSpeech Synthesis | —Unverified | 0 |
| PoseTalk: Text-and-Audio-based Pose Control and Motion Refinement for One-Shot Talking Head Generation | Sep 4, 2024 | Pose PredictionRhythm | —Unverified | 0 |
| C^2RL: Content and Context Representation Learning for Gloss-free Sign Language Translation and Retrieval | Aug 19, 2024 | Gloss-free Sign Language TranslationRepresentation Learning | —Unverified | 0 |
| Multiple Notch ligands in the synchronization of the segmentation clock | Aug 7, 2024 | Rhythm | —Unverified | 0 |
| Stem-JEPA: A Joint-Embedding Predictive Architecture for Musical Stem Compatibility Estimation | Aug 5, 2024 | RhythmSelf-Supervised Learning | CodeCode Available | 2 |
| Re-ENACT: Reinforcement Learning for Emotional Speech Generation using Actor-Critic Strategy | Aug 4, 2024 | reinforcement-learningReinforcement Learning | CodeCode Available | 0 |
| Analyzing and reducing the synthetic-to-real transfer gap in Music Information Retrieval: the task of automatic drum transcription | Jul 29, 2024 | Drum TranscriptionInformation Retrieval | —Unverified | 0 |
| MusiConGen: Rhythm and Chord Control for Transformer-Based Text-to-Music Generation | Jul 21, 2024 | DiversityMusic Generation | CodeCode Available | 2 |
| Towards Realistic Emotional Voice Conversion using Controllable Emotional Intensity | Jul 20, 2024 | DiversityRhythm | —Unverified | 0 |
| Universal Facial Encoding of Codec Avatars from VR Headsets | Jul 17, 2024 | RhythmSelf-Supervised Learning | —Unverified | 0 |
| MuDiT & MuSiT: Alignment with Colloquial Expression in Description-to-Song Generation | Jul 3, 2024 | DescriptiveRhythm | —Unverified | 0 |
| Subtractive Training for Music Stem Insertion using Latent Diffusion Models | Jun 27, 2024 | Rhythm | —Unverified | 0 |