| Unsupervised Rhythm and Voice Conversion of Dysarthric to Healthy Speech for ASR | Jan 17, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Perception-Guided EEG Analysis: A Deep Learning Approach Inspired by Level of Detail (LOD) Theory | Jan 11, 2025 | EEG, Rhythm | Unverified | 0 |
| Combining YOLO and Visual Rhythm for Vehicle Counting | Jan 8, 2025 | Rhythm, vehicle detection | Code Available | 0 |
| Real-Time Textless Dialogue Generation | Jan 8, 2025 | Dialogue Generation, Rhythm | Code Available | 0 |
| Efficient License Plate Recognition in Videos Using Visual Rhythm and Accumulative Line Analysis | Jan 8, 2025 | License Plate Detection, License Plate Recognition | Code Available | 0 |
| Optimizing Audio Compression Through Entropy-Controlled Dithering | Jan 4, 2025 | Audio Compression, Rhythm | Unverified | 0 |
| Efficient Video-Based ALPR System Using YOLO and Visual Rhythm | Jan 4, 2025 | License Plate Recognition, Optical Character Recognition | Code Available | 0 |
| Aesthetic Matters in Music Perception for Image Stylization: A Emotion-driven Music-to-Visual Manipulation | Jan 3, 2025 | EEG, Image Stylization | Unverified | 0 |
| Machine Learning-Based Differential Diagnosis of Parkinson's Disease Using Kinematic Feature Extraction and Selection | Jan 2, 2025 | Diagnostic, feature selection | Unverified | 0 |
| Low count of optically pumped magnetometers furnishes a reliable real-time access to sensorimotor rhythm | Dec 24, 2024 | Brain Computer Interface, Motor Imagery | Unverified | 0 |
| Cech Complex Generation with Homotopy Equivalence Framework for Myocardial Infarction Diagnosis using Electrocardiogram Signals | Dec 23, 2024 | Anomaly Detection, Rhythm | Unverified | 0 |
| SemTalk: Holistic Co-speech Motion Generation with Frame-level Semantic Emphasis | Dec 21, 2024 | Gesture Generation, Motion Generation | Unverified | 0 |
| Relationships between Keywords and Strong Beats in Lyrical Music | Dec 5, 2024 | Rhythm | Unverified | 0 |
| Representation Purification for End-to-End Speech Translation | Dec 5, 2024 | Machine Translation, Rhythm | Unverified | 0 |
| Text Is Not All You Need: Multimodal Prompting Helps LLMs Understand Humor | Dec 1, 2024 | All, Natural Language Understanding | Unverified | 0 |
| MSEMG: Surface Electromyography Denoising with a Mamba-based Efficient Network | Nov 28, 2024 | Denoising, Mamba | Code Available | 0 |
| Synthetic ECG Generation for Data Augmentation and Transfer Learning in Arrhythmia Classification | Nov 27, 2024 | Data Augmentation, Rhythm | Unverified | 0 |
| DAIRHuM: A Platform for Directly Aligning AI Representations with Human Musical Judgments applied to Carnatic Music | Nov 22, 2024 | Rhythm, Specificity | Unverified | 0 |
| AnyECG: Foundational Models for Multitask Cardiac Analysis in Real-World Settings | Nov 17, 2024 | Anomaly Detection, Arrhythmia Detection | Unverified | 0 |
| A Hybrid Artificial Intelligence System for Automated EEG Background Analysis and Report Generation | Nov 15, 2024 | Anomaly Detection, Diagnostic | Code Available | 0 |
| More variable circadian rhythms in epilepsy captured by long-term heart rate recordings from wearable sensors | Nov 7, 2024 | Rhythm | Code Available | 0 |
| Unveiling Placental Development in Circadian Rhythm-Disrupted Mice: A Photo-acoustic Imaging Study on Unstained Tissue | Nov 7, 2024 | Diagnostic, Rhythm | Unverified | 0 |
| A Contrastive Self-Supervised Learning scheme for beat tracking amenable to few-shot learning | Nov 6, 2024 | Beat Tracking, Few-Shot Learning | Unverified | 0 |
| Advancements and limitations of LLMs in replicating human color-word associations | Nov 4, 2024 | Rhythm | Unverified | 0 |
| Speech is More Than Words: Do Speech-to-Text Translation Systems Leverage Prosody? | Oct 31, 2024 | Rhythm, speech-recognition | Unverified | 0 |
| Analyzing long-term rhythm variations in Mising and Assamese using frequency domain correlates | Oct 26, 2024 | Rhythm | Unverified | 0 |
| MuVi: Video-to-Music Generation with Semantic Alignment and Rhythmic Synchronization | Oct 16, 2024 | In-Context Learning, Music Generation | Unverified | 0 |
| Swin-BERT: A Feature Fusion System designed for Speech-based Alzheimer's Dementia Detection | Oct 9, 2024 | Rhythm | Unverified | 0 |
| Assessing the Circadian Rhythm of Cats Living in a Group using Accelerometers | Oct 9, 2024 | Rhythm | Unverified | 0 |
| Spectral and Rhythm Features for Audio Classification with Deep Convolutional Neural Networks | Oct 9, 2024 | Audio Classification, Rhythm | Unverified | 0 |
| Closed-Loop phase selection in EEG-TMS using Bayesian Optimization | Oct 8, 2024 | Bayesian Optimization, EEG | Code Available | 0 |
| Exploring rhythm formant analysis for Indic language classification | Oct 8, 2024 | Classification, Rhythm | Unverified | 0 |
| An Electrocardiogram Foundation Model Built on over 10 Million Recordings with External Evaluation across Multiple Domains | Oct 5, 2024 | Diagnostic, Event Detection | Code Available | 2 |
| TCSinger: Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control | Sep 24, 2024 | Clustering, Language Modelling | Code Available | 3 |
| Physics-Informed Neural Networks can accurately model cardiac electrophysiology in 3D geometries and fibrillatory conditions | Sep 18, 2024 | Rhythm | Unverified | 0 |
| LHQ-SVC: Lightweight and High Quality Singing Voice Conversion Modeling | Sep 13, 2024 | CPU, Rhythm | Unverified | 0 |
| Robust Dual Gaussian Splatting for Immersive Human-centric Volumetric Videos | Sep 12, 2024 | Disentanglement, motion prediction | Unverified | 0 |
| Hierarchical Symbolic Pop Music Generation with Graph Neural Networks | Sep 12, 2024 | Music Generation, Rhythm | Unverified | 0 |
| AS-Speech: Adaptive Style For Speech Synthesis | Sep 9, 2024 | Rhythm, Speech Synthesis | Unverified | 0 |
| PoseTalk: Text-and-Audio-based Pose Control and Motion Refinement for One-Shot Talking Head Generation | Sep 4, 2024 | Pose Prediction, Rhythm | Unverified | 0 |
| C^2RL: Content and Context Representation Learning for Gloss-free Sign Language Translation and Retrieval | Aug 19, 2024 | Gloss-free Sign Language Translation, Representation Learning | Unverified | 0 |
| Multiple Notch ligands in the synchronization of the segmentation clock | Aug 7, 2024 | Rhythm | Unverified | 0 |
| Stem-JEPA: A Joint-Embedding Predictive Architecture for Musical Stem Compatibility Estimation | Aug 5, 2024 | Rhythm, Self-Supervised Learning | Code Available | 2 |
| Re-ENACT: Reinforcement Learning for Emotional Speech Generation using Actor-Critic Strategy | Aug 4, 2024 | reinforcement-learning, Reinforcement Learning | Code Available | 0 |
| Analyzing and reducing the synthetic-to-real transfer gap in Music Information Retrieval: the task of automatic drum transcription | Jul 29, 2024 | Drum Transcription, Information Retrieval | Unverified | 0 |
| MusiConGen: Rhythm and Chord Control for Transformer-Based Text-to-Music Generation | Jul 21, 2024 | Diversity, Music Generation | Code Available | 2 |
| Towards Realistic Emotional Voice Conversion using Controllable Emotional Intensity | Jul 20, 2024 | Diversity, Rhythm | Unverified | 0 |
| Universal Facial Encoding of Codec Avatars from VR Headsets | Jul 17, 2024 | Rhythm, Self-Supervised Learning | Unverified | 0 |
| MuDiT & MuSiT: Alignment with Colloquial Expression in Description-to-Song Generation | Jul 3, 2024 | Descriptive, Rhythm | Unverified | 0 |
| Subtractive Training for Music Stem Insertion using Latent Diffusion Models | Jun 27, 2024 | Rhythm | Unverified | 0 |