SOTAVerified

FAD

Papers

Showing 150 of 62 papers

TitleStatusHype
Addressing Emotion Bias in Music Emotion Recognition and Generation with Frechet Audio DistanceCode3
FlowDec: A flow-based full-band general audio codec with high perceptual qualityCode2
KAD: No More FAD! An Effective and Efficient Evaluation Metric for Audio GenerationCode2
Efficient Autoregressive Audio Modeling via Next-Scale PredictionCode2
L4DR: LiDAR-4DRadar Fusion for Weather-Robust 3D Object DetectionCode2
Taming Data and Transformers for Audio GenerationCode2
MeLFusion: Synthesizing Music from Image and Language Cues using Diffusion ModelsCode2
Adapting Frechet Audio Distance for Generative Music EvaluationCode2
MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video GenerationCode2
BemaGANv2: A Tutorial and Comparative Survey of GAN-based Vocoders for Long-Term Audio GenerationCode1
DOSE : Drum One-Shot Extraction from Music MixtureCode1
Enhance Generation Quality of Flow Matching V2A Model via Multi-Step CoT-Like Guidance and Combined Preference OptimizationCode1
Aligning Text-to-Music Evaluation with Human PreferencesCode1
Frechet Music Distance: A Metric For Generative Symbolic Music EvaluationCode1
Multi-Source Music Generation with Latent DiffusionCode1
AMSP-UOD: When Vortex Convolution and Stochastic Perturbation Meet Underwater Object DetectionCode1
Timbre Transfer with Variational Auto Encoding and Cycle-Consistent Adversarial NetworksCode1
Representation Sharing for Fast Object Detector Search and BeyondCode1
Detecting immune cells with label-free two-photon autofluorescence and deep learning0
FAD-Net: Frequency-Domain Attention-Guided Diffusion Network for Coronary Artery Segmentation using Invasive Coronary Angiography0
IMPACT: Iterative Mask-based Parallel Decoding for Text-to-Audio Generation with Diffusion Modeling0
FAD: Frequency Adaptation and Diversion for Cross-domain Few-shot Learning0
Efficient Listener: Dyadic Facial Motion Synthesis via Action Diffusion0
DRAGON: Distributional Rewards Optimize Diffusion Generative Models0
Enhancing U.S. swine farm preparedness for infectious foreign animal diseases with rapid access to biosecurity information0
TARO: Timestep-Adaptive Representation Alignment with Onset-Aware Conditioning for Synchronized Video-to-Audio Synthesis0
RenderBox: Expressive Performance Rendering with Text Control0
Diffusion based Text-to-Music Generation with Global and Local Text based Conditioning0
Sound Scene Synthesis at the DCASE 2024 Challenge0
Market Making with Fads, Informed, and Uninformed Traders0
MIMII-Gen: Generative Modeling Approach for Simulated Evaluation of Anomalous Sound Detection System0
Bridging Paintings and Music -- Exploring Emotion based Music Generation through Paintings0
Latent Diffusion Bridges for Unsupervised Musical Audio Timbre Transfer0
AnoPLe: Few-Shot Anomaly Detection via Bi-directional Prompt Learning with Only Normal SamplesCode0
Braille-to-Speech Generator: Audio Generation Based on Joint Fine-Tuning of CLIP and Fastspeech20
Exploring compressibility of transformer based text-to-music (TTM) models0
Reverse the auditory processing pathway: Coarse-to-fine audio reconstruction from fMRICode0
FAD-SAR: A Novel Fishing Activity Detection System via Synthetic Aperture Radar Images Based on Deep Learning Method0
FaceCat: Enhancing Face Recognition Security with a Unified Diffusion Model0
Latent CLAP Loss for Better Foley Sound SynthesisCode0
MOS-FAD: Improving Fake Audio Detection Via Automatic Mean Opinion Score Prediction0
Audiobox: Unified Audio Generation with Natural Language Prompts0
Performance Conditioning for Diffusion-Based Multi-Instrument Music Synthesis0
Retrieval-Augmented Text-to-Audio Generation0
Flatness-Aware Minimization for Domain Generalization0
Feature Adversarial Distillation for Point Cloud Classification0
Adapting Offline Speech Translation Models for Streaming with Future-Aware Distillation and InferenceCode0
A General Framework for Learning Procedural Audio Models of Environmental Sounds0
Federated Automatic Differentiation0
CLOTH4D: A Dataset for Clothed Human ReconstructionCode0
Show:102550
← PrevPage 1 of 2Next →

No leaderboard results yet.