SOTAVerified

FAD

Papers

Showing 125 of 62 papers

TitleStatusHype
Addressing Emotion Bias in Music Emotion Recognition and Generation with Frechet Audio DistanceCode3
L4DR: LiDAR-4DRadar Fusion for Weather-Robust 3D Object DetectionCode2
FlowDec: A flow-based full-band general audio codec with high perceptual qualityCode2
MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video GenerationCode2
Taming Data and Transformers for Audio GenerationCode2
MeLFusion: Synthesizing Music from Image and Language Cues using Diffusion ModelsCode2
KAD: No More FAD! An Effective and Efficient Evaluation Metric for Audio GenerationCode2
Efficient Autoregressive Audio Modeling via Next-Scale PredictionCode2
Adapting Frechet Audio Distance for Generative Music EvaluationCode2
BemaGANv2: A Tutorial and Comparative Survey of GAN-based Vocoders for Long-Term Audio GenerationCode1
AMSP-UOD: When Vortex Convolution and Stochastic Perturbation Meet Underwater Object DetectionCode1
Aligning Text-to-Music Evaluation with Human PreferencesCode1
Enhance Generation Quality of Flow Matching V2A Model via Multi-Step CoT-Like Guidance and Combined Preference OptimizationCode1
Timbre Transfer with Variational Auto Encoding and Cycle-Consistent Adversarial NetworksCode1
Frechet Music Distance: A Metric For Generative Symbolic Music EvaluationCode1
DOSE : Drum One-Shot Extraction from Music MixtureCode1
Multi-Source Music Generation with Latent DiffusionCode1
Representation Sharing for Fast Object Detector Search and BeyondCode1
AnoPLe: Few-Shot Anomaly Detection via Bi-directional Prompt Learning with Only Normal SamplesCode0
Reverse the auditory processing pathway: Coarse-to-fine audio reconstruction from fMRICode0
CLOTH4D: A Dataset for Clothed Human ReconstructionCode0
Adapting Offline Speech Translation Models for Streaming with Future-Aware Distillation and InferenceCode0
Refined Semantic Enhancement towards Frequency Diffusion for Video CaptioningCode0
Latent CLAP Loss for Better Foley Sound SynthesisCode0
Generating Diverse Vocal Bursts with StyleGAN2 and MEL-SpectrogramsCode0
Show:102550
← PrevPage 1 of 3Next →

No leaderboard results yet.