| DeCoAR 2.0: Deep Contextualized Acoustic Representations with Vector Quantization | Dec 11, 2020 | DiversityQuantization | CodeCode Available | 1 |
| Exploring wav2vec 2.0 on speaker verification and language identification | Dec 11, 2020 | Language IdentificationMulti-Task Learning | —Unverified | 0 |
| Speech SIMCLR: Combining Contrastive and Reconstruction Objective for Self-supervised Speech Representation Learning | Oct 27, 2020 | Emotion RecognitionRepresentation Learning | CodeCode Available | 1 |
| A Comparison of Discrete Latent Variable Models for Speech Representation Learning | Oct 24, 2020 | Phoneme RecognitionRepresentation Learning | —Unverified | 0 |
| Similarity Analysis of Self-Supervised Speech Representations | Oct 22, 2020 | Representation LearningSpeech Representation Learning | —Unverified | 0 |
| Self-supervised Contrastive Video-Speech Representation Learning for Ultrasound | Aug 14, 2020 | Contrastive LearningGaze Prediction | —Unverified | 0 |
| Input-independent Attention Weights Are Expressive Enough: A Study of Attention in Self-supervised Audio Transformers | Jun 9, 2020 | General ClassificationRepresentation Learning | —Unverified | 0 |
| CSTNet: Contrastive Speech Translation Network for Self-Supervised Speech Representation Learning | Jun 4, 2020 | BIG-bench Machine LearningContrastive Learning | —Unverified | 0 |
| A Convolutional Deep Markov Model for Unsupervised Speech Representation Learning | Jun 3, 2020 | Representation LearningSelf-Supervised Learning | —Unverified | 0 |
| Does Visual Self-Supervision Improve Learning of Speech Representations for Emotion Recognition? | May 4, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |