| Fine-Tuning Language Models via Epistemic Neural Networks | Nov 3, 2022 | Active Learning, Language Modeling | Code Available | 1 |
| Generative Adversarial Training Can Improve Neural Language Models | Nov 2, 2022 | Language Modeling | Unverified | 0 |
| Numerical Optimizations for Weighted Low-rank Estimation on Language Model | Nov 2, 2022 | Language Modeling | Unverified | 0 |
| Multi-level Distillation of Semantic Knowledge for Pre-training Multilingual Language Model | Nov 2, 2022 | Knowledge Distillation, Language Modeling | Unverified | 0 |
| Towards Zero-Shot Code-Switched Speech Recognition | Nov 2, 2022 | Automatic Speech Recognition (ASR) | Unverified | 0 |
| Internal Language Model Estimation based Adaptive Language Model Fusion for Domain Adaptation | Nov 2, 2022 | Domain Adaptation, Language Modeling | Unverified | 0 |
| data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student training setup | Nov 2, 2022 | Automatic Speech Recognition (ASR), Language Modeling | Code Available | 1 |
| A Quantitative Analysis of Comparison of Emoji Sentiment: Taiwan Mandarin Users and English Users | Nov 1, 2022 | Language Modeling | Unverified | 0 |
| Language Model Based Chinese Handwriting Address Recognition | Nov 1, 2022 | Handwriting Recognition, Language Modeling | Unverified | 0 |
| HanTrans: An Empirical Study on Cross-Era Transferability of Chinese Pre-trained Language Model | Nov 1, 2022 | Language Modeling | Code Available | 0 |
| NERVE at ROCLING 2022 Shared Task: A Comparison of Three Named Entity Recognition Frameworks Based on Language Model and Lexicon Approach | Nov 1, 2022 | Language Modeling | Unverified | 0 |
| The future is different: Large pre-trained language models fail in prediction tasks | Nov 1, 2022 | Language Modeling | Unverified | 0 |
| Reduce, Reuse, Recycle: Improving Training Efficiency with Distillation | Nov 1, 2022 | Image Classification | Unverified | 0 |
| T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5 | Nov 1, 2022 | Language Modeling | Code Available | 1 |
| Machine learning can guide experimental approaches for protein digestibility estimations | Nov 1, 2022 | Language Modeling | Unverified | 0 |
| VarMAE: Pre-training of Variational Masked Autoencoder for Domain-adaptive Language Understanding | Nov 1, 2022 | Citation Intent Classification, Language Modeling | Unverified | 0 |
| Interpretability in the Wild: a Circuit for Indirect Object Identification in GPT-2 small | Nov 1, 2022 | Language Modeling | Code Available | 4 |
| Improving Variational Autoencoders with Density Gap-based Regularization | Nov 1, 2022 | Language Modeling | Code Available | 0 |
| Learning to Solve Voxel Building Embodied Tasks from Pixels and Natural Language Instructions | Nov 1, 2022 | Language Modeling | Code Available | 0 |
| Generating Sequences by Learning to Self-Correct | Oct 31, 2022 | Language Modeling | Unverified | 0 |
| Blank Collapse: Compressing CTC emission for the faster decoding | Oct 31, 2022 | Automatic Speech Recognition (ASR) | Code Available | 0 |
| Improving Temporal Generalization of Pre-trained Language Models with Lexical Semantic Change | Oct 31, 2022 | Language Modeling | Code Available | 1 |
| A Simple, Yet Effective Approach to Finding Biases in Code Generation | Oct 31, 2022 | Causal Language Modeling, Code Generation | Unverified | 0 |
| 1Cademy @ Causal News Corpus 2022: Enhance Causal Span Detection via Beam-Search-based Position Selector | Oct 31, 2022 | Data Augmentation, Language Modeling | Code Available | 0 |
| When Language Model Meets Private Library | Oct 31, 2022 | Code Generation, Language Modeling | Code Available | 2 |
| WHEN FLUE MEETS FLANG: Benchmarks and Large Pre-trained Language Model for Financial Domain | Oct 31, 2022 | FLUE, Language Modeling | Unverified | 0 |
| Pneg: Prompt-based Negative Response Generation for Dialogue Response Selection Task | Oct 31, 2022 | Language Modeling | Code Available | 0 |
| Tables to LaTeX: structure and content extraction from scientific tables | Oct 31, 2022 | Language Modeling | Unverified | 0 |
| Modular Hybrid Autoregressive Transducer | Oct 31, 2022 | Decoder, Language Modeling | Unverified | 0 |
| SSD-LM: Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control | Oct 31, 2022 | Diversity, Language Modeling | Code Available | 1 |
| L-GreCo: Layerwise-Adaptive Gradient Compression for Efficient and Accurate Deep Learning | Oct 31, 2022 | Image Classification | Code Available | 1 |
| CodeEditor: Learning to Edit Source Code with Pre-trained Models | Oct 31, 2022 | Language Modeling | Code Available | 0 |
| Learning to Decompose: Hypothetical Question Decomposition Based on Comparable Texts | Oct 30, 2022 | Language Modeling | Unverified | 0 |
| token2vec: A Joint Self-Supervised Pre-training Framework Using Unpaired Speech and Text | Oct 30, 2022 | Intent Classification | Unverified | 0 |
| BERT Meets CTC: New Formulation of End-to-End Speech Recognition with Pre-trained Masked Language Model | Oct 29, 2022 | Language Modeling | Unverified | 0 |
| Differentiable Data Augmentation for Contrastive Sentence Representation Learning | Oct 29, 2022 | Contrastive Learning, Data Augmentation | Code Available | 1 |
| NTULM: Enriching Social Media Text Representations with Non-Textual Units | Oct 29, 2022 | Language Modeling | Unverified | 0 |
| Knowledge-in-Context: Towards Knowledgeable Semi-Parametric Language Models | Oct 28, 2022 | Common Sense Reasoning, Coreference Resolution | Unverified | 0 |
| Leveraging Label Correlations in a Multi-label Setting: A Case Study in Emotion | Oct 28, 2022 | Emotion Recognition, Language Modeling | Code Available | 1 |
| DiMBERT: Learning Vision-Language Grounded Representations with Disentangled Multimodal-Attention | Oct 28, 2022 | Image Captioning, Language Modeling | Unverified | 0 |
| Feature Engineering vs BERT on Twitter Data | Oct 28, 2022 | Feature Engineering, GPU | Unverified | 0 |
| RoChBert: Towards Robust BERT Fine-tuning for Chinese | Oct 28, 2022 | Data Augmentation, Language Modeling | Code Available | 1 |
| UPainting: Unified Text-to-Image Diffusion Generation with Cross-modal Guidance | Oct 28, 2022 | Image Generation, Image-text matching | Unverified | 0 |
| You can't pick your neighbors, or can you? When and how to rely on retrieval in the kNN-LM | Oct 28, 2022 | Language Modeling | Code Available | 0 |
| Nearest Neighbor Language Models for Stylistic Controllable Generation | Oct 27, 2022 | Language Modeling | Unverified | 0 |
| Simulating realistic speech overlaps improves multi-talker ASR | Oct 27, 2022 | Automatic Speech Recognition (ASR) | Unverified | 0 |
| Self-supervised language learning from raw audio: Lessons from the Zero Resource Speech Challenge | Oct 27, 2022 | Acoustic Unit Discovery, Language Modeling | Unverified | 0 |
| Retrieval Oriented Masking Pre-training Language Model for Dense Passage Retrieval | Oct 27, 2022 | Language Modeling | Code Available | 2 |
| What Language Model to Train if You Have One Million GPU Hours? | Oct 27, 2022 | GPU, Language Modeling | Code Available | 3 |
| Seq2Seq-SC: End-to-End Semantic Communication Systems with Pre-trained Language Model | Oct 27, 2022 | Language Modeling | Unverified | 0 |