SOTAVerified

Language Modeling

Papers

Showing 9051-9100 of 14182 papers

| Title | Status | Hype |
|---|---|---|
| Fine-Tuning Language Models via Epistemic Neural Networks | Code | 1 |
| Generative Adversarial Training Can Improve Neural Language Models | | 0 |
| Numerical Optimizations for Weighted Low-rank Estimation on Language Model | | 0 |
| Multi-level Distillation of Semantic Knowledge for Pre-training Multilingual Language Model | | 0 |
| Towards Zero-Shot Code-Switched Speech Recognition | | 0 |
| Internal Language Model Estimation based Adaptive Language Model Fusion for Domain Adaptation | | 0 |
| data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student training setup | Code | 1 |
| A Quantitative Analysis of Comparison of Emoji Sentiment: Taiwan Mandarin Users and English Users | | 0 |
| Language Model Based Chinese Handwriting Address Recognition | | 0 |
| HanTrans: An Empirical Study on Cross-Era Transferability of Chinese Pre-trained Language Model | Code | 0 |
| NERVE at ROCLING 2022 Shared Task: A Comparison of Three Named Entity Recognition Frameworks Based on Language Model and Lexicon Approach | | 0 |
| The future is different: Large pre-trained language models fail in prediction tasks | | 0 |
| Reduce, Reuse, Recycle: Improving Training Efficiency with Distillation | | 0 |
| T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5 | Code | 1 |
| Machine learning can guide experimental approaches for protein digestibility estimations | | 0 |
| VarMAE: Pre-training of Variational Masked Autoencoder for Domain-adaptive Language Understanding | | 0 |
| Interpretability in the Wild: a Circuit for Indirect Object Identification in GPT-2 small | Code | 4 |
| Improving Variational Autoencoders with Density Gap-based Regularization | Code | 0 |
| Learning to Solve Voxel Building Embodied Tasks from Pixels and Natural Language Instructions | Code | 0 |
| Generating Sequences by Learning to Self-Correct | | 0 |
| Blank Collapse: Compressing CTC emission for the faster decoding | Code | 0 |
| Improving Temporal Generalization of Pre-trained Language Models with Lexical Semantic Change | Code | 1 |
| A Simple, Yet Effective Approach to Finding Biases in Code Generation | | 0 |
| 1Cademy @ Causal News Corpus 2022: Enhance Causal Span Detection via Beam-Search-based Position Selector | Code | 0 |
| When Language Model Meets Private Library | Code | 2 |
| WHEN FLUE MEETS FLANG: Benchmarks and Large Pre-trained Language Model for Financial Domain | | 0 |
| Pneg: Prompt-based Negative Response Generation for Dialogue Response Selection Task | Code | 0 |
| Tables to LaTeX: structure and content extraction from scientific tables | | 0 |
| Modular Hybrid Autoregressive Transducer | | 0 |
| SSD-LM: Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control | Code | 1 |
| L-GreCo: Layerwise-Adaptive Gradient Compression for Efficient and Accurate Deep Learning | Code | 1 |
| CodeEditor: Learning to Edit Source Code with Pre-trained Models | Code | 0 |
| Learning to Decompose: Hypothetical Question Decomposition Based on Comparable Texts | | 0 |
| token2vec: A Joint Self-Supervised Pre-training Framework Using Unpaired Speech and Text | | 0 |
| BERT Meets CTC: New Formulation of End-to-End Speech Recognition with Pre-trained Masked Language Model | | 0 |
| Differentiable Data Augmentation for Contrastive Sentence Representation Learning | Code | 1 |
| NTULM: Enriching Social Media Text Representations with Non-Textual Units | | 0 |
| Knowledge-in-Context: Towards Knowledgeable Semi-Parametric Language Models | | 0 |
| Leveraging Label Correlations in a Multi-label Setting: A Case Study in Emotion | Code | 1 |
| DiMBERT: Learning Vision-Language Grounded Representations with Disentangled Multimodal-Attention | | 0 |
| Feature Engineering vs BERT on Twitter Data | | 0 |
| RoChBert: Towards Robust BERT Fine-tuning for Chinese | Code | 1 |
| UPainting: Unified Text-to-Image Diffusion Generation with Cross-modal Guidance | | 0 |
| You can't pick your neighbors, or can you? When and how to rely on retrieval in the kNN-LM | Code | 0 |
| Nearest Neighbor Language Models for Stylistic Controllable Generation | | 0 |
| Simulating realistic speech overlaps improves multi-talker ASR | | 0 |
| Self-supervised language learning from raw audio: Lessons from the Zero Resource Speech Challenge | | 0 |
| Retrieval Oriented Masking Pre-training Language Model for Dense Passage Retrieval | Code | 2 |
| What Language Model to Train if You Have One Million GPU Hours? | Code | 3 |
| Seq2Seq-SC: End-to-End Semantic Communication Systems with Pre-trained Language Model | | 0 |

No leaderboard results yet.