SOTAVerified

Language Modeling

Papers

Showing 11051-11100 of 14182 papers

| Title | Status | Hype |
| --- | --- | --- |
| Transfer Learning with Shallow Decoders: BSC at WMT2021’s Multilingual Low-Resource Translation for Indo-European Languages Shared Task | Code | 0 |
| NICT Kyoto Submission for the WMT’21 Quality Estimation Task: Multimetric Multilingual Pretraining for Critical Error Detection | | 0 |
| Unsupervised Multi-View Post-OCR Error Correction With Language Models | | 0 |
| Unsupervised Adverbial Identification in Modern Chinese Literature | | 0 |
| Scaffolded input promotes atomic organization in the recurrent neural network language model | Code | 0 |
| Who’s on First?: Probing the Learning and Representation Capabilities of Language Models on Deterministic Closed Domains | Code | 0 |
| Unsupervised Discovery of Unaccusative and Unergative Verbs | | 0 |
| On the Role of Corpus Ordering in Language Modeling | | 0 |
| ProSPer: Probing Human and Neural Network Language Model Understanding of Spatial Perspective | Code | 0 |
| What Can a Generative Language Model Answer About a Passage? | | 0 |
| What BERT Based Language Model Learns in Spoken Transcripts: An Empirical Study | | 0 |
| Stacked AMR Parsing with Silver Data | Code | 0 |
| UnClE: Explicitly Leveraging Semantic Similarity to Reduce the Parameters of Word Embeddings | | 0 |
| R-BERT-CNN: Drug-target interactions extraction from biomedical literature | | 0 |
| PnPOOD: Out-Of-Distribution Detection for Text Classification via Plug and Play Data Augmentation | | 0 |
| Automatic Knowledge Augmentation for Generative Commonsense Reasoning | | 0 |
| EmpBot: A T5-based Empathetic Chatbot focusing on Sentiments | | 0 |
| Combining Unsupervised and Text Augmented Semi-Supervised Learning for Low Resourced Autoregressive Speech Recognition | | 0 |
| Pre-training Co-evolutionary Protein Representation via A Pairwise Masked Language Model | | 0 |
| Semi-Siamese Bi-encoder Neural Ranking Model Using Lightweight Fine-Tuning | Code | 0 |
| No News is Good News: A Critique of the One Billion Word Benchmark | | 0 |
| Paradigm Shift in Language Modeling: Revisiting CNN for Modeling Sanskrit Originated Bengali and Hindi Language | | 0 |
| Distributionally Robust Recurrent Decoders with Random Network Distillation | | 0 |
| Sentence Punctuation for Collaborative Commentary Generation in Esports Live-Streaming | | 0 |
| Text Counterfactuals via Latent Optimization and Shapley-Guided Search | Code | 0 |
| SLAM: A Unified Encoder for Speech and Language Modeling via Speech-Text Joint Pre-Training | | 0 |
| Knowledge distillation from language model to acoustic model: a hierarchical multi-task learning approach | | 0 |
| JavaBERT: Training a transformer-based model for the Java programming language | Code | 0 |
| Knowledge Graph informed Fake News Classification via Heterogeneous Representation Ensembles | Code | 0 |
| Improved Multilingual Language Model Pretraining for Social Media Text via Translation Pair Prediction | Code | 0 |
| DEEPAGÉ: Answering Questions in Portuguese about the Brazilian Environment | Code | 0 |
| Automatic Learning of Subword Dependent Model Scales | | 0 |
| NormFormer: Improved Transformer Pretraining with Extra Normalization | | 0 |
| Reminding the Incremental Language Model via Data-Free Self-Distillation | | 0 |
| On the Complementarity of Data Selection and Fine Tuning for Domain Adaptation | | 0 |
| Sharpness-Aware Minimization Improves Language Model Generalization | | 0 |
| N-Shot Learning for Augmenting Task-Oriented Dialogue State Tracking | | 0 |
| Prix-LM: Pretraining for Multilingual Knowledge Base Construction | Code | 0 |
| xGQA: Cross-Lingual Visual Question Answering | | 0 |
| Multilingual unsupervised sequence segmentation transfers to extremely low-resource languages | Code | 0 |
| Models In a Spelling Bee: Language Models Implicitly Learn the Character Composition of Tokens | | 0 |
| ASR4REAL: An extended benchmark for speech models | | 0 |
| Leveraging Knowledge in Multilingual Commonsense Reasoning | | 0 |
| DEMix Layers: Disentangling Domains for Modular Language Modeling | | 0 |
| A Novel Metric for Evaluating Semantics Preservation | Code | 0 |
| BitFit: Simple Parameter-efficient Fine-tuning for Transformer-based Masked Language-models | | 0 |
| Echo-Attention: Attend Once and Get N Attentions for Free | | 0 |
| HRKD: Hierarchical Relational Knowledge Distillation for Cross-domain Language Model Compression | Code | 0 |
| Kronecker Decomposition for GPT Compression | | 0 |
| DS-TOD: Efficient Domain Specialization for Task Oriented Dialog | Code | 0 |

No leaderboard results yet.