SOTAVerified

Language Modeling

Papers

Showing 1105111100 of 14182 papers

TitleStatusHype
LMSOC: An approach for socially sensitive pretraining0
Scatterbrain: Unifying Sparse and Low-rank AttentionCode1
Unsupervised Multilingual Sentence Embeddings for Parallel Corpus Mining0
Aligning Visual Prototypes with BERT Embeddings for Few-Shot Learning0
GapPredict: A Language Model for Resolving Gaps in Draft Genome AssembliesCode0
VLM: Task-agnostic Video-Language Model Pre-training for Video Understanding0
Accelerating Gossip SGD with Periodic Global Averaging0
Effective Attention Sheds Light On InterpretabilityCode1
Text based personality prediction from multiple social media data sources using pre-trained language model and model averaging0
Stage-wise Fine-tuning for Graph-to-Text GenerationCode1
Sentence Similarity Based on Contexts0
Neural Predictive Text for Grammatical Error Prevention0
SINA-BERT: A Pre-Trained Language Model for Analysis of Medical Texts in Persian0
Doc2Dict: Information Extraction as Text GenerationCode0
A Cognitive Regularizer for Language Modeling0
From Masked Language Modeling to Translation: Non-English Auxiliary Tasks Improve Zero-shot Spoken Language UnderstandingCode0
RetGen: A Joint framework for Retrieval and Grounded Text Generation ModelingCode1
Not All Memories are Created Equal: Learning to Forget by ExpiringCode1
Towards Human-Free Automatic Quality Evaluation of German Summarization0
MATE-KD: Masked Adversarial TExt, a Companion to Knowledge DistillationCode1
Slower is Better: Revisiting the Forgetting Mechanism in LSTM for Slower Information Decay0
BERT is to NLP what AlexNet is to CV: Can Pre-Trained Language Models Identify Analogies?Code1
Matching Visual Features to Hierarchical Semantic Topics for Image Paragraph CaptioningCode0
DocSCAN: Unsupervised Text Classification via Learning from NeighborsCode1
Lawformer: A Pre-trained Language Model for Chinese Legal Long DocumentsCode1
Understanding by Understanding Not: Modeling Negation in Language ModelsCode1
DExperts: Decoding-Time Controlled Text Generation with Experts and Anti-ExpertsCode1
Handwritten Mathematical Expression Recognition with Bidirectionally Trained TransformerCode1
Computer-Aided Design as Language0
HerBERT: Efficiently Pretrained Transformer-based Language Model for Polish0
Inferring the Reader: Guiding Automated Story Generation with Commonsense ReasoningCode0
Impact of Gender Debiased Word Embeddings in Language Modeling0
Goldilocks: Just-Right Tuning of BERT for Technology-Assisted Review0
On the limit of English conversational speech recognition0
Unsupervised Document Expansion for Information Retrieval with Stochastic Text GenerationCode0
Larger-Scale Transformers for Multilingual Masked Language Modeling0
It’s Basically the Same Language Anyway: the Case for a Nordic Language Model0
Error Analysis of using BART for Multi-Document Summarization: A Study for English and German LanguageCode0
Measuring Translationese across Levels of Expertise: Are Professionals more Surprising than Students?0
When to Fold'em: How to answer Unanswerable questionsCode1
The Zero Resource Speech Challenge 2021: Spoken language modelling0
Extractive and Abstractive Explanations for Fact-Checking and Evaluation of News0
Diverse Image Inpainting with Bidirectional and Autoregressive Transformers0
Phrase break prediction with bidirectional encoder representations in Japanese text-to-speech synthesisCode0
Teaching a Massive Open Online Course on Natural Language Processing0
Reranking Machine Translation Hypotheses with Structured and Web-based Language Models0
Learning Passage Impacts for Inverted IndexesCode1
Transfer training from smaller language model0
Fast Text-Only Domain Adaptation of RNN-Transducer Prediction Network0
Extracting Adverse Drug Events from Clinical Notes0
Show:102550
← PrevPage 222 of 284Next →

No leaderboard results yet.