SOTAVerified

Language Modeling

Papers

Showing 1220112250 of 14182 papers

TitleStatusHype
Addressing Some Limitations of Transformers with Feedback MemoryCode1
MaxUp: A Simple Way to Improve Generalization of Neural Network TrainingCode0
Scalable Second Order Optimization for Deep LearningCode0
A Systematic Comparison of Architectures for Document-Level Sentiment ClassificationCode0
LAMBERT: Layout-Aware (Language) Modeling for information extractionCode1
SentenceMIM: A Latent Variable Language ModelCode1
Studying the Effects of Cognitive Biases in Evaluation of Conversational Agents0
A Financial Service Chatbot based on Deep Bidirectional Transformers0
Global and Local Feature Learning for Ego-Network Analysis0
UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and GenerationCode1
Transformer on a DietCode1
FQuAD: French Question Answering Dataset0
A Data Efficient End-To-End Spoken Language Understanding Architecture0
Comparison of Turkish Word Representations Trained on Different Morphological Forms0
CBAG: Conditional Biomedical Abstract Generation0
Deep Learning for Source Code Modeling and Generation: Models, Applications and ChallengesCode0
Pre-Training for Query Rewriting in A Spoken Language Understanding System0
Regularizing activations in neural networks via distribution matching with the Wasserstein metric0
Localized Flood DetectionWith Minimal Labeled Social Media Data Using Transfer Learning0
REALM: Retrieval-Augmented Language Model Pre-TrainingCode1
How Much Knowledge Can You Pack Into the Parameters of a Language Model?Code1
FastWave: Accelerating Autoregressive Convolutional Neural Networks on FPGA0
Limits of Detecting Text Generated by Large-Scale Language Models0
Blank Language ModelsCode1
Time-aware Large Kernel ConvolutionsCode1
Introducing Aspects of Creativity in Automatic Poetry GenerationCode0
Consistency of a Recurrent Language Model With Respect to Incomplete DecodingCode0
Aligning the Pretraining and Finetuning Objectives of Language Models0
Parsing as PretrainingCode1
Explaining Relationships Between Scientific DocumentsCode1
A Difference-of-Convex Programming Approach With Parallel Branch-and-Bound For Sentence Compression Via A Hybrid Extractive Model0
Adversarial Training for Aspect-Based Sentiment Analysis with BERTCode1
Aspect-based Academic Search using Domain-specific KB0
PEL-BERT: A Joint Model for Protocol Entity Linking0
DUMA: Reading Comprehension with Transposition ThinkingCode1
Compressing Language Models using Doped Kronecker Products0
Scaling Laws for Neural Language ModelsCode1
A Simple Baseline to Semi-Supervised Domain Adaptation for Machine TranslationCode1
Contextualized Embeddings in Named-Entity Recognition: An Empirical Study on GeneralizationCode1
ImageBERT: Cross-modal Pre-training with Large-scale Weak-supervised Image-Text Data0
Exploiting Cloze Questions for Few Shot Text Classification and Natural Language InferenceCode1
Domain-Aware Dialogue State Tracker for Multi-Domain Dialogue SystemsCode1
Single headed attention based sequence-to-sequence model for state-of-the-art results on Switchboard0
RobBERT: a Dutch RoBERTa-based Language ModelCode1
Block-wise Dynamic SparsenessCode0
Montage: A Neural Network Language Model-Guided JavaScript Engine FuzzerCode1
A Continuous Space Neural Language Model for Bengali Language0
Learning Cross-Context Entity Representations from Text0
Towards Minimal Supervision BERT-based Grammar Error Correction0
Automatic Business Process Structure Discovery using Ordered Neurons LSTM: A Preliminary Study0
Show:102550
← PrevPage 245 of 284Next →

No leaderboard results yet.