SOTAVerified

Language Modeling

Papers

Showing 71267150 of 14182 papers

TitleStatusHype
Eureka-Moments in Transformers: Multi-Step Tasks Reveal Softmax Induced Optimization ProblemsCode0
TabuLa: Harnessing Language Models for Tabular Data SynthesisCode1
Data Augmentations for Improved (Large) Language Model Generalization0
Monarch Mixer: A Simple Sub-Quadratic GEMM-Based ArchitectureCode2
Solving the multiplication problem of a large language model system using a graph-based method0
Preference Optimization for Molecular Language ModelsCode0
Document-Level Language Models for Machine Translation0
Pseudointelligence: A Unifying Framework for Language Model Evaluation0
Harnessing Dataset Cartography for Improved Compositional Generalization in TransformersCode0
Zero-shot Faithfulness Evaluation for Text Summarization with Foundation Language ModelCode1
Fast Multipole Attention: A Divide-and-Conquer Attention Mechanism for Long SequencesCode0
Solving Hard Analogy Questions with Relation Embedding ChainsCode0
Large Language Model Prediction Capabilities: Evidence from a Real-World Forecasting Tournament0
Iterative Shallow Fusion of Backward Language Model for End-to-End Speech Recognition0
Multi-stage Large Language Model Correction for Speech Recognition0
Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter MergingCode1
Revealing the Unwritten: Visual Investigation of Beam Search Trees to Address Language Model Prompting Challenges0
BitNet: Scaling 1-bit Transformers for Large Language ModelsCode2
Leveraging Large Language Model for Automatic Evolving of Industrial Data-Centric R&D Cycle0
ViSoBERT: A Pre-Trained Language Model for Vietnamese Social Media Text ProcessingCode0
Correction Focused Language Model Training for Speech Recognition0
Learn Your Tokens: Word-Pooled Tokenization for Language ModelingCode0
Watermarking LLMs with Weight QuantizationCode1
ChapGTP, ILLC's Attempt at Raising a BabyLM: Improving Data Efficiency by Automatic Task Formation0
Utilising a Large Language Model to Annotate Subject Metadata: A Case Study in an Australian National Research Data Catalogue0
Show:102550
← PrevPage 286 of 568Next →

No leaderboard results yet.