SOTAVerified

Language Modeling

Papers

Showing 8876–8900 of 14182 papers

| Title | Status | Hype |
| --- | --- | --- |
| Rethinking the Role of Scale for In-Context Learning: An Interpretability-based Case Study at 66 Billion Scale | Code | 1 |
| HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation | Code | 1 |
| Claim Optimization in Computational Argumentation | Code | 0 |
| POIBERT: A Transformer-based Model for the Tour Recommendation Problem | | 0 |
| ALERT: Adapting Language Models to Reasoning Tasks | | 0 |
| LegalRelectra: Mixed-domain Language Modeling for Long-range Legal Text Comprehension | | 0 |
| Investigation of Japanese PnG BERT language model in text-to-speech synthesis for pitch accent language | | 0 |
| Enhancing Multi-modal and Multi-hop Question Answering via Structured Knowledge and Unified Retrieval-Generation | Code | 1 |
| Improving Chess Commentaries by Combining Language Models with Symbolic Reasoning Engines | | 0 |
| Efficient Pre-training of Masked Language Model via Concept-based Curriculum Masking | Code | 1 |
| FiDO: Fusion-in-Decoder optimized for stronger performance and faster inference | | 0 |
| Attention as a Guide for Simultaneous Speech Translation | Code | 0 |
| Joint processing of linguistic properties in brains and language models | Code | 0 |
| Efficient Long Sequence Modeling via State Space Augmented Transformer | Code | 1 |
| The Effects of In-domain Corpus Size on pre-training BERT | Code | 0 |
| On Second Thought, Let's Not Think Step by Step! Bias and Toxicity in Zero-Shot Reasoning | Code | 1 |
| MANTa: Efficient Gradient-Based Tokenization for Robust End-to-End Language Modeling | | 0 |
| Cross-Modal Similarity-Based Curriculum Learning for Image Captioning | | 0 |
| The Challenges of HTR Model Training: Feedback from the Project Donner le gout de l'archive a l'ere numerique | | 0 |
| Technical Report -- Competition Solution for Prompt Tuning using Pretrained Language Model | | 0 |
| Deep Image Style Transfer from Freeform Text | | 0 |
| ERNIE-Code: Beyond English-Centric Cross-lingual Pretraining for Programming Languages | Code | 6 |
| Do Text-to-Text Multi-Task Learners Suffer from Task Conflict? | Code | 0 |
| CNO-LSTM: A Chaotic Neural Oscillatory Long Short-Term Memory Model for Text Classification | | 0 |
| Prompting Is Programming: A Query Language for Large Language Models | Code | 3 |
Page 356 of 568

No leaderboard results yet.