SOTAVerified

Language Modeling

Papers

Showing 2551-2600 of 14182 papers

Title | Status | Hype
Efficient Online Data Mixing For Language Model Pre-Training | Code | 1
Efficiently Modeling Long Sequences with Structured State Spaces | Code | 1
Efficient Nearest Neighbor Language Models | Code | 1
CoditT5: Pretraining for Source Code and Natural Language Editing | Code | 1
CoEdIT: Text Editing by Task-Specific Instruction Tuning | Code | 1
ConspEmoLLM: Conspiracy Theory Detection Using an Emotion-Based Large Language Model | Code | 1
ITER: Iterative Transformer-based Entity Recognition and Relation Extraction | Code | 1
CoFInAl: Enhancing Action Quality Assessment with Coarse-to-Fine Instruction Alignment | Code | 1
Efficient recurrent architectures through activity sparsity and sparse back-propagation through time | Code | 1
Efficient Hierarchical Domain Adaptation for Pretrained Language Models | Code | 1
Efficient Dynamic Clustering-Based Document Compression for Retrieval-Augmented-Generation | Code | 1
Efficient conformer: Progressive downsampling and grouped attention for automatic speech recognition | Code | 1
Efficient Content-Based Sparse Attention with Routing Transformers | Code | 1
CogBench: a large language model walks into a psychology lab | Code | 1
Efficient Long Sequence Modeling via State Space Augmented Transformer | Code | 1
CogMG: Collaborative Augmentation Between Large Language Model and Knowledge Graph | Code | 1
Effective Use of Graph Convolution Network and Contextual Sub-Tree for Commodity News Event Extraction | Code | 1
Effective Sequence-to-Sequence Dialogue State Tracking | Code | 1
Cognitive Dissonance: Why Do Language Model Outputs Disagree with Internal Representations of Truthfulness? | Code | 1
Cognitive Reframing of Negative Thoughts through Human-Language Model Interaction | Code | 1
Effective Use of Graph Convolution Network and Contextual Sub-Tree for Commodity News Event Extraction | Code | 1
CDGP: Automatic Cloze Distractor Generation based on Pre-trained Language Model | Code | 1
Asynchronous Local-SGD Training for Language Modeling | Code | 1
AlephBERT:A Hebrew Large Pre-Trained Language Model to Start-off your Hebrew NLP Application With | Code | 1
CCpdf: Building a High Quality Corpus for Visually Rich Documents from Web Crawl Data | Code | 1
Are Intermediate Layers and Labels Really Necessary? A General Language Model Distillation Method | Code | 1
Effect of Pre-Training Scale on Intra- and Inter-Domain Full and Few-Shot Transfer Learning for Natural and Medical X-Ray Chest Images | Code | 1
ELECTRAMed: a new pre-trained language representation model for biomedical NLP | Code | 1
Matrix Information Theory for Self-Supervised Learning | Code | 1
Cold-start Active Learning through Self-supervised Language Modeling | Code | 1
CCC-wav2vec 2.0: Clustering aided Cross Contrastive Self-supervised learning of speech representations | Code | 1
Co-Learning: Code Learning for Multi-Agent Reinforcement Collaborative Framework with Conversational Natural Language Interfaces | Code | 1
Effective Attention Sheds Light On Interpretability | Code | 1
KITLM: Domain-Specific Knowledge InTegration into Language Models for Question Answering | Code | 1
CBR-RAG: Case-Based Reasoning for Retrieval Augmented Generation in LLMs for Legal Question Answering | Code | 1
Collaborative Large Language Model for Recommender Systems | Code | 1
Effective Batching for Recurrent Neural Network Grammars | Code | 1
Collaborative Retrieval for Large Language Model-based Conversational Recommender Systems | Code | 1
EDA Corpus: A Large Language Model Dataset for Enhanced Interaction with OpenROAD | Code | 1
Collab-RAG: Boosting Retrieval-Augmented Generation for Complex Question Answering via White-Box and Black-Box LLM Collaboration | Code | 1
CB-Conformer: Contextual biasing Conformer for biased word recognition | Code | 1
ECG-Byte: A Tokenizer for End-to-End Generative Electrocardiogram Language Modeling | Code | 1
A Reference-less Quality Metric for Automatic Speech Recognition via Contrastive-Learning of a Multi-Language Model with Self-Supervision | Code | 1
Collective Constitutional AI: Aligning a Language Model with Public Input | Code | 1
Effective Human-AI Teams via Learned Natural Language Rules and Onboarding | Code | 1
CoLLMLight: Cooperative Large Language Model Agents for Network-Wide Traffic Signal Control | Code | 1
Knowledge Entropy Decay during Language Model Pretraining Hinders New Knowledge Acquisition | Code | 1
Knowledge Graph-Driven Retrieval-Augmented Generation: Integrating Deepseek-R1 with Weaviate for Advanced Chatbot Applications | Code | 1
Causal Structure Learning Supervised by Large Language Model | Code | 1
A Refer-and-Ground Multimodal Large Language Model for Biomedicine | Code | 1
Page 52 of 284
