SOTAVerified

Language Modeling

Papers

Showing 20012050 of 14182 papers

TitleStatusHype
Mathfish: Evaluating Language Model Math Reasoning via Grounding in Educational CurriculaCode1
Evaluating Language Model Context Windows: A "Working Memory" Test and Inference-time CorrectionCode1
AcTune: Uncertainty-aware Active Self-Training for Semi-Supervised Active Learning with Pretrained Language ModelsCode1
Evaluating Language Model Finetuning Techniques for Low-resource LanguagesCode1
NLP Evaluation in trouble: On the Need to Measure LLM Data Contamination for each BenchmarkCode1
NLP From Scratch Without Large-Scale Pretraining: A Simple and Efficient FrameworkCode1
NoMAD-Attention: Efficient LLM Inference on CPUs Through Multiply-add-free AttentionCode1
Evaluating Morphological Alignment of Tokenizers in 70 LanguagesCode1
Non-Exchangeable Conformal Language Generation with Nearest NeighborsCode1
Nonparametric Masked Language ModelingCode1
Event Causality Identification via Derivative Prompt Joint LearningCode1
Euphemistic Phrase Detection by Masked Language ModelCode1
ByGPT5: End-to-End Style-conditioned Poetry Generation with Token-free Language ModelsCode1
Not All Memories are Created Equal: Learning to ExpireCode1
Atla Selene Mini: A General Purpose Evaluation ModelCode1
EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability TreesCode1
Estimating Contamination via Perplexity: Quantifying Memorisation in Language Model EvaluationCode1
2SSP: A Two-Stage Framework for Structured Pruning of LLMsCode1
Estimating the Carbon Footprint of BLOOM, a 176B Parameter Language ModelCode1
A Pilot Study of Text-to-SQL Semantic Parsing for VietnameseCode1
Evaluating Human-Language Model InteractionCode1
CAB: Comprehensive Attention Benchmarking on Long Sequence ModelingCode1
Espresso: A Fast End-to-end Neural Speech Recognition ToolkitCode1
ESCOXLM-R: Multilingual Taxonomy-driven Pre-training for the Job Market DomainCode1
ESRL: Efficient Sampling-based Reinforcement Learning for Sequence GenerationCode1
Mask-Predict: Parallel Decoding of Conditional Masked Language ModelsCode1
Cascaded Head-colliding AttentionCode1
Establishing baselines for generative discovery of inorganic crystalsCode1
InferCept: Efficient Intercept Support for Augmented Large Language Model InferenceCode1
On Diversified Preferences of Large Language Model AlignmentCode1
Exploring the Limits of Language ModelingCode1
Entropy-Regularized Token-Level Policy Optimization for Language Agent ReinforcementCode1
On Faithfulness and Factuality in Abstractive SummarizationCode1
Entity Tracking in Language ModelsCode1
On Measuring Social Biases in Prompt-Based Multi-Task LearningCode1
Cal-DPO: Calibrated Direct Preference Optimization for Language Model AlignmentCode1
Epidemic Modeling with Generative AgentsCode1
Enhancing Vision-Language Model with Unmasked Token AlignmentCode1
A comprehensive evaluation of ChatGPT's zero-shot Text-to-SQL capabilityCode1
Enhancing the Protein Tertiary Structure Prediction by Multiple Sequence Alignment GenerationCode1
Call for Papers -- The BabyLM Challenge: Sample-efficient pretraining on a developmentally plausible corpusCode1
DExperts: Decoding-Time Controlled Text Generation with Experts and Anti-ExpertsCode1
Enriching Music Descriptions with a Finetuned-LLM and Metadata for Text-to-Music RetrievalCode1
On the Limitations of Cross-lingual Encoders as Exposed by Reference-Free Machine Translation EvaluationCode1
ERNIE 3.0 Titan: Exploring Larger-scale Knowledge Enhanced Pre-training for Language Understanding and GenerationCode1
On the Sentence Embeddings from Pre-trained Language ModelsCode1
Algorithmic progress in language modelsCode1
A Tensorized Transformer for Language ModelingCode1
Enhancing Reasoning to Adapt Large Language Models for Domain-Specific ApplicationsCode1
Enhancing Perception of Key Changes in Remote Sensing Image Change CaptioningCode1
Show:102550
← PrevPage 41 of 284Next →

No leaderboard results yet.