SOTAVerified

Language Modeling

Papers

Showing 17511800 of 14182 papers

TitleStatusHype
Allocating Large Vocabulary Capacity for Cross-lingual Language Model Pre-trainingCode1
Evolving Deep Neural NetworksCode1
Excuse me, sir? Your language model is leaking (information)Code1
An Engorgio Prompt Makes Large Language Model Babble onCode1
Event Causality Identification via Derivative Prompt Joint LearningCode1
Can ChatGPT Replace Traditional KBQA Models? An In-depth Analysis of the Question Answering Performance of the GPT LLM FamilyCode1
Evaluation Benchmarks for Spanish Sentence RepresentationsCode1
Evolutionary Large Language Model for Automated Feature TransformationCode1
Expert Knowledge-Aware Image Difference Graph Representation Learning for Difference-Aware Medical Visual Question AnsweringCode1
Evaluating Language Models as Synthetic Data GeneratorsCode1
Evaluating Morphological Alignment of Tokenizers in 70 LanguagesCode1
Large Language Model Inference Acceleration: A Comprehensive Hardware PerspectiveCode1
Large Language Model Is Not a Good Few-shot Information Extractor, but a Good Reranker for Hard Samples!Code1
Large Language Model (LLM) as a System of Multiple Expert Agents: An Approach to solve the Abstraction and Reasoning Corpus (ARC) ChallengeCode1
AuditWen:An Open-Source Large Language Model for AuditCode1
Evaluating Language Model Finetuning Techniques for Low-resource LanguagesCode1
Mathfish: Evaluating Language Model Math Reasoning via Grounding in Educational CurriculaCode1
Can Large Language Model Comprehend Ancient Chinese? A Preliminary Test on ACLUECode1
Beyond Prompt Engineering: Robust Behavior Control in LLMs via Steering Target AtomsCode1
Can Large Language Model Agents Balance Energy Systems?Code1
VLLaVO: Mitigating Visual Gap through LLMsCode1
Evaluating Retrieval Quality in Retrieval-Augmented GenerationCode1
Large Language Models Can Be Easily Distracted by Irrelevant ContextCode1
Evaluating Human-Language Model InteractionCode1
BreakGPT: A Large Language Model with Multi-stage Structure for Financial Breakout DetectionCode1
Protein Structure Tokenization: Benchmarking and New RecipeCode1
Bot or Human? Detecting ChatGPT Imposters with A Single QuestionCode1
Large Language Model Unlearning via Embedding-Corrupted PromptsCode1
Evaluating Language Model Context Windows: A "Working Memory" Test and Inference-time CorrectionCode1
Beyond the Next Token: Towards Prompt-Robust Zero-Shot Classification via Efficient Multi-Token PredictionCode1
Supervised Learning and Large Language Model Benchmarks on Mental Health Datasets: Cognitive Distortions and Suicidal Risks in Chinese Social MediaCode1
CORAL: Expert-Curated medical Oncology Reports to Advance Language Model InferenceCode1
ESRL: Efficient Sampling-based Reinforcement Learning for Sequence GenerationCode1
Latxa: An Open Language Model and Evaluation Suite for BasqueCode1
Adversarial Training for Aspect-Based Sentiment Analysis with BERTCode1
Establishing baselines for generative discovery of inorganic crystalsCode1
ESCOXLM-R: Multilingual Taxonomy-driven Pre-training for the Job Market DomainCode1
Bias-Augmented Consistency Training Reduces Biased Reasoning in Chain-of-ThoughtCode1
Espresso: A Fast End-to-end Neural Speech Recognition ToolkitCode1
BiasEdit: Debiasing Stereotyped Language Models via Model EditingCode1
Estimating Contamination via Perplexity: Quantifying Memorisation in Language Model EvaluationCode1
ERNIE 3.0 Titan: Exploring Larger-scale Knowledge Enhanced Pre-training for Language Understanding and GenerationCode1
Entropy-Regularized Token-Level Policy Optimization for Language Agent ReinforcementCode1
Adaptive Contrastive Search: Uncertainty-Guided Decoding for Open-Ended Text GenerationCode1
Learning Approximate Inference Networks for Structured PredictionCode1
Chess as a Testbed for Language Model State TrackingCode1
Epidemic Modeling with Generative AgentsCode1
Learning Contextual Representations for Semantic Parsing with Generation-Augmented Pre-TrainingCode1
Learning from Unlabeled 3D Environments for Vision-and-Language NavigationCode1
EscapeBench: Pushing Language Models to Think Outside the BoxCode1
Show:102550
← PrevPage 36 of 284Next →

No leaderboard results yet.