SOTAVerified

Language Modeling

Papers

Showing 3101-3150 of 14182 papers

| Title | Status | Hype |
|---|---|---|
| Adaptive Reasoning and Acting in Medical Language Agents | | 0 |
| Collu-Bench: A Benchmark for Predicting Language Model Hallucinations in Code | | 0 |
| EchoPrime: A Multi-Video View-Informed Vision-Language Model for Comprehensive Echocardiography Interpretation | | 0 |
| LoRE: Logit-Ranked Retriever Ensemble for Enhancing Open-Domain Question Answering | | 0 |
| COrAL: Order-Agnostic Language Modeling for Efficient Iterative Refinement | Code | 0 |
| Impeding LLM-assisted Cheating in Introductory Programming Assignments via Adversarial Perturbation | | 0 |
| LINKED: Eliciting, Filtering and Integrating Knowledge in Large Language Model for Commonsense Reasoning | Code | 0 |
| Enterprise Benchmarks for Large Language Model Evaluation | Code | 0 |
| LLMD: A Large Language Model for Interpreting Longitudinal Medical Records | | 0 |
| nach0-pc: Multi-task Language Model with Molecular Point Cloud Encoder | | 0 |
| Language-Model-Assisted Bi-Level Programming for Reward Learning from Internet Videos | | 0 |
| ACER: Automatic Language Model Context Extension via Retrieval | | 0 |
| The Same But Different: Structural Similarities and Differences in Multilingual Language Modeling | | 0 |
| Enhancing Multi-Step Reasoning Abilities of Language Models through Direct Q-Function Optimization | Code | 2 |
| Can a large language model be a gaslighter? | Code | 0 |
| Calibrated Cache Model for Few-Shot Vision-Language Model Adaptation | | 0 |
| PEAR: A Robust and Flexible Automation Framework for Ptychography Enabled by Multiple Large Language Model Agents | Code | 1 |
| Distributionally robust self-supervised learning for tabular data | Code | 0 |
| Efficiently Scanning and Resampling Spatio-Temporal Tasks with Irregular Observations | | 0 |
| Aerial Vision-and-Language Navigation via Semantic-Topo-Metric Representation Guided LLM Reasoning | | 0 |
| Generation with Dynamic Vocabulary | Code | 0 |
| MedMobile: A mobile-sized language model with expert-level clinical capabilities | Code | 0 |
| Retraining-Free Merging of Sparse MoE via Hierarchical Clustering | Code | 1 |
| Parameter-Efficient Fine-Tuning of State Space Models | Code | 1 |
| SocialGaze: Improving the Integration of Human Social Norms in Large Language Models | Code | 0 |
| Baichuan-Omni Technical Report | Code | 3 |
| Lifelong Event Detection via Optimal Transport | | 0 |
| Zeroth-Order Fine-Tuning of LLMs in Random Subspaces | Code | 1 |
| Preferential Normalizing Flows | | 0 |
| Towards Trustworthy Knowledge Graph Reasoning: An Uncertainty Aware Perspective | | 0 |
| Do Unlearning Methods Remove Information from Language Model Weights? | Code | 1 |
| ViT3D Alignment of LLaMA3: 3D Medical Image Report Generation | | 0 |
| ∀uto∃∨∧L: Autonomous Evaluation of LLMs for Truth Maintenance and Reasoning Tasks | | 0 |
| Hypothesis-only Biases in Large Language Model-Elicited Natural Language Inference | | 0 |
| VLM See, Robot Do: Human Demo Video to Robot Action Plan via Vision Language Model | | 0 |
| Emergent social conventions and collective bias in LLM populations | | 0 |
| PoisonBench: Assessing Large Language Model Vulnerability to Data Poisoning | Code | 1 |
| SimpleStrat: Diversifying Language Model Generation with Stratification | | 0 |
| Simultaneous Reward Distillation and Preference Learning: Get You a Language Model Who Can Do Both | | 0 |
| A Framework for Collaborating a Large Language Model Tool in Brainstorming for Triggering Creative Thoughts | | 0 |
| The Large Language Model GreekLegalRoBERTa | | 0 |
| HLM-Cite: Hybrid Language Model Workflow for Text-based Scientific Citation Prediction | Code | 0 |
| LecPrompt: A Prompt-based Approach for Logical Error Correction with CodeBERT | | 0 |
| Semantic Self-Consistency: Enhancing Language Model Reasoning via Semantic Weighting | | 0 |
| Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining | Code | 1 |
| Bilinear MLPs enable weight-based mechanistic interpretability | Code | 1 |
| Language model developers should report train-test overlap | | 0 |
| Uncovering Overfitting in Large Language Model Editing | | 0 |
| Mechanistic Permutability: Match Features Across Layers | | 0 |
| AgroGPT: Efficient Agricultural Vision-Language Model with Expert Tuning | Code | 1 |

No leaderboard results yet.