SOTAVerified

Language Modeling

Papers

Showing 36513700 of 14182 papers

TitleStatusHype
CMLFormer: A Dual Decoder Transformer with Switching Point Learning for Code-Mixed Language Modeling0
VocalAgent: Large Language Models for Vocal Health Diagnostics with Safety-Aware Evaluation0
CIE: Controlling Language Model Text Generations Using Continuous SignalsCode0
ReSW-VL: Representation Learning for Surgical Workflow Analysis Using Vision-Language Model0
Krikri: Advancing Open Large Language Models for Greek0
Tianyi: A Traditional Chinese Medicine all-rounder language model and its Real-World Clinical Practice0
IDEAL: Data Equilibrium Adaptation for Multi-Capability Language Model Alignment0
R1dacted: Investigating Local Censorship in DeepSeek's R1 Language Model0
CALM: Co-evolution of Algorithms and Language Model for Automatic Heuristic Design0
Bridging Generative and Discriminative Learning: Few-Shot Relation Extraction via Two-Stage Knowledge-Guided Pre-trainingCode0
Beyond Frameworks: Unpacking Collaboration Strategies in Multi-Agent Systems0
Self-Destructive Language Model0
Mitigating Content Effects on Reasoning in Language Models through Fine-Grained Activation Steering0
SGDPO: Self-Guided Direct Preference Optimization for Language Model Alignment0
LLM-Based User Simulation for Low-Knowledge Shilling Attacks on Recommender Systems0
From n-gram to Attention: How Model Architectures Learn and Propagate Bias in Language Modeling0
DS-ProGen: A Dual-Structure Deep Language Model for Functional Protein Design0
Towards DS-NER: Unveiling and Addressing Latent Noise in Distant AnnotationsCode0
NeuroGen: Neural Network Parameter Generation via Large Language Models0
mCLM: A Function-Infused and Synthesis-Friendly Modular Chemical Language Model0
An Explanation of Intrinsic Self-Correction via Linear Representations and Latent Concepts0
PRS-Med: Position Reasoning Segmentation with Vision-Language Model in Medical Imaging0
Internal Causal Mechanisms Robustly Predict Language Model Out-of-Distribution BehaviorsCode0
Reasoning Large Language Model Errors Arise from Hallucinating Critical Problem FeaturesCode0
CorBenchX: Large-Scale Chest X-Ray Error Dataset and Vision-Language Model Benchmark for Report Error Correction0
Efficiently Building a Domain-Specific Large Language Model from Scratch: A Case Study of a Classical Chinese Large Language Model0
Recursive Question Understanding for Complex Question Answering over Heterogeneous Personal Data0
Communication-Efficient Hybrid Language Model via Uncertainty-Aware Opportunistic and Compressed Transmission0
TinyRS-R1: Compact Multimodal Language Model for Remote Sensing0
SOCIA: An End-to-End Agentic Framework for Automated Cyber-Physical-Social Simulator Generation0
Chain-of-Model Learning for Language ModelCode0
LoRASuite: Efficient LoRA Adaptation Across Large Language Model Upgrades0
Towards Cultural Bridge by Bahnaric-Vietnamese Translation Using Transfer Learning of Sequence-To-Sequence Pre-training Language Model0
Large Language Model Use Impact Locus of Control0
Audio Turing Test: Benchmarking the Human-likeness of Large Language Model-based Text-to-Speech Systems in Chinese0
Low-Resource Language Processing: An OCR-Driven Summarization and Translation PipelineCode0
Maximizing Asynchronicity in Event-based Neural Networks0
Efficient Attention via Pre-Scoring: Prioritizing Informative Keys in TransformersCode0
Noise Injection Systemically Degrades Large Language Model Safety Guardrails0
THELMA: Task Based Holistic Evaluation of Large Language Model Applications-RAG Question Answering0
Improving the Data-efficiency of Reinforcement Learning by Warm-starting with LLMCode0
On DeepSeekMoE: Statistical Benefits of Shared Experts and Normalized Sigmoid Gating0
Enhancing Low-Resource Minority Language Translation with LLMs and Retrieval-Augmented Generation for Cultural Nuances0
An agentic system with reinforcement-learned subsystem improvements for parsing form-like documentsCode0
Token-Level Uncertainty Estimation for Large Language Model Reasoning0
Feasibility with Language Models for Open-World Compositional Zero-Shot Learning0
Neural Thermodynamic Laws for Large Language Model Training0
VQ-Logits: Compressing the Output Bottleneck of Large Language Models via Vector Quantized Logits0
Advanced Crash Causation Analysis for Freeway Safety: A Large Language Model Approach to Identifying Key Contributing Factors0
CRPE: Expanding The Reasoning Capability of Large Language Model for Code Generation0
Show:102550
← PrevPage 74 of 284Next →

No leaderboard results yet.