SOTAVerified

Language Modeling

Papers

Showing 551600 of 14182 papers

TitleStatusHype
Towards DS-NER: Unveiling and Addressing Latent Noise in Distant AnnotationsCode0
DS-ProGen: A Dual-Structure Deep Language Model for Functional Protein Design0
SLOT: Sample-specific Language Model Optimization at Test-timeCode2
Self-Destructive Language Model0
Mitigating Content Effects on Reasoning in Language Models through Fine-Grained Activation Steering0
Beyond Frameworks: Unpacking Collaboration Strategies in Multi-Agent Systems0
NeuroGen: Neural Network Parameter Generation via Large Language Models0
SGDPO: Self-Guided Direct Preference Optimization for Language Model Alignment0
From n-gram to Attention: How Model Architectures Learn and Propagate Bias in Language Modeling0
mCLM: A Function-Infused and Synthesis-Friendly Modular Chemical Language Model0
Bridging Generative and Discriminative Learning: Few-Shot Relation Extraction via Two-Stage Knowledge-Guided Pre-trainingCode0
LifelongAgentBench: Evaluating LLM Agents as Lifelong LearnersCode2
LoRASuite: Efficient LoRA Adaptation Across Large Language Model Upgrades0
Chain-of-Model Learning for Language ModelCode0
Internal Causal Mechanisms Robustly Predict Language Model Out-of-Distribution BehaviorsCode0
An Explanation of Intrinsic Self-Correction via Linear Representations and Latent Concepts0
Communication-Efficient Hybrid Language Model via Uncertainty-Aware Opportunistic and Compressed Transmission0
Efficiently Building a Domain-Specific Large Language Model from Scratch: A Case Study of a Classical Chinese Large Language Model0
Recursive Question Understanding for Complex Question Answering over Heterogeneous Personal Data0
PRS-Med: Position Reasoning Segmentation with Vision-Language Model in Medical Imaging0
CorBenchX: Large-Scale Chest X-Ray Error Dataset and Vision-Language Model Benchmark for Report Error Correction0
SOCIA: An End-to-End Agentic Framework for Automated Cyber-Physical-Social Simulator Generation0
Demystifying and Enhancing the Efficiency of Large Language Model Based Search AgentsCode2
Reasoning Large Language Model Errors Arise from Hallucinating Critical Problem FeaturesCode0
TinyRS-R1: Compact Multimodal Language Model for Remote Sensing0
Noise Injection Systemically Degrades Large Language Model Safety Guardrails0
Token-Level Uncertainty Estimation for Large Language Model Reasoning0
THELMA: Task Based Holistic Evaluation of Large Language Model Applications-RAG Question Answering0
An agentic system with reinforcement-learned subsystem improvements for parsing form-like documentsCode0
Feasibility with Language Models for Open-World Compositional Zero-Shot Learning0
Towards Cultural Bridge by Bahnaric-Vietnamese Translation Using Transfer Learning of Sequence-To-Sequence Pre-training Language Model0
Large Language Model Use Impact Locus of Control0
Sample Efficient Reinforcement Learning via Large Vision Language Model DistillationCode1
Audio Turing Test: Benchmarking the Human-likeness of Large Language Model-based Text-to-Speech Systems in Chinese0
Low-Resource Language Processing: An OCR-Driven Summarization and Translation PipelineCode0
Maximizing Asynchronicity in Event-based Neural Networks0
Efficient Attention via Pre-Scoring: Prioritizing Informative Keys in TransformersCode0
Improving the Data-efficiency of Reinforcement Learning by Warm-starting with LLMCode0
Enhancing Low-Resource Minority Language Translation with LLMs and Retrieval-Augmented Generation for Cultural Nuances0
On DeepSeekMoE: Statistical Benefits of Shared Experts and Normalized Sigmoid Gating0
Unifying Segment Anything in Microscopy with Multimodal Large Language ModelCode1
ADALog: Adaptive Unsupervised Anomaly detection in Logs with Self-attention Masked Language Model0
ChestyBot: Detecting and Disrupting Chinese Communist Party Influence Stratagems0
Tracr-Injection: Distilling Algorithms into Pre-trained Language ModelsCode0
UDDETTS: Unifying Discrete and Dimensional Emotions for Controllable Emotional Text-to-Speech0
Automating Security Audit Using Large Language Model based Agent: An Exploration Experiment0
CRPE: Expanding The Reasoning Capability of Large Language Model for Code Generation0
ComplexFormer: Disruptively Advancing Transformer Inference Ability via Head-Specific Complex Vector AttentionCode0
ImagineBench: Evaluating Reinforcement Learning with Large Language Model RolloutsCode1
Advanced Crash Causation Analysis for Freeway Safety: A Large Language Model Approach to Identifying Key Contributing Factors0
Show:102550
← PrevPage 12 of 284Next →

No leaderboard results yet.