SOTAVerified

Language Modeling

Papers

Showing 251-300 of 14182 papers

Title | Status | Hype
MLorc: Momentum Low-rank Compression for Large Language Model Adaptation | - | 0
Parameter Efficient Fine Tuning Llama 3.1 for Answering Arabic Legal Questions: A Case Study on Jordanian Laws | Code | 0
Reasoning-Table: Exploring Reinforcement Learning for Table Reasoning | Code | 2
Infinity Parser: Layout Aware Reinforcement Learning for Scanned Document Parsing | Code | 0
HouseTS: A Large-Scale, Multimodal Spatiotemporal U.S. Housing Dataset | - | 0
CLAP-ART: Automated Audio Captioning with Semantic-rich Audio Representation Tokenizer | - | 0
EEG2TEXT-CN: An Exploratory Study of Open-Vocabulary Chinese Text-EEG Alignment via Large Language Model and Contrastive Learning on ChineseEEG | - | 0
Language-Guided Multi-Agent Learning in Simulations: A Unified Framework and Evaluation | - | 0
GigaAM: Efficient Self-Supervised Learner for Speech Recognition | Code | 4
NTPP: Generative Speech Language Modeling for Dual-Channel Spoken Dialogue via Next-Token-Pair Prediction | - | 0
A Large Language Model-Supported Threat Modeling Framework for Transportation Cyber-Physical Systems | - | 0
Goal-Aware Identification and Rectification of Misinformation in Multi-Agent Systems | Code | 0
Chain-of-Thought Training for Open E2E Spoken Dialogue Systems | - | 0
MagiCodec: Simple Masked Gaussian-Injected Codec for High-Fidelity Reconstruction and Generation | Code | 2
Translate With Care: Addressing Gender Bias, Neutrality, and Reasoning in Large Language Model Translations | Code | 0
Hierarchical Level-Wise News Article Clustering via Multilingual Matryoshka Embeddings | - | 0
MythTriage: Scalable Detection of Opioid Use Disorder Myths on a Video-Sharing Platform | - | 0
Speech Token Prediction via Compressed-to-fine Language Modeling for Speech Generation | - | 0
Period-LLM: Extending the Periodic Capability of Multimodal Large Language Model | Code | 1
Drop Dropout on Single-Epoch Language Model Pretraining | Code | 0
Chameleon: A Flexible Data-mixing Framework for Language Model Pretraining and Finetuning | Code | 0
Beyond Multiple Choice: Evaluating Steering Vectors for Adaptive Free-Form Summarization | - | 0
Transformers Are Universally Consistent | - | 0
Can Slow-thinking LLMs Reason Over Time? Empirical Studies in Time Series Forecasting | Code | 1
Dynamic Context-Aware Streaming Pretrained Language Model For Inverse Text Normalization | - | 0
Circuit Stability Characterizes Language Model Generalization | Code | 0
Interpreting Large Text-to-Image Diffusion Models with Dictionary Learning | Code | 0
From Macro to Micro: Probing Dataset Diversity in Language Model Fine-Tuning | - | 0
ReasonGen-R1: CoT for Autoregressive Image generation models through SFT and RL | Code | 2
Accelerated Sampling from Masked Diffusion Models via Entropy Bounded Unmasking | - | 0
CREFT: Sequential Multi-Agent LLM for Character Relation Extraction | - | 0
TRIDENT: Enhancing Large Language Model Safety with Tri-Dimensional Diversified Red-Teaming Data Synthesis | Code | 0
HardTests: Synthesizing High-Quality Test Cases for LLM Coding | - | 0
Grid-LOGAT: Grid Based Local and Global Area Transcription for Video Question Answering | - | 0
Probing the Robustness Properties of Neural Speech Codecs | Code | 0
GradPower: Powering Gradients for Faster Language Model Pre-Training | - | 0
How much do language models memorize? | - | 0
Intuitionistic Fuzzy Sets for Large Language Model Data Annotation: A Novel Approach to Side-by-Side Preference Labeling | - | 0
GeoVision Labeler: Zero-Shot Geospatial Classification with Vision and Language Models | Code | 2
FABLE: A Novel Data-Flow Analysis Benchmark on Procedural Text for Large Language Model Evaluation | Code | 0
Reducing Latency in LLM-Based Natural Language Commands Processing for Robot Navigation | - | 0
Preemptive Hallucination Reduction: An Input-Level Approach for Multimodal Language Model | - | 0
Hidden Persuasion: Detecting Manipulative Narratives on Social Media During the 2022 Russian Invasion of Ukraine | - | 0
Large Language Model-Based Agents for Automated Research Reproducibility: An Exploratory Study in Alzheimer's Disease | - | 0
Critical Batch Size Revisited: A Simple Empirical Approach to Large-Batch Language Model Training | - | 0
FLAT-LLM: Fine-grained Low-rank Activation Space Transformation for Large Language Model Compression | Code | 0
Actor-Critic based Online Data Mixing For Language Model Pre-Training | - | 0
Large Language Model Meets Constraint Propagation | - | 0
Dataset Cartography for Large Language Model Alignment: Mapping and Diagnosing Preference Data | - | 0
Position: Federated Foundation Language Model Post-Training Should Focus on Open-Source Models | - | 0
Page 6 of 284

No leaderboard results yet.