SOTAVerified

Language Modeling

Papers

Showing 18011850 of 14182 papers

TitleStatusHype
Estimating the Probability of Sampling a Trained Neural Network at Random0
Brain-inspired sparse training enables Transformers and LLMs to perform as fully connected0
BRiTE: Bootstrapping Reinforced Thinking Process to Enhance Language Model Reasoning0
SELMA: A Speech-Enabled Language Model for Virtual Assistant Interactions0
Scaling Laws for Differentially Private Language Models0
Improving LLM Unlearning Robustness via Random PerturbationsCode0
Importing Phantoms: Measuring LLM Package Hallucination Vulnerabilities0
An Efficient Approach for Machine Translation on Low-resource Languages: A Case Study in Vietnamese-Chinese0
Towards the Worst-case Robustness of Large Language Models0
s1: Simple test-time scalingCode9
Partially Rewriting a Transformer in Natural LanguageCode3
Offline Learning for Combinatorial Multi-armed Bandits0
Structural Embedding Projection for Contextual Large Language Model Inference0
LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language ModelsCode4
Low-Rank Adapting Models for Sparse AutoencodersCode1
Scalable-Softmax Is Superior for AttentionCode1
Intrinsic Tensor Field Propagation in Large Language Models: A Novel Approach to Contextual Information Flow0
Fine-tuning LLaMA 2 interference: a comparative study of language implementations for optimal efficiency0
CALM: Unleashing the Cross-Lingual Self-Aligning Ability of Language Model Question Answering0
Enhancing Large Language Model Efficiencyvia Symbolic Compression: A Formal Approach Towards Interpretability0
Efficiency and Effectiveness of LLM-Based Summarization of Evidence in Crowdsourced Fact-Checking0
Investigating Tax Evasion Emergence Using Dual Large Language Model and Deep Reinforcement Learning Powered Agent-based Simulation0
Exploring Audio Editing Features as User-Centric Privacy Defenses Against Large Language Model(LLM) Based Emotion Inference Attacks0
WILDCHAT-50M: A Deep Dive Into the Role of Synthetic Data in Post-TrainingCode1
Loss Functions and Operators Generated by f-Divergences0
Differentially Private Steering for Large Language Model AlignmentCode0
Vision-Language Model Selection and Reuse for Downstream Adaptation0
Token-Hungry, Yet Precise: DeepSeek R1 Highlights the Need for Multi-Step Reasoning Over Speed in MATH0
CLEAR: Cue Learning using Evolution for Accurate Recognition Applied to Sustainability Data Extraction0
Economic Rationality under Specialization: Evidence of Decision Bias in AI Agents0
Can Generative LLMs Create Query Variants for Test Collections? An Exploratory StudyCode0
Perforated Backpropagation: A Neuroscience Inspired Extension to Artificial Neural NetworksCode0
Prompt-oriented Output of Culture-Specific Items in Translated African Poetry by Large Language Model: An Initial Multi-layered Tabular Review0
Large Language Models for Single-Step and Multi-Step Flight Trajectory Prediction0
2SSP: A Two-Stage Framework for Structured Pruning of LLMsCode1
Leveraging Multimodal LLM for Inspirational User Interface SearchCode0
Query-Aware Learnable Graph Pooling Tokens as Prompt for Large Language Models0
DINT Transformer0
Learning Free Token Reduction for Multi-Modal Large Language Models0
From tools to thieves: Measuring and understanding public perceptions of AI through crowdsourced metaphors0
Planning with Vision-Language Models and a Use Case in Robot-Assisted Teaching0
BreezyVoice: Adapting TTS for Taiwanese Mandarin with Enhanced Polyphone Disambiguation -- Challenges and Insights0
DReSS: Data-driven Regularized Structured Streamlining for Large Language Models0
Is Conversational XAI All You Need? Human-AI Decision Making With a Conversational XAI AssistantCode0
Implementation of a Generative AI Assistant in K-12 Education: The CyberScholar Initiative0
SafeRAG: Benchmarking Security in Retrieval-Augmented Generation of Large Language ModelCode2
RadioLLM: Introducing Large Language Model into Cognitive Radio via Hybrid Prompt and Token ReprogrammingsCode1
"Ownership, Not Just Happy Talk": Co-Designing a Participatory Large Language Model for Journalism0
An LLM Benchmark for Addressee Recognition in Multi-modal Multi-party Dialogue0
Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling0
Show:102550
← PrevPage 37 of 284Next →

No leaderboard results yet.