SOTAVerified

Language Modeling

Papers

Showing 72517300 of 14182 papers

TitleStatusHype
OptiMUS: Optimization Modeling Using MIP Solvers and large language modelsCode2
Transformers and Large Language Models for Chemistry and Drug Discovery0
Scaling Studies for Efficient Parameter Search and Parallelism for Large Language Model Pre-training0
Rethinking Memory and Communication Cost for Efficient Large Language Model Training0
Factual and Personalized Recommendations using Language Models and Reinforcement Learning0
Estimating Numbers without Regression0
Language Model Beats Diffusion -- Tokenizer is Key to Visual GenerationCode4
Transcending the Attention Paradigm: Representation Learning from Geospatial Social Media DataCode0
NEFTune: Noisy Embeddings Improve Instruction FinetuningCode6
Towards Verifiable Generation: A Benchmark for Knowledge-aware Language Model AttributionCode0
A Meta-Learning Perspective on Transformers for Causal Language Modeling0
Guiding Language Model Reasoning with Planning Tokens0
GraphLLM: Boosting Graph Reasoning Ability of Large Language ModelCode1
Terminology-Aware Translation with Constrained Decoding and Large Language Model Prompting0
Transformer Fusion with Optimal TransportCode1
CCAE: A Corpus of Chinese-based Asian Englishes0
Breaking Down Word Semantics from Pre-trained Language Models through Layer-wise Dimension Selection0
InstructDET: Diversifying Referring Object Detection with Generalized InstructionsCode1
Synslator: An Interactive Machine Translation Tool with Online Learning0
UReader: Universal OCR-free Visually-situated Language Understanding with Multimodal Large Language ModelCode1
MindfulDiary: Harnessing Large Language Model to Support Psychiatric Patients' Journaling0
Generative Spoken Language Model based on continuous word-sized audio tokens0
Loose lips sink ships: Mitigating Length Bias in Reinforcement Learning from Human Feedback0
Optimizing Large Language Models to Expedite the Development of Smart Contracts0
Large Language Model (LLM) as a System of Multiple Expert Agents: An Approach to solve the Abstraction and Reasoning Corpus (ARC) ChallengeCode1
ChatRadio-Valuer: A Chat Large Language Model for Generalizable Radiology Report Generation Based on Multi-institution and Multi-system Data0
Prompt-to-OS (P2OS): Revolutionizing Operating Systems and Human-Computer Interaction with Integrated AI Generative Models0
EMO: Earth Mover Distance Optimization for Auto-Regressive Language ModelingCode1
Tree-GPT: Modular Large Language Model Expert System for Forest Remote Sensing Image Understanding and Interactive Analysis0
Question-focused Summarization by Decomposing Articles into Facts and Opinions and Retrieving Entities0
ILuvUI: Instruction-tuned LangUage-Vision modeling of UIs from Machine Conversations0
BrainSCUBA: Fine-Grained Natural Language Captions of Visual Cortex Selectivity0
RECOMP: Improving Retrieval-Augmented LMs with Compression and Selective AugmentationCode1
Quantized Transformer Language Model Implementations on Edge Devices0
Functional Interpolation for Relative Positions Improves Long Context Transformers0
From task structures to world models: What do LLMs know?0
An In-Context Learning Agent for Formal Theorem-ProvingCode1
Chain of Natural Language Inference for Reducing Large Language Model Ungrounded HallucinationsCode1
PepMLM: Target Sequence-Conditioned Generation of Therapeutic Peptide Binders via Span Masked Language ModelingCode1
LLM Based Multi-Document Summarization Exploiting Main-Event Biased Monotone Submodular Content Extraction0
Neural Language Model Pruning for Automatic Speech Recognition0
Controllable Multi-document Summarization: Coverage & Coherence Intuitive Policy with Large Language Model Based Rewards0
GoLLIE: Annotation Guidelines improve Zero-Shot Information-ExtractionCode2
TRAM: Bridging Trust Regions and Sharpness Aware MinimizationCode0
DSPy: Compiling Declarative Language Model Calls into Self-Improving PipelinesCode7
Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference OptimizationCode1
Deep Representations of First-person Pronouns for Prediction of Depression Symptom Severity0
A 5' UTR Language Model for Decoding Untranslated Regions of mRNA and Function Predictions0
DOMINO: A Dual-System for Multi-step Visual Language ReasoningCode1
Large Language Model Cascades with Mixture of Thoughts Representations for Cost-efficient ReasoningCode1
Show:102550
← PrevPage 146 of 284Next →

No leaderboard results yet.