SOTAVerified

Language Modeling

Papers

Showing 701750 of 14182 papers

TitleStatusHype
Adapting a Language Model While Preserving its General KnowledgeCode2
Holmes-VAD: Towards Unbiased and Explainable Video Anomaly Detection via Multi-modal LLMCode2
A Length-Extrapolatable TransformerCode2
Alphazero-like Tree-Search can Guide Large Language Model Decoding and TrainingCode2
How to Index Item IDs for Recommendation Foundation ModelsCode2
ChemReasoner: Heuristic Search over a Large Language Model's Knowledge Space using Quantum-Chemical FeedbackCode2
CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual ScenariosCode2
CAD-Coder: An Open-Source Vision-Language Model for Computer-Aided Design Code GenerationCode2
HMT: Hierarchical Memory Transformer for Long Context Language ProcessingCode2
Huatuo-26M, a Large-scale Chinese Medical QA DatasetCode2
Hungry Hungry Hippos: Towards Language Modeling with State Space ModelsCode2
Leopard: A Vision Language Model For Text-Rich Multi-Image TasksCode2
Citrus: Leveraging Expert Cognitive Pathways in a Medical Language Model for Advanced Medical Decision SupportCode2
LHRS-Bot: Empowering Remote Sensing with VGI-Enhanced Large Multimodal Language ModelCode2
LifelongAgentBench: Evaluating LLM Agents as Lifelong LearnersCode2
LaVy: Vietnamese Multimodal Large Language ModelCode2
mDPO: Conditional Preference Optimization for Multimodal Large Language ModelsCode2
ClinicalGPT-R1: Pushing reasoning capability of generalist disease diagnosis with large language modelCode2
BLSP-Emo: Towards Empathetic Large Speech-Language ModelsCode2
BMFM-RNA: An Open Framework for Building and Evaluating Transcriptomic Foundation ModelsCode2
Block Transformer: Global-to-Local Language Modeling for Fast InferenceCode2
Block-Recurrent TransformersCode2
Beyond Text-Visual Attention: Exploiting Visual Cues for Effective Token Pruning in VLMsCode2
A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and ToxicityCode2
Blockwise Parallel Transformer for Large Context ModelsCode2
Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-FlowCode2
Black-Box Tuning for Language-Model-as-a-ServiceCode2
LLaMEA: A Large Language Model Evolutionary Algorithm for Automatically Generating MetaheuristicsCode2
BitNet: Scaling 1-bit Transformers for Large Language ModelsCode2
LLaMP: Large Language Model Made Powerful for High-fidelity Materials Knowledge Retrieval and DistillationCode2
Language Model Powered Digital Biology with BRADCode2
Grounding Language Models to Images for Multimodal Inputs and OutputsCode2
MedCPT: Contrastive Pre-trained Transformers with Large-scale PubMed Search Logs for Zero-shot Biomedical Information RetrievalCode2
GroundingSuite: Measuring Complex Multi-Granular Pixel GroundingCode2
GuidedQuant: Large Language Model Quantization via Exploiting End Loss GuidanceCode2
GraphWiz: An Instruction-Following Language Model for Graph ProblemsCode2
GraphTranslator: Aligning Graph Model to Large Language Model for Open-ended TasksCode2
LLMGA: Multimodal Large Language Model based Generation AssistantCode2
BigBIO: A Framework for Data-Centric Biomedical Natural Language ProcessingCode2
LLM-Seg: Bridging Image Segmentation and Large Language Model ReasoningCode2
CogView2: Faster and Better Text-to-Image Generation via Hierarchical TransformersCode2
Graph-Aware Isomorphic Attention for Adaptive Dynamics in TransformersCode2
OptMetaOpenFOAM: Large Language Model Driven Chain of Thought for Sensitivity Analysis and Parameter Optimization based on CFDCode2
Collaborative Expert LLMs Guided Multi-Objective Molecular OptimizationCode2
Graph Language ModelsCode2
Grounded 3D-LLM with Referent TokensCode2
Accelerating Large Language Model Decoding with Speculative SamplingCode2
Longhorn: State Space Models are Amortized Online LearnersCode2
VHM: Versatile and Honest Vision Language Model for Remote Sensing Image AnalysisCode2
HGRN2: Gated Linear RNNs with State ExpansionCode2
Show:102550
← PrevPage 15 of 284Next →

No leaderboard results yet.