SOTAVerified

Language Modeling

Papers

Showing 651700 of 14182 papers

TitleStatusHype
IBSEN: Director-Actor Agent Collaboration for Controllable and Interactive Drama Script GenerationCode2
iLLM-TSC: Integration reinforcement learning and large language model for traffic signal control policy improvementCode2
Can Language Beat Numerical Regression? Language-Based Multimodal Trajectory PredictionCode2
Hungry Hungry Hippos: Towards Language Modeling with State Space ModelsCode2
CAD-Coder: An Open-Source Vision-Language Model for Computer-Aided Design Code GenerationCode2
Hyena Hierarchy: Towards Larger Convolutional Language ModelsCode2
C^2LEVA: Toward Comprehensive and Contamination-Free Language Model EvaluationCode2
Adapting Language Models to Compress ContextsCode2
HuatuoGPT-II, One-stage Training for Medical Adaption of LLMsCode2
Huatuo-26M, a Large-scale Chinese Medical QA DatasetCode2
HyperSeg: Towards Universal Visual Segmentation with Large Language ModelCode2
In-Context Language Learning: Architectures and AlgorithmsCode2
HiGPT: Heterogeneous Graph Language ModelCode2
HGRN2: Gated Linear RNNs with State ExpansionCode2
A Bounding Box is Worth One Token: Interleaving Layout and Text in a Large Language Model for Document UnderstandingCode2
Ignore Previous Prompt: Attack Techniques For Language ModelsCode2
Hierarchical Expert Prompt for Large-Language-Model: An Approach Defeat Elite AI in TextStarCraft II for the First TimeCode2
Implicit Neural Representation for Cooperative Low-light Image EnhancementCode2
Can Large Language Model Agents Simulate Human Trust Behavior?Code2
Improve Vision Language Model Chain-of-thought ReasoningCode2
HMT: Hierarchical Memory Transformer for Long Context Language ProcessingCode2
Algorithm Evolution Using Large Language ModelCode2
Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-FlowCode2
Breaking the Ceiling of the LLM Community by Treating Token Generation as a Classification for EnsemblingCode2
ABodyBuilder3: Improved and scalable antibody structure predictionsCode2
BUMBLE: Unifying Reasoning and Acting with Vision-Language Models for Building-wide Mobile ManipulationCode2
Holmes-VAD: Towards Unbiased and Explainable Video Anomaly Detection via Multi-modal LLMCode2
AlexaTM 20B: Few-Shot Learning Using a Large-Scale Multilingual Seq2Seq ModelCode2
GuidedQuant: Large Language Model Quantization via Exploiting End Loss GuidanceCode2
Grounding Language Models to Images for Multimodal Inputs and OutputsCode2
Causal Agent based on Large Language ModelCode2
Iteration of Thought: Leveraging Inner Dialogue for Autonomous Large Language Model ReasoningCode2
Adapting a Language Model While Preserving its General KnowledgeCode2
Kani: A Lightweight and Highly Hackable Framework for Building Language Model ApplicationsCode2
VLKEB: A Large Vision-Language Model Knowledge Editing BenchmarkCode2
Keeping Yourself is Important in Downstream Tuning Multimodal Large Language ModelCode2
GroundingSuite: Measuring Complex Multi-Granular Pixel GroundingCode2
VHM: Versatile and Honest Vision Language Model for Remote Sensing Image AnalysisCode2
GraphWiz: An Instruction-Following Language Model for Graph ProblemsCode2
A Length-Extrapolatable TransformerCode2
GraphTranslator: Aligning Graph Model to Large Language Model for Open-ended TasksCode2
Grounded 3D-LLM with Referent TokensCode2
How to Index Item IDs for Recommendation Foundation ModelsCode2
In-Context Retrieval-Augmented Language ModelsCode2
LiteTransformerSearch: Training-free Neural Architecture Search for Efficient Language ModelsCode2
GPT Can Solve Mathematical Problems Without a CalculatorCode2
Black-Box Tuning for Language-Model-as-a-ServiceCode2
Characterization of Large Language Model Development in the DatacenterCode2
Block-Recurrent TransformersCode2
GPT-Driver: Learning to Drive with GPTCode2
Show:102550
← PrevPage 14 of 284Next →

No leaderboard results yet.