SOTAVerified

Large Language Model

Papers

Showing 176200 of 6097 papers

TitleStatusHype
Partially Rewriting a Transformer in Natural LanguageCode3
HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and GenerationCode3
VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language ModelCode3
Lifelong Learning of Large Language Model based Agents: A RoadmapCode3
Valley2: Exploring Multimodal Models with Scalable Vision-Language DesignCode3
LangFair: A Python Package for Assessing Bias and Fairness in Large Language Model Use CasesCode3
A Survey on Large Language Model Acceleration based on KV Cache ManagementCode3
DARWIN 1.5: Large Language Models as Materials Science Adapted LearnersCode3
ATPrompt: Textual Prompt Learning with Embedded AttributesCode3
From Individual to Society: A Survey on Social Simulation Driven by Large Language Model-based AgentsCode3
HackSynth: LLM Agent and Evaluation Framework for Autonomous Penetration TestingCode3
Large Language Model-Brained GUI Agents: A SurveyCode3
Pushing the Limits of Large Language Model Quantization via the Linearity TheoremCode3
BayLing 2: A Multilingual Large Language Model with Efficient Language AlignmentCode3
SemiKong: Curating, Training, and Evaluating A Semiconductor Industry-Specific Large Language ModelCode3
SuffixDecoding: Extreme Speculative Decoding for Emerging AI ApplicationsCode3
COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 TrainingCode3
LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive MemoryCode3
Baichuan-Omni Technical ReportCode3
Towards Next-Generation LLM-based Recommender Systems: A Survey and BeyondCode3
LayerKV: Optimizing Large Language Model Serving with Layer-wise KV Cache ManagementCode3
Programming Every Example: Lifting Pre-training Data Quality like Experts at ScaleCode3
LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at ScaleCode3
OptiMUS-0.3: Using Large Language Models to Model and Solve Optimization Problems at ScaleCode3
Odyssey: Empowering Minecraft Agents with Open-World SkillsCode3
Show:102550
← PrevPage 8 of 244Next →

No leaderboard results yet.