SOTAVerified

Large Language Model

Papers

Showing 276300 of 6097 papers

TitleStatusHype
Large Scale Transfer Learning for Tabular Data via Language ModelingCode2
LaVy: Vietnamese Multimodal Large Language ModelCode2
Large Language Model with Region-guided Referring and Grounding for CT Report GenerationCode2
AgentSociety Challenge: Designing LLM Agents for User Modeling and Recommendation on Web PlatformsCode2
Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially FastCode2
CPRet: A Dataset, Benchmark, and Model for Retrieval in Competitive ProgrammingCode2
AgentSims: An Open-Source Sandbox for Large Language Model EvaluationCode2
Alphazero-like Tree-Search can Guide Large Language Model Decoding and TrainingCode2
Large Language Models Play StarCraft II: Benchmarks and A Chain of Summarization ApproachCode2
Diff-eRank: A Novel Rank-Based Metric for Evaluating Large Language ModelsCode2
Large Language Model Enhanced Recommender Systems: A SurveyCode2
Large Language Model Guided Tree-of-ThoughtCode2
Control Industrial Automation System with Large Language Model AgentsCode2
AgentReview: Exploring Peer Review Dynamics with LLM AgentsCode2
LLMEmb: Large Language Model Can Be a Good Embedding Generator for Sequential RecommendationCode2
Large Language Model Psychometrics: A Systematic Review of Evaluation, Validation, and EnhancementCode2
Language Models can Solve Computer TasksCode2
Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile InstructionsCode2
Large Language Model Safety: A Holistic SurveyCode2
KoSBi: A Dataset for Mitigating Social Bias Risks Towards Safer Large Language Model ApplicationCode2
Confucius3-Math: A Lightweight High-Performance Reasoning LLM for Chinese K-12 Mathematics LearningCode2
KnowCoder: Coding Structured Knowledge into LLMs for Universal Information ExtractionCode2
CyberGym: Evaluating AI Agents' Cybersecurity Capabilities with Real-World Vulnerabilities at ScaleCode2
KET-RAG: A Cost-Efficient Multi-Granular Indexing Framework for Graph-RAGCode2
Compiler Optimization via LLM Reasoning for Efficient Model ServingCode2
Show:102550
← PrevPage 12 of 244Next →

No leaderboard results yet.