SOTAVerified

Large Language Model

Papers

Showing 401425 of 6097 papers

TitleStatusHype
CMMLU: Measuring massive multitask language understanding in ChineseCode2
Diff-eRank: A Novel Rank-Based Metric for Evaluating Large Language ModelsCode2
Large Language Model Psychometrics: A Systematic Review of Evaluation, Validation, and EnhancementCode2
An Empirical Evaluation of Using Large Language Models for Automated Unit Test GenerationCode2
Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile InstructionsCode2
ClinicalGPT-R1: Pushing reasoning capability of generalist disease diagnosis with large language modelCode2
AutoMind: Adaptive Knowledgeable Agent for Automated Data ScienceCode2
Dimple: Discrete Diffusion Multimodal Large Language Model with Parallel DecodingCode2
DISC-LawLLM: Fine-tuning Large Language Models for Intelligent Legal ServicesCode2
DISC-FinLLM: A Chinese Financial Large Language Model based on Multiple Experts Fine-tuningCode2
CLIP-MoE: Towards Building Mixture of Experts for CLIP with Diversified Multiplet UpcyclingCode2
LLMEmb: Large Language Model Can Be a Good Embedding Generator for Sequential RecommendationCode2
Large Language Model Safety: A Holistic SurveyCode2
Language Models Can Improve Event Prediction by Few-Shot Abductive ReasoningCode2
Alphazero-like Tree-Search can Guide Large Language Model Decoding and TrainingCode2
Language Models can Solve Computer TasksCode2
KoSBi: A Dataset for Mitigating Social Bias Risks Towards Safer Large Language Model ApplicationCode2
EarthGPT: A Universal Multi-modal Large Language Model for Multi-sensor Image Comprehension in Remote Sensing DomainCode2
KnowCoder: Coding Structured Knowledge into LLMs for Universal Information ExtractionCode2
Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative DecodingCode2
KICGPT: Large Language Model with Knowledge in Context for Knowledge Graph CompletionCode2
DreamLIP: Language-Image Pre-training with Long CaptionsCode2
Large language models can be zero-shot anomaly detectors for time series?Code2
Drive Like a Human: Rethinking Autonomous Driving with Large Language ModelsCode2
Jailbreak Vision Language Models via Bi-Modal Adversarial PromptCode2
Show:102550
← PrevPage 17 of 244Next →

No leaderboard results yet.