SOTAVerified

Large Language Model

Papers

Showing 451500 of 6097 papers

TitleStatusHype
Learning What Reinforcement Learning Can't: Interleaved Online Fine-Tuning for Hardest QuestionsCode2
LHRS-Bot-Nova: Improved Multimodal Large Language Model for Remote Sensing Vision-Language InterpretationCode2
CMMLU: Measuring massive multitask language understanding in ChineseCode2
CLIP-MoE: Towards Building Mixture of Experts for CLIP with Diversified Multiplet UpcyclingCode2
LaVy: Vietnamese Multimodal Large Language ModelCode2
Libra: Building Decoupled Vision System on Large Language ModelsCode2
Large Language Model with Region-guided Referring and Grounding for CT Report GenerationCode2
Alignment faking in large language modelsCode2
FBI-LLM: Scaling Up Fully Binarized LLMs from Scratch via Autoregressive DistillationCode2
RAG-Driver: Generalisable Driving Explanations with Retrieval-Augmented In-Context Learning in Multi-Modal Large Language ModelCode2
Large language models can be zero-shot anomaly detectors for time series?Code2
Aligning to Thousands of Preferences via System Message GeneralizationCode2
Large Language Models Play StarCraft II: Benchmarks and A Chain of Summarization ApproachCode2
User Behavior Simulation with Large Language Model based AgentsCode2
Large Scale Transfer Learning for Tabular Data via Language ModelingCode2
RegMix: Data Mixture as Regression for Language Model Pre-trainingCode2
Large Language Model Guided Tree-of-ThoughtCode2
Diff-eRank: A Novel Rank-Based Metric for Evaluating Large Language ModelsCode2
Large Language Model Psychometrics: A Systematic Review of Evaluation, Validation, and EnhancementCode2
ClinicalGPT-R1: Pushing reasoning capability of generalist disease diagnosis with large language modelCode2
Large Language Model Safety: A Holistic SurveyCode2
L-AutoDA: Leveraging Large Language Models for Automated Decision-based Adversarial AttacksCode2
ARAGOG: Advanced RAG Output GradingCode2
FLAME: Financial Large-Language Model Assessment and Metrics EvaluationCode2
LifeGPT: Topology-Agnostic Generative Pretrained Transformer Model for Cellular AutomataCode2
ChemReasoner: Heuristic Search over a Large Language Model's Knowledge Space using Quantum-Chemical FeedbackCode2
ChatScene: Knowledge-Enabled Safety-Critical Scenario Generation for Autonomous VehiclesCode2
Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUsCode2
ChatTime: A Unified Multimodal Time Series Foundation Model Bridging Numerical and Textual DataCode2
Algorithm Evolution Using Large Language ModelCode2
AgentSims: An Open-Source Sandbox for Large Language Model EvaluationCode2
500xCompressor: Generalized Prompt Compression for Large Language ModelsCode2
Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially FastCode2
From Redundancy to Relevance: Information Flow in LVLMs Across Reasoning TasksCode2
AgentSociety Challenge: Designing LLM Agents for User Modeling and Recommendation on Web PlatformsCode2
From Words to Numbers: Your Large Language Model Is Secretly A Capable Regressor When Given In-Context ExamplesCode2
Language Models Can Improve Event Prediction by Few-Shot Abductive ReasoningCode2
KoSBi: A Dataset for Mitigating Social Bias Risks Towards Safer Large Language Model ApplicationCode2
Language Models can Solve Computer TasksCode2
Chat-3D: Data-efficiently Tuning Large Language Model for Universal Dialogue of 3D ScenesCode2
ChartCoder: Advancing Multimodal Large Language Model for Chart-to-Code GenerationCode2
ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference OptimizationCode2
KET-RAG: A Cost-Efficient Multi-Granular Indexing Framework for Graph-RAGCode2
Generate rather than Retrieve: Large Language Models are Strong Context GeneratorsCode2
Keeping Yourself is Important in Downstream Tuning Multimodal Large Language ModelCode2
KICGPT: Large Language Model with Knowledge in Context for Knowledge Graph CompletionCode2
KnowCoder: Coding Structured Knowledge into LLMs for Universal Information ExtractionCode2
SegEarth-R1: Geospatial Pixel Reasoning via Large Language ModelCode2
Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile InstructionsCode2
Iteration of Thought: Leveraging Inner Dialogue for Autonomous Large Language Model ReasoningCode2
Show:102550
← PrevPage 10 of 122Next →

No leaderboard results yet.