SOTAVerified

Large Language Model

Papers

Showing 851–875 of 6097 papers

Title | Status | Hype
BASE-SQL: A powerful open source Text-To-SQL baseline approach | Code | 1
DRG-LLaMA : Tuning LLaMA Model to Predict Diagnosis-related Group for Hospitalized Patients | Code | 1
3UR-LLM: An End-to-End Multimodal Large Language Model for 3D Scene Understanding | Code | 1
DRESSing Up LLM: Efficient Stylized Question-Answering via Style Subspace Editing | Code | 1
Learning to Generate Unit Tests for Automated Debugging | Code | 1
Let's Synthesize Step by Step: Iterative Dataset Synthesis with Large Language Models by Extrapolating Errors from Small Models | Code | 1
LlamaCare: A Large Medical Language Model for Enhancing Healthcare Knowledge Sharing | Code | 1
FIRE: Fact-checking with Iterative Retrieval and Verification | Code | 1
LEAM: A Prompt-only Large Language Model-enabled Antenna Modeling Method | Code | 1
DP-OPT: Make Large Language Model Your Privacy-Preserving Prompt Engineer | Code | 1
Learning a Structural Causal Model for Intuition Reasoning in Conversation | Code | 1
LAVCap: LLM-based Audio-Visual Captioning using Optimal Transport | Code | 1
FOLIO: Natural Language Reasoning with First-Order Logic | Code | 1
Follow-Up Differential Descriptions: Language Models Resolve Ambiguities for Image Classification | Code | 1
DOMINO: A Dual-System for Multi-step Visual Language Reasoning | Code | 1
Learning the rules of peptide self-assembly through data mining with large language models | Code | 1
Do Large Language Model Benchmarks Test Reliability? | Code | 1
FreeEval: A Modular Framework for Trustworthy and Efficient Evaluation of Large Language Models | Code | 1
From Allies to Adversaries: Manipulating LLM Tool-Calling through Adversarial Injection | Code | 1
Meaning Typed Prompting: A Technique for Efficient, Reliable Structured Output Generation | Code | 1
Latent Reward: LLM-Empowered Credit Assignment in Episodic Reinforcement Learning | Code | 1
Large Language Model Unlearning | Code | 1
Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning | Code | 1
Large Language Model Unlearning via Embedding-Corrupted Prompts | Code | 1
DRAMA-X: A Fine-grained Intent Prediction and Risk Reasoning Benchmark For Driving | Code | 1
Page 35 of 244