SOTAVerified

Large Language Model

Papers

Showing 10511100 of 6097 papers

TitleStatusHype
EDA Corpus: A Large Language Model Dataset for Enhanced Interaction with OpenROADCode1
M-ABSA: A Multilingual Dataset for Aspect-Based Sentiment AnalysisCode1
MAGIC: Generating Self-Correction Guideline for In-Context Text-to-SQLCode1
Can ChatGPT Replace Traditional KBQA Models? An In-depth Analysis of the Question Answering Performance of the GPT LLM FamilyCode1
FOLIO: Natural Language Reasoning with First-Order LogicCode1
EarthMarker: A Visual Prompting Multi-modal Large Language Model for Remote SensingCode1
DynaPipe: Optimizing Multi-task Training through Dynamic PipelinesCode1
Empowering Large Language Model for Continual Video Question Answering with Collaborative PromptingCode1
Aligning LLM Agents by Learning Latent Preference from User EditsCode1
Dynamic Updates for Language Adaptation in Visual-Language TrackingCode1
ECAMP: Entity-centered Context-aware Medical Vision Language Pre-trainingCode1
Effective Human-AI Teams via Learned Natural Language Rules and OnboardingCode1
M^3GPT: An Advanced Multimodal, Multitask Framework for Motion Comprehension and GenerationCode1
Loop Copilot: Conducting AI Ensembles for Music Generation and Iterative EditingCode1
DrugAssist: A Large Language Model for Molecule OptimizationCode1
LoRAShear: Efficient Large Language Model Structured Pruning and Knowledge RecoveryCode1
Lossless Acceleration of Large Language Model via Adaptive N-gram Parallel DecodingCode1
DRG-LLaMA : Tuning LLaMA Model to Predict Diagnosis-related Group for Hospitalized PatientsCode1
DRESSing Up LLM: Efficient Stylized Question-Answering via Style Subspace EditingCode1
LOLA -- An Open-Source Massively Multilingual Large Language ModelCode1
LMR-BENCH: Evaluating LLM Agent's Ability on Reproducing Language Modeling ResearchCode1
DRAMA-X: A Fine-grained Intent Prediction and Risk Reasoning Benchmark For DrivingCode1
Locret: Enhancing Eviction in Long-Context LLM Inference with Trained Retaining Heads on Consumer-Grade DevicesCode1
DStruct2Design: Data and Benchmarks for Data Structure Driven Generative Floor Plan DesignCode1
A Unified Framework for Multi-Domain CTR Prediction via Large Language ModelsCode1
LogQuant: Log-Distributed 2-Bit Quantization of KV Cache with Superior Accuracy PreservationCode1
LRSCLIP: A Vision-Language Foundation Model for Aligning Remote Sensing Image with Longer TextCode1
LLM-SR: Scientific Equation Discovery via Programming with Large Language ModelsCode1
LLMVA-GEBC: Large Language Model with Video Adapter for Generic Event Boundary CaptioningCode1
DOMINO: A Dual-System for Multi-step Visual Language ReasoningCode1
LLM Self Defense: By Self Examination, LLMs Know They Are Being TrickedCode1
LLMZip: Lossless Text Compression using Large Language ModelsCode1
LLMs and the Abstraction and Reasoning Corpus: Successes, Failures, and the Importance of Object-based RepresentationsCode1
LLM-Rubric: A Multidimensional, Calibrated Approach to Automated Evaluation of Natural Language TextsCode1
AuditWen:An Open-Source Large Language Model for AuditCode1
EDINET-Bench: Evaluating LLMs on Complex Financial Tasks using Japanese Financial StatementsCode1
Do Large Language Model Benchmarks Test Reliability?Code1
LLMs Can Simulate Standardized Patients via Agent CoevolutionCode1
DMoERM: Recipes of Mixture-of-Experts for Effective Reward ModelingCode1
Divide and Translate: Compositional First-Order Logic Translation and Verification for Complex Logical ReasoningCode1
DiveR-CT: Diversity-enhanced Red Teaming Large Language Model Assistants with Relaxing ConstraintsCode1
Distilling and Retrieving Generalizable Knowledge for Robot Manipulation via Language CorrectionsCode1
DP-OPT: Make Large Language Model Your Privacy-Preserving Prompt EngineerCode1
Distillation Matters: Empowering Sequential Recommenders to Match the Performance of Large Language ModelCode1
On the Resilience of LLM-Based Multi-Agent Collaboration with Faulty AgentsCode1
Distributed LLMs and Multimodal Large Language Models: A Survey on Advances, Challenges, and Future DirectionsCode1
DualAD: Dual-Layer Planning for Reasoning in Autonomous DrivingCode1
LMEye: An Interactive Perception Network for Large Language ModelsCode1
Lshan-1.0 Technical ReportCode1
AttributionBench: How Hard is Automatic Attribution Evaluation?Code1
Show:102550
← PrevPage 22 of 122Next →

No leaderboard results yet.