SOTAVerified

Large Language Model

Papers

Showing 12511300 of 6097 papers

TitleStatusHype
DP-OPT: Make Large Language Model Your Privacy-Preserving Prompt EngineerCode1
Distributed LLMs and Multimodal Large Language Models: A Survey on Advances, Challenges, and Future DirectionsCode1
DiveR-CT: Diversity-enhanced Red Teaming Large Language Model Assistants with Relaxing ConstraintsCode1
Divide and Translate: Compositional First-Order Logic Translation and Verification for Complex Logical ReasoningCode1
MM-Instruct: Generated Visual Instructions for Large Multimodal Model AlignmentCode1
Monte Carlo Thought Search: Large Language Model Querying for Complex Scientific Reasoning in Catalyst DesignCode1
CLEFT: Language-Image Contrastive Learning with Efficient Large Language Model and Prompt Fine-TuningCode1
Context-aware Decoding Reduces Hallucination in Query-focused SummarizationCode1
Dissecting Human and LLM PreferencesCode1
PEAR: A Robust and Flexible Automation Framework for Ptychography Enabled by Multiple Large Language Model AgentsCode1
PeFAD: A Parameter-Efficient Federated Framework for Time Series Anomaly DetectionCode1
Distillation Matters: Empowering Sequential Recommenders to Match the Performance of Large Language ModelCode1
Can ChatGPT replace StackOverflow? A Study on Robustness and Reliability of Large Language Model Code GenerationCode1
Distilling and Retrieving Generalizable Knowledge for Robot Manipulation via Language CorrectionsCode1
Agentic Feedback Loop Modeling Improves Recommendation and User SimulationCode1
MISR: Measuring Instrumental Self-Reasoning in Frontier ModelsCode1
Development and bilingual evaluation of Japanese medical large language model within reasonably low computational resourcesCode1
A Study of Generative Large Language Model for Medical Research and HealthcareCode1
Foundation Models Meet Imbalanced Single-Cell Data When Learning Cell Type AnnotationsCode1
CityNavAgent: Aerial Vision-and-Language Navigation with Hierarchical Semantic Planning and Global MemoryCode1
MIRAGE-Bench: Automatic Multilingual Benchmark Arena for Retrieval-Augmented Generation SystemsCode1
DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language ModelCode1
MiniGPT-Pancreas: Multimodal Large Language Model for Pancreas Cancer Classification and DetectionCode1
PIVOINE: Instruction Tuning for Open-world Information ExtractionCode1
AstroAgents: A Multi-Agent AI for Hypothesis Generation from Mass Spectrometry DataCode1
CityBench: Evaluating the Capabilities of Large Language Models for Urban TasksCode1
Detecting Hallucinations in Large Language Model Generation: A Token Probability ApproachCode1
MMIDR: Teaching Large Language Model to Interpret Multimodal Misinformation via Knowledge DistillationCode1
Multi-Modal Classifiers for Open-Vocabulary Object DetectionCode1
CITER: Collaborative Inference for Efficient Large Language Model Decoding with Token-Level RoutingCode1
Citekit: A Modular Toolkit for Large Language Model Citation GenerationCode1
Explaining Relationships Between Scientific DocumentsCode1
CIPHER: Cybersecurity Intelligent Penetration-testing Helper for Ethical ResearcherCode1
Aligning LLM Agents by Learning Latent Preference from User EditsCode1
Democratizing Reasoning Ability: Tailored Learning from Large Language ModelCode1
Controllable Dialogue Simulation with In-Context LearningCode1
DesCo: Learning Object Recognition with Rich Language DescriptionsCode1
PoisonBench: Assessing Large Language Model Vulnerability to Data PoisoningCode1
DefenderBench: A Toolkit for Evaluating Language Agents in Cybersecurity EnvironmentsCode1
A Comprehensive Evaluation of Contemporary ML-Based Solvers for Combinatorial OptimizationCode1
ASSISTGUI: Task-Oriented Desktop Graphical User Interface AutomationCode1
GenerateCT: Text-Conditional Generation of 3D Chest CT VolumesCode1
Generating Code World Models with Large Language Models Guided by Monte Carlo Tree SearchCode1
PRD: Peer Rank and Discussion Improve Large Language Model based EvaluationsCode1
MF-LLM: Simulating Population Decision Dynamics via a Mean-Field Large Language Model FrameworkCode1
Working Memory Capacity of ChatGPT: An Empirical StudyCode1
Adapting Vision-Language Foundation Model for Next Generation Medical Ultrasound Image AnalysisCode1
GIST: Generating Image-Specific Text for Fine-grained Object ClassificationCode1
Dataflow Analysis-Inspired Deep Learning for Efficient Vulnerability DetectionCode1
ChiMed-GPT: A Chinese Medical Large Language Model with Full Training Regime and Better Alignment to Human PreferencesCode1
Show:102550
← PrevPage 26 of 122Next →

No leaderboard results yet.