SOTAVerified

Large Language Model

Papers

Showing 301350 of 6097 papers

TitleStatusHype
Customization Assistant for Text-to-image GenerationCode2
CVE-Bench: A Benchmark for AI Agents' Ability to Exploit Real-World Web Application VulnerabilitiesCode2
Cross-Tokenizer Distillation via Approximate Likelihood MatchingCode2
LLM3:Large Language Model-based Task and Motion Planning with Motion Failure ReasoningCode2
CritiqueLLM: Towards an Informative Critique Generation Model for Evaluation of Large Language Model GenerationCode2
Accelerating Large Language Model Decoding with Speculative SamplingCode2
LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal ModelsCode2
CyberGym: Evaluating AI Agents' Cybersecurity Capabilities with Real-World Vulnerabilities at ScaleCode2
Critique-out-Loud Reward ModelsCode2
LLaMP: Large Language Model Made Powerful for High-fidelity Materials Knowledge Retrieval and DistillationCode2
LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal UnderstandingCode2
LLM-A*: Large Language Model Enhanced Incremental Heuristic Search on Path PlanningCode2
CPRet: A Dataset, Benchmark, and Model for Retrieval in Competitive ProgrammingCode2
CrackSQL: A Hybrid SQL Dialect Translation System Powered by Large Language ModelsCode2
Listen, Think, and UnderstandCode2
LLaMEA: A Large Language Model Evolutionary Algorithm for Automatically Generating MetaheuristicsCode2
Lion: Adversarial Distillation of Proprietary Large Language ModelsCode2
OptMetaOpenFOAM: Large Language Model Driven Chain of Thought for Sensitivity Analysis and Parameter Optimization based on CFDCode2
LION: Empowering Multimodal Large Language Model with Dual-Level Visual KnowledgeCode2
LLaMEA-BO: A Large Language Model Evolutionary Algorithm for Automatically Generating Bayesian Optimization AlgorithmsCode2
LLM-Assisted Light: Leveraging Large Language Model Capabilities for Human-Mimetic Traffic Signal Control in Complex Urban EnvironmentsCode2
Libra: Building Decoupled Vision System on Large Language ModelsCode2
LifeGPT: Topology-Agnostic Generative Pretrained Transformer Model for Cellular AutomataCode2
Control Industrial Automation System with Large Language Model AgentsCode2
LHRS-Bot-Nova: Improved Multimodal Large Language Model for Remote Sensing Vision-Language InterpretationCode2
LifelongAgentBench: Evaluating LLM Agents as Lifelong LearnersCode2
A Bounding Box is Worth One Token: Interleaving Layout and Text in a Large Language Model for Document UnderstandingCode2
LaVy: Vietnamese Multimodal Large Language ModelCode2
Learning What Reinforcement Learning Can't: Interleaved Online Fine-Tuning for Hardest QuestionsCode2
Large Language Model with Region-guided Referring and Grounding for CT Report GenerationCode2
Large Language Models Play StarCraft II: Benchmarks and A Chain of Summarization ApproachCode2
Large Language Model Safety: A Holistic SurveyCode2
Compiler Optimization via LLM Reasoning for Efficient Model ServingCode2
Large language models can be zero-shot anomaly detectors for time series?Code2
Large Scale Transfer Learning for Tabular Data via Language ModelingCode2
Empirical Asset Pricing with Large Language Model AgentsCode2
DreamLIP: Language-Image Pre-training with Long CaptionsCode2
An Egocentric Vision-Language Model based Portable Real-time Smart AssistantCode2
DeliLaw: A Chinese Legal Counselling System Based on a Large Language ModelCode2
Confucius3-Math: A Lightweight High-Performance Reasoning LLM for Chinese K-12 Mathematics LearningCode2
Large Language Model Psychometrics: A Systematic Review of Evaluation, Validation, and EnhancementCode2
L-AutoDA: Leveraging Large Language Models for Automated Decision-based Adversarial AttacksCode2
LinVT: Empower Your Image-level Large Language Model to Understand VideosCode2
LLM Defenses Are Not Robust to Multi-Turn Human Jailbreaks YetCode2
Collaborative Expert LLMs Guided Multi-Objective Molecular OptimizationCode2
CoIN: A Benchmark of Continual Instruction tuNing for Multimodel Large Language ModelCode2
Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile InstructionsCode2
4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language ModelsCode2
Language Models Can Improve Event Prediction by Few-Shot Abductive ReasoningCode2
Language Models can Solve Computer TasksCode2
Show:102550
← PrevPage 7 of 122Next →

No leaderboard results yet.