SOTAVerified

Large Language Model

Papers

Showing 10511100 of 6097 papers

TitleStatusHype
Subjective-Aligned Dataset and Metric for Text-to-Video Quality AssessmentCode1
Meta-Prompting for Automating Zero-shot Visual Recognition with LLMsCode1
Large Language Model-informed ECG Dual Attention Network for Heart Failure Risk PredictionCode1
Can We Talk Models Into Seeing the World Differently?Code1
Emergence of Social Norms in Generative Agent Societies: Principles and ArchitectureCode1
ConspEmoLLM: Conspiracy Theory Detection Using an Emotion-Based Large Language ModelCode1
TRAD: Enhancing LLM Agents with Step-Wise Thought Retrieval and Aligned DecisionCode1
Tapilot-Crossing: Benchmarking and Evolving LLMs Towards Interactive Data Analysis AgentsCode1
Generative News RecommendationCode1
Multi-modal Instruction Tuned LLMs with Fine-grained Visual PerceptionCode1
KnowPhish: Large Language Models Meet Multimodal Knowledge Graphs for Enhancing Reference-Based Phishing DetectionCode1
NoMAD-Attention: Efficient LLM Inference on CPUs Through Multiply-add-free AttentionCode1
DMoERM: Recipes of Mixture-of-Experts for Effective Reward ModelingCode1
A Cross-Modal Approach to Silent Speech with LLM-Enhanced RecognitionCode1
Generalizable Whole Slide Image Classification with Fine-Grained Visual-Semantic InteractionCode1
Analyzing and Reducing Catastrophic Forgetting in Parameter Efficient TuningCode1
Large Language Models are Learnable Planners for Long-Term RecommendationCode1
Characterizing Truthfulness in Large Language Model Generations with Local Intrinsic DimensionCode1
Grounding Language Models for Visual Entity RecognitionCode1
CogBench: a large language model walks into a psychology labCode1
MoZIP: A Multilingual Benchmark to Evaluate Large Language Models in Intellectual PropertyCode1
Cross-Modal Projection in Multimodal LLMs Doesn't Really Project Visual Attributes to Textual SpaceCode1
TV-SAM: Increasing Zero-Shot Segmentation Performance on Multimodal Medical Images Using GPT-4 Generated Descriptive Prompts Without Human AnnotationCode1
Empowering Large Language Model Agents through Action LearningCode1
Self-Retrieval: End-to-End Information Retrieval with One Large Language ModelCode1
AttributionBench: How Hard is Automatic Attribution Evaluation?Code1
LLMBind: A Unified Modality-Task Integration FrameworkCode1
RelayAttention for Efficient Large Language Model Serving with Long System PromptsCode1
SIMPLOT: Enhancing Chart Question Answering by Distilling EssentialsCode1
CriticEval: Evaluating Large Language Model as CriticCode1
Large Language Model-based Human-Agent Collaboration for Complex Task SolvingCode1
MuLan: Multimodal-LLM Agent for Progressive and Interactive Multi-Object DiffusionCode1
TRAP: Targeted Random Adversarial Prompt Honeypot for Black-Box IdentificationCode1
Generation Meets Verification: Accelerating Large Language Model Inference with Smart Parallel Auto-Correct DecodingCode1
Stealthy Attack on Large Language Model based RecommendationCode1
PreAct: Prediction Enhances Agent's Planning AbilityCode1
Benchmarking Knowledge Boundary for Large Language Models: A Different Perspective on Model EvaluationCode1
Dissecting Human and LLM PreferencesCode1
LaCo: Large Language Model Pruning via Layer CollapseCode1
Controlled Text Generation for Large Language Model with Dynamic Attribute GraphsCode1
Instruction Backdoor Attacks Against Customized LLMsCode1
PRompt Optimization in Multi-Step Tasks (PROMST): Integrating Human Feedback and Heuristic-based SamplingCode1
BreakGPT: A Large Language Model with Multi-stage Structure for Financial Breakout DetectionCode1
A RAG-Based Multi-Agent LLM System for Natural Hazard Resilience and AdaptationCode1
ChemLLM: A Chemical Large Language ModelCode1
Aya Dataset: An Open-Access Collection for Multilingual Instruction TuningCode1
Understanding the Weakness of Large Language Model Agents within a Complex Android EnvironmentCode1
The Quantified Boolean Bayesian Network: Theory and Experiments with a Logical Graphical ModelCode1
ApiQ: Finetuning of 2-Bit Quantized Large Language ModelCode1
Large Language Model Distilling Medication Recommendation ModelCode1
Show:102550
← PrevPage 22 of 122Next →

No leaderboard results yet.