SOTAVerified

Large Language Model

Papers

Showing 801850 of 6097 papers

TitleStatusHype
Robust Planning with Compound LLM Architectures: An LLM-Modulo ApproachCode1
Leveraging MLLM Embeddings and Attribute Smoothing for Compositional Zero-Shot LearningCode1
TSINR: Capturing Temporal Continuity via Implicit Neural Representations for Time Series Anomaly DetectionCode1
Multi-Stage Vision Token Dropping: Towards Efficient Multimodal Large Language ModelCode1
Language Models as Causal Effect GeneratorsCode1
LLM-Neo: Parameter Efficient Knowledge Distillation for Large Language ModelsCode1
Learning the rules of peptide self-assembly through data mining with large language modelsCode1
AutoProteinEngine: A Large Language Model Driven Agent Framework for Multimodal AutoML in Protein EngineeringCode1
Enabling LLM Knowledge Analysis via Extensive MaterializationCode1
Zebra-Llama: A Context-Aware Large Language Model for Democratizing Rare Disease KnowledgeCode1
Multi-expert Prompting Improves Reliability, Safety, and Usefulness of Large Language ModelsCode1
Instruction-Tuning Llama-3-8B Excels in City-Scale Mobility PredictionCode1
LLaMo: Large Language Model-based Molecular Graph AssistantCode1
EZ-HOI: VLM Adaptation via Guided Prompt Learning for Zero-Shot HOI DetectionCode1
Online Intrinsic Rewards for Decision Making Agents from Large Language Model FeedbackCode1
Real-Time Personalization for LLM-based Recommendation with Customized In-Context LearningCode1
SG-Bench: Evaluating LLM Safety Generalization Across Diverse Tasks and Prompt TypesCode1
LLMCBench: Benchmarking Large Language Model Compression for Efficient DeploymentCode1
TrajAgent: An Agent Framework for Unified Trajectory ModellingCode1
Agentic Feedback Loop Modeling Improves Recommendation and User SimulationCode1
GCoder: Improving Large Language Model for Generalized Graph Problem SolvingCode1
GraphTeam: Facilitating Large Language Model-based Graph Analysis via Multi-Agent CollaborationCode1
Meaning Typed Prompting: A Technique for Efficient, Reliable Structured Output GenerationCode1
Math Neurosurgery: Isolating Language Models' Math Reasoning Abilities Using Only Forward PassesCode1
Scalable Influence and Fact Tracing for Large Language Model PretrainingCode1
Automated Spinal MRI Labelling from Reports Using a Large Language ModelCode1
A Realistic Threat Model for Large Language Model JailbreaksCode1
Residual vector quantization for KV cache compression in large language modelCode1
Paths-over-Graph: Knowledge Graph Empowered Large Language Model ReasoningCode1
FIRE: Fact-checking with Iterative Retrieval and VerificationCode1
MobA: Multifaceted Memory-Enhanced Adaptive Planning for Efficient Mobile Task AutomationCode1
MIRAGE-Bench: Automatic Multilingual Benchmark Arena for Retrieval-Augmented Generation SystemsCode1
Search Engines in an AI Era: The False Promise of Factual and Verifiable Source-Cited ResponsesCode1
GaVaMoE: Gaussian-Variational Gated Mixture of Experts for Explainable RecommendationCode1
HARDMath: A Benchmark Dataset for Challenging Problems in Applied MathematicsCode1
PoisonBench: Assessing Large Language Model Vulnerability to Data PoisoningCode1
Retraining-Free Merging of Sparse MoE via Hierarchical ClusteringCode1
PEAR: A Robust and Flexible Automation Framework for Ptychography Enabled by Multiple Large Language Model AgentsCode1
Hespi: A pipeline for automatically detecting information from hebarium specimen sheetsCode1
Divide and Translate: Compositional First-Order Logic Translation and Verification for Complex Logical ReasoningCode1
OneNet: A Fine-Tuning Free Framework for Few-Shot Entity Linking via Large Language Model PromptingCode1
Simplicity Prevails: Rethinking Negative Preference Optimization for LLM UnlearningCode1
AuditWen:An Open-Source Large Language Model for AuditCode1
ImProver: Agent-Based Automated Proof OptimizationCode1
Large Language Model Inference Acceleration: A Comprehensive Hardware PerspectiveCode1
Enriching Music Descriptions with a Finetuned-LLM and Metadata for Text-to-Music RetrievalCode1
You Know What I'm Saying: Jailbreak Attack via Implicit ReferenceCode1
ColaCare: Enhancing Electronic Health Record Modeling through Large Language Model-Driven Multi-Agent CollaborationCode1
Locret: Enhancing Eviction in Long-Context LLM Inference with Trained Retaining Heads on Consumer-Grade DevicesCode1
Basis Sharing: Cross-Layer Parameter Sharing for Large Language Model CompressionCode1
Show:102550
← PrevPage 17 of 122Next →

No leaderboard results yet.