SOTAVerified

Large Language Model

Papers

Showing 33763400 of 6097 papers

TitleStatusHype
Unveiling Entity-Level Unlearning for Large Language Models: A Comprehensive Analysis0
Automated radiotherapy treatment planning guided by GPT-4Vision0
Open-Vocabulary Temporal Action Localization using Multimodal Guidance0
Inferring Pluggable Types with Machine Learning0
LLM2FEA: Discover Novel Designs with Generative Evolutionary Multitasking0
Autonomous Agents for Collaborative Task under Information AsymmetryCode14
MoA: Mixture of Sparse Attention for Automatic Large Language Model CompressionCode2
GenoTEX: An LLM Agent Benchmark for Automated Gene Expression Data AnalysisCode2
A LLM-Based Ranking Method for the Evaluation of Automatic Counter-Narrative GenerationCode0
Safely Learning with Private Data: A Federated Learning Framework for Large Language ModelCode1
InternLM-Law: An Open Source Chinese Legal Large Language ModelCode1
CEBench: A Benchmarking Toolkit for the Cost-Effectiveness of LLM PipelinesCode0
LLM-A*: Large Language Model Enhanced Incremental Heuristic Search on Path PlanningCode2
SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal BehaviorsCode1
A Learn-Then-Reason Model Towards Generalization in Knowledge Base Question Answering0
MultiAgent Collaboration Attack: Investigating Adversarial Attacks in Large Language Model Collaborations via Debate0
A Large Language Model Outperforms Other Computational Approaches to the High-Throughput Phenotyping of Physician Notes0
Factual Dialogue Summarization via Learning from Large Language Models0
Advantage Alignment Algorithms0
SPL: A Socratic Playground for Learning Powered by Large Language Model0
Modeling Human Subjectivity in LLMs Using Explicit and Implicit Human Factors in Personas0
LLaSA: A Multimodal LLM for Human Activity Analysis Through Wearable and Smartphone SensorsCode1
Asynchronous Large Language Model Enhanced Planner for Autonomous DrivingCode2
Inference-Time Decontamination: Reusing Leaked Benchmarks for Large Language Model EvaluationCode0
Prism: A Framework for Decoupling and Assessing the Capabilities of VLMsCode1
Show:102550
← PrevPage 136 of 244Next →

No leaderboard results yet.