SOTAVerified

Large Language Model

Papers

Showing 401450 of 6097 papers

TitleStatusHype
L-AutoDA: Leveraging Large Language Models for Automated Decision-based Adversarial AttacksCode2
Learning What Reinforcement Learning Can't: Interleaved Online Fine-Tuning for Hardest QuestionsCode2
LION: Empowering Multimodal Large Language Model with Dual-Level Visual KnowledgeCode2
Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You WantCode2
Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative DecodingCode2
Large Language Model Safety: A Holistic SurveyCode2
biorecap: an R package for summarizing bioRxiv preprints with a local LLMCode2
DreamLIP: Language-Image Pre-training with Long CaptionsCode2
Large Language Model Psychometrics: A Systematic Review of Evaluation, Validation, and EnhancementCode2
Large language models can be zero-shot anomaly detectors for time series?Code2
An Empirical Evaluation of Using Large Language Models for Automated Unit Test GenerationCode2
Diff-eRank: A Novel Rank-Based Metric for Evaluating Large Language ModelsCode2
AutoMind: Adaptive Knowledgeable Agent for Automated Data ScienceCode2
DrafterBench: Benchmarking Large Language Models for Tasks Automation in Civil EngineeringCode2
Large Language Model Guided Tree-of-ThoughtCode2
CLIP-MoE: Towards Building Mixture of Experts for CLIP with Diversified Multiplet UpcyclingCode2
Large Language Models Play StarCraft II: Benchmarks and A Chain of Summarization ApproachCode2
Beyond Text: Frozen Large Language Models in Visual Signal ComprehensionCode2
ClinicalGPT-R1: Pushing reasoning capability of generalist disease diagnosis with large language modelCode2
ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning EngineeringCode2
MLR-Copilot: Autonomous Machine Learning Research based on Large Language Models AgentsCode2
mmE5: Improving Multimodal Multilingual Embeddings via High-quality Synthetic DataCode2
BianCang: A Traditional Chinese Medicine Large Language ModelCode2
Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile InstructionsCode2
Alphazero-like Tree-Search can Guide Large Language Model Decoding and TrainingCode2
Monitoring AI-Modified Content at Scale: A Case Study on the Impact of ChatGPT on AI Conference Peer ReviewsCode2
DOCBENCH: A Benchmark for Evaluating LLM-based Document Reading SystemsCode2
LLMEmb: Large Language Model Can Be a Good Embedding Generator for Sequential RecommendationCode2
DivPrune: Diversity-based Visual Token Pruning for Large Multimodal ModelsCode2
Language Models can Solve Computer TasksCode2
Large Language Model Enhanced Recommender Systems: A SurveyCode2
Large Language Model with Region-guided Referring and Grounding for CT Report GenerationCode2
Muse: Text-To-Image Generation via Masked Generative TransformersCode2
Music Understanding LLaMA: Advancing Text-to-Music Generation with Question Answering and CaptioningCode2
KnowCoder: Coding Structured Knowledge into LLMs for Universal Information ExtractionCode2
CoIN: A Benchmark of Continual Instruction tuNing for Multimodel Large Language ModelCode2
DISC-LawLLM: Fine-tuning Large Language Models for Intelligent Legal ServicesCode2
KoSBi: A Dataset for Mitigating Social Bias Risks Towards Safer Large Language Model ApplicationCode2
DISC-FinLLM: A Chinese Financial Large Language Model based on Multiple Experts Fine-tuningCode2
Discovering Preference Optimization Algorithms with and for Large Language ModelsCode2
KET-RAG: A Cost-Efficient Multi-Granular Indexing Framework for Graph-RAGCode2
CoLLaVO: Crayon Large Language and Vision mOdelCode2
KICGPT: Large Language Model with Knowledge in Context for Knowledge Graph CompletionCode2
ExpertPrompting: Instructing Large Language Models to be Distinguished ExpertsCode2
Direct Preference Optimization of Video Large Multimodal Models from Language Model RewardCode2
Keeping Yourself is Important in Downstream Tuning Multimodal Large Language ModelCode2
Iteration of Thought: Leveraging Inner Dialogue for Autonomous Large Language Model ReasoningCode2
Alignment faking in large language modelsCode2
Jailbreaking Attack against Multimodal Large Language ModelCode2
Introducing Visual Perception Token into Multimodal Large Language ModelCode2
Show:102550
← PrevPage 9 of 122Next →

No leaderboard results yet.