SOTAVerified

Large Language Model

Papers

Showing 301-350 of 6097 papers

Title | Status | Hype
ZClip: Adaptive Spike Mitigation for LLM Pre-Training | Code | 2
CrackSQL: A Hybrid SQL Dialect Translation System Powered by Large Language Models | Code | 2
Chapter-Llama: Efficient Chaptering in Hour-Long Videos with LLMs | Code | 2
TeleAntiFraud-28k: An Audio-Text Slow-Thinking Dataset for Telecom Fraud Detection | Code | 2
Cross-Tokenizer Distillation via Approximate Likelihood Matching | Code | 2
CVE-Bench: A Benchmark for AI Agents' Ability to Exploit Real-World Web Application Vulnerabilities | Code | 2
Modifying Large Language Model Post-Training for Diverse Creative Writing | Code | 2
Generative Modeling for Mathematical Discovery | Code | 2
4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models | Code | 2
OR-LLM-Agent: Automating Modeling and Solving of Operations Research Optimization Problem with Reasoning Large Language Model | Code | 2
A Neural Symbolic Model for Space Physics | Code | 2
Referring to Any Person | Code | 2
Seedream 2.0: A Native Chinese-English Bilingual Image Generation Foundation Model | Code | 2
Next Token Is Enough: Realistic Image Quality and Aesthetic Scoring with Multimodal Large Language Model | Code | 2
A Survey of Large Language Model Empowered Agents for Recommendation and Search: Towards Next-Generation Information Retrieval | Code | 2
Keeping Yourself is Important in Downstream Tuning Multimodal Large Language Model | Code | 2
An Egocentric Vision-Language Model based Portable Real-time Smart Assistant | Code | 2
Collaborative Expert LLMs Guided Multi-Objective Molecular Optimization | Code | 2
DivPrune: Diversity-based Visual Token Pruning for Large Multimodal Models | Code | 2
OptMetaOpenFOAM: Large Language Model Driven Chain of Thought for Sensitivity Analysis and Parameter Optimization based on CFD | Code | 2
AgentSociety Challenge: Designing LLM Agents for User Modeling and Recommendation on Web Platforms | Code | 2
Introducing Visual Perception Token into Multimodal Large Language Model | Code | 2
A Training-free LLM-based Approach to General Chinese Character Error Correction | Code | 2
TritonBench: Benchmarking Large Language Model Capabilities for Generating Triton Operators | Code | 2
DataSciBench: An LLM Agent Benchmark for Data Science | Code | 2
UXAgent: An LLM Agent-Based Usability Testing Framework for Web Design | Code | 2
Hierarchical Expert Prompt for Large-Language-Model: An Approach Defeat Elite AI in TextStarCraft II for the First Time | Code | 2
KET-RAG: A Cost-Efficient Multi-Granular Indexing Framework for Graph-RAG | Code | 2
mmE5: Improving Multimodal Multilingual Embeddings via High-quality Synthetic Data | Code | 2
ViLa-MIL: Dual-scale Vision-Language Multiple Instance Learning for Whole Slide Image Classification | Code | 2
WaferLLM: Large Language Model Inference at Wafer Scale | Code | 2
ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization | Code | 2
Speculative Prefill: Turbocharging TTFT with Lightweight and Training-Free Token Importance Estimation | Code | 2
Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUs | Code | 2
Reviving The Classics: Active Reward Modeling in Large Language Model Alignment | Code | 2
MetaOpenFOAM 2.0: Large Language Model Driven Chain of Thought for Automating CFD Simulation and Post-Processing | Code | 2
SafeRAG: Benchmarking Security in Retrieval-Augmented Generation of Large Language Model | Code | 2
Fast Think-on-Graph: Wider, Deeper and Faster Reasoning of Large Language Model on Knowledge Graph | Code | 2
OstQuant: Refining Large Language Model Quantization with Orthogonal and Scaling Transformations for Better Distribution Fitting | Code | 2
Monte Carlo Tree Search for Comprehensive Exploration in LLM-Based Automatic Heuristic Design | Code | 2
LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding | Code | 2
Parameter-Inverted Image Pyramid Networks for Visual Perception and Multimodal Understanding | Code | 2
ChartCoder: Advancing Multimodal Large Language Model for Chart-to-Code Generation | Code | 2
OpenOmni: Large Language Models Pivot Zero-shot Omnimodal Alignment across Language with Real-time Self-Aware Emotional Speech Synthesis | Code | 2
FLAME: Financial Large-Language Model Assessment and Metrics Evaluation | Code | 2
Natural Language Fine-Tuning | Code | 2
Large Language Model Safety: A Holistic Survey | Code | 2
Large Language Model Enhanced Recommender Systems: A Survey | Code | 2
Alignment faking in large language models | Code | 2
LLM-RG4: Flexible and Factual Radiology Report Generation across Diverse Input Contexts | Code | 2
