SOTAVerified

Large Language Model

Papers

Showing 3001-3050 of 6097 papers

| Title | Status | Hype |
|---|---|---|
| AVSS: Layer Importance Evaluation in Large Language Models via Activation Variance-Sparsity Analysis | | 0 |
| Awes, Laws, and Flaws From Today's LLM Research | | 0 |
| AXOLOTL: Fairness through Assisted Self-Debiasing of Large Language Model Outputs | | 0 |
| Aya 23: Open Weight Releases to Further Multilingual Progress | | 0 |
| BadRobot: Jailbreaking Embodied LLMs in the Physical World | | 0 |
| BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Competitive Large Language Model Baseline | | 0 |
| BaKlaVa -- Budgeted Allocation of KV cache for Long-context Inference | | 0 |
| Balancing Knowledge Delivery and Emotional Comfort in Healthcare Conversational Systems | | 0 |
| Balancing Performance and Efficiency: A Multimodal Large Language Model Pruning Method based Image Text Interaction | | 0 |
| BAMBI: Developing Baby Language Models for Italian | | 0 |
| BasedAI: A decentralized P2P network for Zero Knowledge Large Language Models (ZK-LLMs) | | 0 |
| BASES: Large-scale Web Search User Simulation with Large Language Model based Agents | | 0 |
| BatGPT-Chem: A Foundation Large Model For Retrosynthesis Prediction | | 0 |
| BAT: Learning to Reason about Spatial Sounds with Large Language Models | | 0 |
| Bayesian inference to improve quality of Retrieval Augmented Generation | | 0 |
| Bayesian Reward Models for LLM Alignment | | 0 |
| BEAF: Observing BEfore-AFter Changes to Evaluate Hallucination in Vision-language Models | | 0 |
| Allies: Prompting Large Language Model with Beam Search | | 0 |
| Behaviour Space Analysis of LLM-driven Meta-heuristic Discovery | | 0 |
| BenchmarkCards: Large Language Model and Risk Reporting | | 0 |
| Benchmarking Automatic Speech Recognition coupled LLM Modules for Medical Diagnostics | | 0 |
| Benchmarking Large Language Model Capabilities for Conditional Generation | | 0 |
| Benchmarking Large Language Models with Integer Sequence Generation Tasks | | 0 |
| Benchmarking Large Language Model Volatility | | 0 |
| Benchmarking LLM for Code Smells Detection: OpenAI GPT-4.0 vs DeepSeek-V3 | | 0 |
| Benchmarking Practices in LLM-driven Offensive Security: Testbeds, Metrics, and Experiment Design | | 0 |
| bert2BERT: Towards Reusable Pretrained Language Models | | 0 |
| BeSimulator: A Large Language Model Powered Text-based Behavior Simulator | | 0 |
| Better Process Supervision with Bi-directional Rewarding Signals | | 0 |
| Better Think with Tables: Leveraging Tables to Enhance Large Language Model Comprehension | | 0 |
| BEV-TSR: Text-Scene Retrieval in BEV Space for Autonomous Driving | | 0 |
| Beyond Bare Queries: Open-Vocabulary Object Grounding with 3D Scene Graph | | 0 |
| Beyond ChatBots: ExploreLLM for Structured Thoughts and Personalized Model Responses | | 0 |
| Beyond Chinchilla-Optimal: Accounting for Inference in Language Model Scaling Laws | | 0 |
| Beyond Correlation: Towards Causal Large Language Model Agents in Biomedicine | | 0 |
| Beyond Demographics: Aligning Role-playing LLM-based Agents Using Human Belief Networks | | 0 |
| Beyond Empathy: Integrating Diagnostic and Therapeutic Reasoning with Large Language Models for Mental Health Counseling | | 0 |
| Beyond Exponential Decay: Rethinking Error Accumulation in Large Language Models | | 0 |
| Beyond Forecasting: Compositional Time Series Reasoning for End-to-End Task Execution | | 0 |
| Beyond Frameworks: Unpacking Collaboration Strategies in Multi-Agent Systems | | 0 |
| Beyond General Prompts: Automated Prompt Refinement using Contrastive Class Alignment Scores for Disambiguating Objects in Vision-Language Models | | 0 |
| Beyond Keywords: A Context-based Hybrid Approach to Mining Ethical Concern-related App Reviews | | 0 |
| Beyond LLMs: A Linguistic Approach to Causal Graph Generation from Narrative Texts | | 0 |
| Beyond Metrics: A Critical Analysis of the Variability in Large Language Model Evaluation Frameworks | | 0 |
| Domain Specialization as the Key to Make Large Language Models Disruptive: A Comprehensive Survey | | 0 |
| Beyond Relevant Documents: A Knowledge-Intensive Approach for Query-Focused Summarization using Large Language Models | | 0 |
| Beyond Retrieval: Joint Supervision and Multimodal Document Ranking for Textbook Question Answering | | 0 |
| Not All Options Are Created Equal: Textual Option Weighting for Token-Efficient LLM-Based Knowledge Tracing | | 0 |
| Beyond Segmentation: Road Network Generation with Multi-Modal LLMs | | 0 |
| Beyond Self-Consistency: Ensemble Reasoning Boosts Consistency and Accuracy of LLMs in Cancer Staging | | 0 |
