SOTAVerified

TriviaQA

Papers

Showing 2650 of 124 papers

TitleStatusHype
Retaining Key Information under High Compression Ratios: Query-Guided Compressor for LLMsCode1
LACIE: Listener-Aware Finetuning for Confidence Calibration in Large Language ModelsCode0
Accurate and Nuanced Open-QA Evaluation Through Textual EntailmentCode0
LayerSkip: Enabling Early Exit Inference and Self-Speculative DecodingCode3
KS-LLM: Knowledge Selection of Large Language Models with Evidence Document for Question Answering0
Mitigating LLM Hallucinations via Conformal Abstention0
Multi-Granularity Guided Fusion-in-DecoderCode1
FIT-RAG: Black-Box RAG with Factual Information and Token Reduction0
Unfamiliar Finetuning Examples Control How Language Models HallucinateCode1
Harnessing Multi-Role Capabilities of Large Language Models for Open-Domain Question AnsweringCode1
Researchy Questions: A Dataset of Multi-Perspective, Decompositional Questions for LLM Web Agents0
Fine-Grained Self-Endorsement Improves Factuality and Reasoning0
The Generative AI Paradox on Evaluation: What It Can Solve, It May Not Evaluate0
Attendre: Wait To Attend By Retrieval With Evicted Queries in Memory-Based Transformers for Long Context Processing0
Efficient Transformer Knowledge Distillation: A Performance Review0
Noisy Pair Corrector for Dense Retrieval0
A Bias-Variance-Covariance Decomposition of Kernel Scores for Generative ModelsCode0
Sorted LLaMA: Unlocking the Potential of Intermediate Layers of Large Language Models for Dynamic Inference0
Generator-Retriever-Generator Approach for Open-Domain Question AnsweringCode1
When to Read Documents or QA History: On Unified and Selective Open-domain QA0
Exploiting Abstract Meaning Representation for Open-Domain Question AnsweringCode1
RFiD: Towards Rational Fusion-in-Decoder for Open-Domain Question AnsweringCode0
Just Ask for Calibration: Strategies for Eliciting Calibrated Confidence Scores from Language Models Fine-Tuned with Human FeedbackCode0
Allies: Prompting Large Language Model with Beam SearchCode0
CRITIC: Large Language Models Can Self-Correct with Tool-Interactive CritiquingCode0
Show:102550
← PrevPage 2 of 5Next →

No leaderboard results yet.