SOTAVerified

Document Summarization

Automatic Document Summarization is the task of rewriting a document into its shorter form while still retaining its important content. The most popular two paradigms are extractive approaches and abstractive approaches. Extractive approaches generate summaries by extracting parts of the original document (usually sentences), while abstractive methods may generate new words or phrases which are not in the original document.

Source: HIBERT: Document Level Pre-training of Hierarchical Bidirectional Transformers for Document Summarization

Papers

Showing 150 of 760 papers

TitleStatusHype
GenerationPrograms: Fine-grained Attribution with Executable ProgramsCode0
Arctic Long Sequence Training: Scalable And Efficient Training For Multi-Million Token SequencesCode3
Improving Fairness of Large Language Models in Multi-document SummarizationCode0
ARC: Argument Representation and Coverage Analysis for Zero-Shot Long Document Summarization with Instruction Following LLMs0
Ask, Retrieve, Summarize: A Modular Pipeline for Scientific Literature SummarizationCode0
Hallucinate at the Last in Long Response Generation: A Case Study on Long Document Summarization0
Document Attribution: Examining Citation Relationships using Large Language Models0
A Unified Retrieval Framework with Document Ranking and EDU Filtering for Multi-document Summarization0
Estimating Optimal Context Length for Hybrid Retrieval-augmented Multi-document SummarizationCode0
Align to Structure: Aligning Large Language Models with Structural InformationCode0
M-DocSum: Do LVLMs Genuinely Comprehend Interleaved Image-Text in Document Summarization?Code0
Can one size fit all?: Measuring Failure in Multi-Document Summarization Domain Transfer0
UniHDSA: A Unified Relation Prediction Approach for Hierarchical Document Structure AnalysisCode2
Agent-Enhanced Large Language Models for Researching Political InstitutionsCode0
A Hybrid Architecture with Efficient Fine Tuning for Abstractive Patent Document Summarization0
Mitigating Preference Hacking in Policy Optimization with Pessimism0
Multi2: Multi-Agent Test-Time Scalable Framework for Multi-Document Processing0
LAG: LLM agents for Leaderboard Auto Generation on Demanding0
LM Agents for Coordinating Multi-User Information Gathering0
Exploring Synaptic Resonance in Large Language Models: A Novel Approach to Contextual Memory Integration0
Scaling Multi-Document Event Summarization: Evaluating Compression vs. Full-Text ApproachesCode0
Discourse-Driven Evaluation: Unveiling Factual Inconsistency in Long Document Summarization0
HyGen: Efficient LLM Serving via Elastic Online-Offline Request Co-location0
Progressive Document-level Text Simplification via Large Language Models0
End-to-End Long Document Summarization using Gradient Caching0
A Rhetorical Relations-Based Framework for Tailored Multimedia Document Summarization0
Precise Length Control in Large Language Models0
EventSum: A Large-Scale Event-Centric Summarization Dataset for Chinese Multi-News Documents0
Coverage-based Fairness in Multi-document SummarizationCode0
Mitigating Knowledge Conflicts in Language Model-Driven Question Answering0
Fair Summarization: Bridging Quality and Diversity in Extractive SummariesCode0
What is Wrong with Perplexity for Long-context Language Modeling?Code2
Hybrid Deep Learning for Legal Text Analysis: Predicting Punishment Durations in Indonesian Court Rulings0
Natural Language Processing for the Legal Domain: A Survey of Tasks, Datasets, Models, and Challenges0
Optimizing the role of human evaluation in LLM-based spoken document summarization systems0
DiscoGraMS: Enhancing Movie Screen-Play Summarization using Movie Character-Aware Discourse Graph0
From Single to Multi: How LLMs Hallucinate in Multi-Document SummarizationCode0
CCSBench: Evaluating Compositional Controllability in LLMs for Scientific Document Summarization0
PublicHearingBR: A Brazilian Portuguese Dataset of Public Hearing Transcripts for Summarization of Long Documents0
A Novel LLM-based Two-stage Summarization Approach for Long Dialogues0
GlobeSumm: A Challenging Benchmark Towards Unifying Multi-lingual, Cross-lingual and Multi-document News SummarizationCode0
L-CiteEval: Do Long-Context Models Truly Leverage Context for Responding?Code1
ConServe: Harvesting GPUs for Low-Latency and High-Throughput Large Language Model Serving0
Leveraging Long-Context Large Language Models for Multi-Document Understanding and Summarization in Enterprise Applications0
BERT-VBD: Vietnamese Multi-Document Summarization Framework0
E2LLM: Encoder Elongated Large Language Models for Long-Context Understanding and Reasoning0
Abstractive Text Summarization: State of the Art, Challenges, and Improvements0
SurveySum: A Dataset for Summarizing Multiple Scientific Articles into a Survey Section0
Biomedical Large Languages Models Seem not to be Superior to Generalist Models on Unseen Medical Data0
Preference-Guided Reflective Sampling for Aligning Language ModelsCode0
Show:102550
← PrevPage 1 of 16Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1HAT-BARTROUGE-144.48Unverified
2MatchSum (RoBERTa-base)ROUGE-144.41Unverified
3Hie-BARTROUGE-144.35Unverified
4MatchSum (BERT-base)ROUGE-144.22Unverified
5BertSumExtROUGE-143.85Unverified
6BigBird-PegasusROUGE-143.84Unverified
7T5-11BROUGE-143.52Unverified
8BERTSUM+TransformerROUGE-143.25Unverified
9UniLM (Abstractive Summarization)ROUGE-143.08Unverified
10Selector+Pointer GeneratorROUGE-141.72Unverified
#ModelMetricClaimedVerifiedStatus
1LexRank (query: step title)ROUGE-139.6Unverified
2CES (query: step title)ROUGE-139.3Unverified
3CES (query: step + method titles)ROUGE-138.3Unverified
4LexRank (query: step + method titles)ROUGE-138.2Unverified
5CES (query: step + method + article titles)ROUGE-137Unverified
6LexRank (query: step + method + article titles)ROUGE-136.3Unverified
7GreedyRel (query: step + method titles)ROUGE-130.3Unverified
8GreedyRel (query: step title)ROUGE-130.1Unverified
9BM25-HierSumm (query: step + method titles)ROUGE-123Unverified
10BM25-HierSumm (query: step title)ROUGE-122.3Unverified
#ModelMetricClaimedVerifiedStatus
1LexRank (query: method + article + steps titles)ROUGE-153.5Unverified
2CES (query: method + article + steps titles)ROUGE-152.2Unverified
3GreedyRel (query: method + article + steps titles)ROUGE-148.6Unverified
4CES (query: method title)ROUGE-148.4Unverified
5CES (query: method + article titles)ROUGE-148.3Unverified
6LexRank (query: method title)ROUGE-147.7Unverified
7LexRank (query: method + article titles)ROUGE-147.1Unverified
8GreedyRel (query: method title)ROUGE-143.4Unverified
9GreedyRel (query: method + article titles)ROUGE-142.3Unverified
#ModelMetricClaimedVerifiedStatus
1DeepPyramidionROUGE-147.15Unverified
#ModelMetricClaimedVerifiedStatus
1DeepPyramidionRouge-219.99Unverified
#ModelMetricClaimedVerifiedStatus
1BigBird-PegasusROUGE-147.12Unverified
#ModelMetricClaimedVerifiedStatus
1DOCmT5Rouge-L31.37Unverified