SOTAVerified

Chunking

Chunking, also known as shallow parsing, identifies continuous spans of tokens that form syntactic units such as noun phrases or verb phrases.

Example:

| Vinken | , | 61 | years | old | | --- | ---| --- | --- | --- | | B-NLP| I-NP | I-NP | I-NP | I-NP |

Papers

Showing 76100 of 447 papers

TitleStatusHype
EPIC: Efficient Position-Independent Caching for Serving Large Language Models0
Action abstractions for amortized sampling0
Is Semantic Chunking Worth the Computational Cost?0
CoFE-RAG: A Comprehensive Full-chain Evaluation Framework for Retrieval-Augmented Generation with Enhanced Data DiversityCode1
Meta-Chunking: Learning Text Segmentation and Semantic Completion via Logical PerceptionCode3
SEER: Self-Aligned Evidence Extraction for Retrieval-Augmented GenerationCode0
ChuLo: Chunk-Level Key Information Representation for Long Document ProcessingCode0
Liger Kernel: Efficient Triton Kernels for LLM TrainingCode9
SciGisPy: a Novel Metric for Biomedical Text Simplification via Gist Inference Score0
Integrating Supertag Features into Neural Discontinuous Constituent ParsingCode0
Autoregressive Action Sequence Learning for Robotic ManipulationCode2
UncertaintyRAG: Span-Level Uncertainty Enhanced Long-Context Modeling for Retrieval-Augmented Generation0
Medha: Efficiently Serving Multi-Million Context Length LLM Inference Requests Without Approximations0
J2N -- Nominal Adjective Identification and its ApplicationCode0
InterACT: Inter-dependency Aware Action Chunking with Hierarchical Attention Transformers for Bimanual Manipulation0
Late Chunking: Contextual Chunk Embeddings Using Long-Context Embedding ModelsCode3
3D Gaussian Splatting for Large-scale Surface Reconstruction from Aerial Images0
Bidirectional Decoding: Improving Action Chunking via Guided Test-Time SamplingCode2
TalkLoRA: Low-Rank Adaptation for Speech-Driven Animation0
Leveraging Fine-Tuned Retrieval-Augmented Generation with Long-Context Support: For 3GPP StandardsCode1
Meta Knowledge for Retrieval Augmented Large Language Models0
Hierarchical Working Memory and a New Magic Number0
Medical Graph RAG: Towards Safe Medical Large Language Model via Graph Retrieval-Augmented GenerationCode4
BLAZE: Cross-Language and Cross-Project Bug Localization via Dynamic Chunking and Hard Example Learning0
From Imitation to Refinement -- Residual RL for Precise Assembly0
Show:102550
← PrevPage 4 of 18Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ACEExact Span F197.3Unverified
2BERT-CRF (Replicated in AdaSeq)Exact Span F197.18Unverified
3ELMo + MAT + Multi-TaskExact Span F197.04Unverified
4CVT+Multi-Task+LargeExact Span F196.98Unverified
5ELMo + Multi-TaskExact Span F196.83Unverified
6FlairExact Span F196.72Unverified
7SeqVATExact Span F195.45Unverified
8Adversarial TrainingExact Span F195.25Unverified
9BiLSTM-CRFExact Span F195.18Unverified
#ModelMetricClaimedVerifiedStatus
1ACEF1 score97.3Unverified
2Flair embeddingsF1 score96.72Unverified
3JMTF1 score95.77Unverified
4Low supervisionF1 score95.57Unverified
5IntNet + BiLSTM-CRFF1 score95.29Unverified
6Suzuki and IsozakiF1 score95.15Unverified
7NCRF++F1 score95.06Unverified
8BI-LSTM-CRF (Senna) (ours)F1 score94.46Unverified
#ModelMetricClaimedVerifiedStatus
1ACEF195Unverified
2Wang et al., 2020F194.4Unverified
3AINF194.04Unverified
#ModelMetricClaimedVerifiedStatus
1Wang et al., 2020F192Unverified
2AINF191.71Unverified
#ModelMetricClaimedVerifiedStatus
1Def2VecAUC93.07Unverified