SOTAVerified

Chunking

Chunking, also known as shallow parsing, identifies contiguous spans of tokens that form syntactic units such as noun phrases or verb phrases.

Example:

| Vinken | , | 61 | years | old |
| --- | --- | --- | --- | --- |
| B-NP | I-NP | I-NP | I-NP | I-NP |
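The tags in the example follow the BIO scheme: `B-` opens a chunk, `I-` continues it, and `O` (not shown here) marks tokens outside any chunk. As a minimal sketch of how such tags map back to spans, the helper below (`bio_to_spans` is a hypothetical name, not part of any library) groups tags into `(label, start, end)` triples with an exclusive end index. It assumes plain BIO tags and leniently treats a stray `I-` tag as the start of a new chunk.

```python
from typing import List, Tuple


def bio_to_spans(tags: List[str]) -> List[Tuple[str, int, int]]:
    """Group BIO tags into (label, start, end) spans; end is exclusive."""
    spans: List[Tuple[str, int, int]] = []
    start, label = None, None  # the currently open chunk, if any
    for i, tag in enumerate(tags):
        prefix, _, tag_label = tag.partition("-")
        # An I- tag with the same label extends the open chunk.
        if prefix == "I" and label is not None and tag_label == label:
            continue
        # Any other tag closes the open chunk.
        if label is not None:
            spans.append((label, start, i))
            start, label = None, None
        # B- opens a new chunk; a stray I- is treated the same way (assumption).
        if prefix in ("B", "I") and tag_label:
            start, label = i, tag_label
    if label is not None:
        spans.append((label, start, len(tags)))
    return spans


tokens = ["Vinken", ",", "61", "years", "old"]
tags = ["B-NP", "I-NP", "I-NP", "I-NP", "I-NP"]
spans = bio_to_spans(tags)
chunks = [(label, " ".join(tokens[s:e])) for label, s, e in spans]
```

For the example sentence, all five tokens fall into a single NP chunk covering token positions 0 to 5.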

Papers

Showing 101-125 of 447 papers

| Title | Status | Hype |
| --- | --- | --- |
| Two eyes, Two views, and finally, One summary! Towards Multi-modal Multi-tasking Knowledge-Infused Medical Dialogue Summarization | Code | 0 |
| CUSIDE-array: A Streaming Multi-Channel End-to-End Speech Recognition System with Realistic Evaluations | Code | 0 |
| Strengthening Structural Inductive Biases by Pre-training to Perform Syntactic Transformations | Code | 0 |
| LumberChunker: Long-Form Narrative Document Segmentation | Code | 2 |
| LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs | | 0 |
| Learning Variable Compliance Control From a Few Demonstrations for Bimanual Robot with Haptic Feedback Teleoperation System | Code | 1 |
| Leveraging Large Language Models for Web Scraping | | 0 |
| Separate and Reconstruct: Asymmetric Encoder-Decoder for Speech Separation | Code | 3 |
| Demystifying AI Platform Design for Distributed Inference of Next-Generation LLM models | Code | 2 |
| Mix-of-Granularity: Optimize the Chunking Granularity for Retrieval-Augmented Generation | Code | 0 |
| Dataset Decomposition: Faster LLM Training with Variable Sequence Length Curriculum | Code | 1 |
| Equipping Transformer with Random-Access Reading for Long-Context Understanding | | 0 |
| PathOCL: Path-Based Prompt Augmentation for OCL Generation with GPT-4 | | 0 |
| ExACT: An End-to-End Autonomous Excavator System Using Action Chunking With Transformers | | 0 |
| Sequence Compression Speeds Up Credit Assignment in Reinforcement Learning | Code | 0 |
| Multi-view Content-aware Indexing for Long Document Retrieval | | 0 |
| Improving Retrieval for RAG based Question Answering Models on Financial Documents | | 0 |
| Opening the black box of language acquisition | Code | 0 |
| BGE Landmark Embedding: A Chunking-Free Embedding Method For Retrieval Augmented Long-Context Large Language Models | | 0 |
| Grounding Language Model with Chunking-Free In-Context Retrieval | | 0 |
| Punctuation Restoration Improves Structure Understanding Without Supervision | Code | 0 |
| Benchmarking and Building Long-Context Retrieval Models with LoCo and M2-BERT | | 0 |
| Financial Report Chunking for Effective Retrieval Augmented Generation | Code | 0 |
| Def2Vec: Extensible Word Embeddings from Dictionary Definitions | Code | 0 |
| Releasing the CRaQAn (Coreference Resolution in Question-Answering): An open-source dataset and dataset creation methodology using instruction-following models | | 0 |

Benchmark Results

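| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | ACE | Exact Span F1 | 97.3 | | Unverified |
| 2 | BERT-CRF (Replicated in AdaSeq) | Exact Span F1 | 97.18 | | Unverified |
| 3 | ELMo + MAT + Multi-Task | Exact Span F1 | 97.04 | | Unverified |
| 4 | CVT+Multi-Task+Large | Exact Span F1 | 96.98 | | Unverified |
| 5 | ELMo + Multi-Task | Exact Span F1 | 96.83 | | Unverified |
| 6 | Flair | Exact Span F1 | 96.72 | | Unverified |
| 7 | SeqVAT | Exact Span F1 | 95.45 | | Unverified |
| 8 | Adversarial Training | Exact Span F1 | 95.25 | | Unverified |
| 9 | BiLSTM-CRF | Exact Span F1 | 95.18 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | ACE | F1 score | 97.3 | | Unverified |
| 2 | Flair embeddings | F1 score | 96.72 | | Unverified |
| 3 | JMT | F1 score | 95.77 | | Unverified |
| 4 | Low supervision | F1 score | 95.57 | | Unverified |
| 5 | IntNet + BiLSTM-CRF | F1 score | 95.29 | | Unverified |
| 6 | Suzuki and Isozaki | F1 score | 95.15 | | Unverified |
| 7 | NCRF++ | F1 score | 95.06 | | Unverified |
| 8 | BI-LSTM-CRF (Senna) (ours) | F1 score | 94.46 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | ACE | F1 | 95 | | Unverified |
| 2 | Wang et al., 2020 | F1 | 94.4 | | Unverified |
| 3 | AIN | F1 | 94.04 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Wang et al., 2020 | F1 | 92 | | Unverified |
| 2 | AIN | F1 | 91.71 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Def2Vec | AUC | 93.07 | | Unverified |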
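
The page does not define the "Exact Span F1" metric used in the first leaderboard. A minimal sketch follows, assuming the conventional chunk-level definition: a predicted chunk counts as correct only if its label and both boundaries match a gold chunk exactly. The function name `exact_span_f1` and the example spans are illustrative, and this is not necessarily the scorer behind the numbers listed above.

```python
from typing import Iterable, Tuple

Span = Tuple[str, int, int]  # (label, start, end), end exclusive


def exact_span_f1(gold: Iterable[Span], pred: Iterable[Span]) -> float:
    """F1 over chunks, counting only exact label-and-boundary matches."""
    gold_set, pred_set = set(gold), set(pred)
    true_positives = len(gold_set & pred_set)
    precision = true_positives / len(pred_set) if pred_set else 0.0
    recall = true_positives / len(gold_set) if gold_set else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Illustrative spans: the last predicted NP ends one token too early,
# so it earns no credit under exact matching.
gold_spans = [("NP", 0, 2), ("VP", 2, 3), ("NP", 3, 5)]
pred_spans = [("NP", 0, 2), ("VP", 2, 3), ("NP", 3, 4)]
score = exact_span_f1(gold_spans, pred_spans)
```

Here two of three predicted chunks match, so precision and recall are both 2/3. Leaderboards typically report this value multiplied by 100.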