SOTAVerified

Chunking

Chunking, also known as shallow parsing, identifies continuous spans of tokens that form syntactic units such as noun phrases or verb phrases.

Example:

| Vinken | , | 61 | years | old | | --- | ---| --- | --- | --- | | B-NLP| I-NP | I-NP | I-NP | I-NP |

Papers

Showing 101150 of 447 papers

TitleStatusHype
Two eyes, Two views, and finally, One summary! Towards Multi-modal Multi-tasking Knowledge-Infused Medical Dialogue SummarizationCode0
CUSIDE-array: A Streaming Multi-Channel End-to-End Speech Recognition System with Realistic EvaluationsCode0
Strengthening Structural Inductive Biases by Pre-training to Perform Syntactic TransformationsCode0
LumberChunker: Long-Form Narrative Document SegmentationCode2
Learning Variable Compliance Control From a Few Demonstrations for Bimanual Robot with Haptic Feedback Teleoperation SystemCode1
LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs0
Leveraging Large Language Models for Web Scraping0
Separate and Reconstruct: Asymmetric Encoder-Decoder for Speech SeparationCode3
Demystifying AI Platform Design for Distributed Inference of Next-Generation LLM modelsCode2
Mix-of-Granularity: Optimize the Chunking Granularity for Retrieval-Augmented GenerationCode0
Dataset Decomposition: Faster LLM Training with Variable Sequence Length CurriculumCode1
Equipping Transformer with Random-Access Reading for Long-Context Understanding0
PathOCL: Path-Based Prompt Augmentation for OCL Generation with GPT-40
ExACT: An End-to-End Autonomous Excavator System Using Action Chunking With Transformers0
Sequence Compression Speeds Up Credit Assignment in Reinforcement LearningCode0
Multi-view Content-aware Indexing for Long Document Retrieval0
Improving Retrieval for RAG based Question Answering Models on Financial Documents0
Opening the black box of language acquisitionCode0
BGE Landmark Embedding: A Chunking-Free Embedding Method For Retrieval Augmented Long-Context Large Language Models0
Grounding Language Model with Chunking-Free In-Context Retrieval0
Punctuation Restoration Improves Structure Understanding Without SupervisionCode0
Benchmarking and Building Long-Context Retrieval Models with LoCo and M2-BERT0
Financial Report Chunking for Effective Retrieval Augmented GenerationCode0
Def2Vec: Extensible Word Embeddings from Dictionary DefinitionsCode0
Releasing the CRaQAn (Coreference Resolution in Question-Answering): An open-source dataset and dataset creation methodology using instruction-following models0
A recurrent connectionist model of melody perception : An exploration using TRACX20
Breaking the Token Barrier: Chunking and Convolution for Efficient Long Text Classification with BERT0
Fast and Accurate Factual Inconsistency Detection Over Long DocumentsCode1
Symmetrical SyncMap for Imbalanced General Chunking Problems0
Abstractive Summarization of Large Document Collections Using GPT0
Chunking: Continual Learning is not just about Distribution ShiftCode0
Fine-tuned vs. Prompt-tuned Supervised Representations: Which Better Account for Brain Language Representations?0
Exploring RWKV for Memory Efficient and Low Latency Streaming ASR0
One ACT Play: Single Demonstration Behavior Cloning with Action Chunking Transformers0
Unsupervised Chunking with Hierarchical RNNCode0
RoboAgent: Generalization and Efficiency in Robot Manipulation via Semantic Augmentations and Action Chunking0
Geo-Encoder: A Chunk-Argument Bi-Encoder Framework for Chinese Geographic Re-RankingCode0
Chunked Lists versus Extensible Arrays for Text Inversion0
MultiSChuBERT: Effective Multimodal Fusion for Scholarly Document Quality Prediction0
TBIN: Modeling Long Textual Behavior Data for CTR Prediction0
Sparse Modular Activation for Efficient Sequence ModelingCode1
Recurrent Attention Networks for Long-text ModelingCode1
Habits of Mind: Reusing Action Sequences for Efficient Planning0
The ACL OCL Corpus: Advancing Open Science in Computational Linguistics0
Open Information Extraction via ChunksCode0
A Cognitive Account of the Puzzle of Ideography0
Learning Fine-Grained Bimanual Manipulation with Low-Cost Hardware0
Tracking by Associating Clips0
Prompting Language Models for Linguistic Structure0
Technology Pipeline for Large Scale Cross-Lingual Dubbing of Lecture Videos into Multiple Indian Languages0
Show:102550
← PrevPage 3 of 9Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ACEExact Span F197.3Unverified
2BERT-CRF (Replicated in AdaSeq)Exact Span F197.18Unverified
3ELMo + MAT + Multi-TaskExact Span F197.04Unverified
4CVT+Multi-Task+LargeExact Span F196.98Unverified
5ELMo + Multi-TaskExact Span F196.83Unverified
6FlairExact Span F196.72Unverified
7SeqVATExact Span F195.45Unverified
8Adversarial TrainingExact Span F195.25Unverified
9BiLSTM-CRFExact Span F195.18Unverified
#ModelMetricClaimedVerifiedStatus
1ACEF1 score97.3Unverified
2Flair embeddingsF1 score96.72Unverified
3JMTF1 score95.77Unverified
4Low supervisionF1 score95.57Unverified
5IntNet + BiLSTM-CRFF1 score95.29Unverified
6Suzuki and IsozakiF1 score95.15Unverified
7NCRF++F1 score95.06Unverified
8BI-LSTM-CRF (Senna) (ours)F1 score94.46Unverified
#ModelMetricClaimedVerifiedStatus
1ACEF195Unverified
2Wang et al., 2020F194.4Unverified
3AINF194.04Unverified
#ModelMetricClaimedVerifiedStatus
1Wang et al., 2020F192Unverified
2AINF191.71Unverified
#ModelMetricClaimedVerifiedStatus
1Def2VecAUC93.07Unverified