SOTAVerified

Chunking

Chunking, also known as shallow parsing, identifies contiguous spans of tokens that form syntactic units such as noun phrases or verb phrases.

Example:

| Vinken | , | 61 | years | old |
| --- | --- | --- | --- | --- |
| B-NP | I-NP | I-NP | I-NP | I-NP |
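
The tags in the example follow the BIO scheme: `B-NP` opens a noun-phrase chunk and each `I-NP` continues it. As a minimal sketch of how such tags can be turned back into spans, the hypothetical helper `bio_to_spans` below (not part of any library) groups a tag sequence into `(label, start, end)` triples with an exclusive end index. Treating a stray `I-` tag as the start of a new chunk is an assumption made here for leniency; other decoders handle that case differently.

```python
from typing import List, Tuple


def bio_to_spans(tags: List[str]) -> List[Tuple[str, int, int]]:
    """Group BIO tags into (label, start, end) spans; end is exclusive."""
    spans: List[Tuple[str, int, int]] = []
    start, label = None, None
    for i, tag in enumerate(tags):
        # "B-NP" -> ("B", "-", "NP"); "O" -> ("O", "", "")
        prefix, _, kind = tag.partition("-")
        # An I- tag extends the open span only if the label matches.
        continues = prefix == "I" and start is not None and label == kind
        if not continues:
            if start is not None:
                spans.append((label, start, i))
                start, label = None, None
            # Assumption: a stray I- tag opens a new span, like B-.
            if prefix in ("B", "I"):
                start, label = i, kind
    if start is not None:
        spans.append((label, start, len(tags)))
    return spans


tokens = ["Vinken", ",", "61", "years", "old"]
tags = ["B-NP", "I-NP", "I-NP", "I-NP", "I-NP"]
for label, start, end in bio_to_spans(tags):
    print(label, tokens[start:end])
```

For the example sentence this yields a single `NP` span covering all five tokens.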

Papers

Showing 101-150 of 447 papers

| Title | Status | Hype |
| --- | --- | --- |
| ChunkRAG: Novel LLM-Chunk Filtering Method for RAG Systems | | 0 |
| Chunk Twice, Embed Once: A Systematic Study of Segmentation and Representation Trade-offs in Chemistry-Aware Retrieval-Augmented Generation | | 0 |
| ClearTK 2.0: Design Patterns for Machine Learning in UIMA | | 0 |
| ClearTK-TimeML: A minimalist approach to TempEval 2013 | | 0 |
| CLI-RAG: A Retrieval-Augmented Framework for Clinically Structured and Context Aware Text Generation with LLMs | | 0 |
| A Unified Framework for Structured Prediction: From Theory to Practice | | 0 |
| AILS-NTUA at SemEval-2025 Task 4: Parameter-Efficient Unlearning for Large Language Models using Data Chunking | | 0 |
| Combining Top-down and Bottom-up Search for Unsupervised Induction of Transduction Grammars | | 0 |
| A Retrieval-Augmented Generation Framework for Academic Literature Navigation in Data Science | | 0 |
| Concept-Guided Interpretability via Neural Chunking | | 0 |
| Building Trainable Taggers in a Web-based, UIMA-Supported NLP Workbench | | 0 |
| A Conditional Random Field-based Traditional Chinese Base Phrase Parser for SIGHAN Bake-off 2012 Evaluation | | 0 |
| A Reranking Model for Discourse Segmentation using Subtree Features | | 0 |
| A New HOPE: Domain-agnostic Automatic Evaluation of Text Chunking | | 0 |
| Coreference Resolution for the Basque Language with BART | | 0 |
| Counting What Counts: Decompounding for Keyphrase Extraction | | 0 |
| CronusVLA: Transferring Latent Motion Across Time for Multi-Frame Prediction in Manipulation | | 0 |
| Cross-lingual transfer parser from Hindi to Bengali using delexicalization and chunking | | 0 |
| Cross-View Training for Semi-Supervised Learning | | 0 |
| Crowd Prefers the Middle Path: A New IAA Metric for Crowdsourcing Reveals Turker Biases in Query Segmentation | | 0 |
| CUSIDE: Chunking, Simulating Future Context and Decoding for Streaming ASR | | 0 |
| Bridging Industrial Expertise and XR with LLM-Powered Conversational Agents | | 0 |
| TOBUGraph: Knowledge Graph-Based Retrieval for Enhanced LLM Performance Beyond RAG | | 0 |
| Decoding with Finite-State Transducers on GPUs | | 0 |
| Evaluating distributed word representations for capturing semantics of biomedical concepts | | 0 |
| Deep Learning Transformer Architecture for Named Entity Recognition on Low Resourced Languages: State of the art results | | 0 |
| Evaluation of Domain-specific Word Embeddings using Knowledge Resources | | 0 |
| Evolution of diverse (and advanced) cognitive abilities through adaptive fine-tuning of learning and chunking mechanisms | | 0 |
| Breaking the Token Barrier: Chunking and Convolution for Efficient Long Text Classification with BERT | | 0 |
| A Recursive Recurrent Neural Network for Statistical Machine Translation | | 0 |
| A Concise Query Language with Search and Transform Operations for Corpora with Multiple Levels of Annotation | | 0 |
| Boundary-based MWE segmentation with text partitioning | | 0 |
| Bootstrapping a historical commodities lexicon with SKOS and DBpedia | | 0 |
| A recurrent connectionist model of melody perception : An exploration using TRACX2 | | 0 |
| A Boosting-based Algorithm for Classification of Semi-Structured Text using the Frequency of Substructures | | 0 |
| Boosting Named Entity Recognition with Neural Character Embeddings | | 0 |
| A Quantitative Comparative Study of Prosodic and Discourse Units, the Case of French and Taiwan Mandarin | | 0 |
| Enrichir et raisonner sur des espaces sémantiques pour l'attribution de mots-clés (Enriching and reasoning on semantic spaces for keyword extraction) [in French] | | 0 |
| BLAZE: Cross-Language and Cross-Project Bug Localization via Dynamic Chunking and Hard Example Learning | | 0 |
| Domain Adaptation with Filtering for Named Entity Extraction of Japanese Anime-Related Words | | 0 |
| BIT-Xiaomi’s System for AutoSimTrans 2022 | | 0 |
| Apprentissage automatique d'un chunker pour le français (Machine Learning of a chunker for French) [in French] | | 0 |
| Advanced System Integration: Analyzing OpenAPI Chunking for Retrieval-Augmented Generation | | 0 |
| Entailment: An Effective Metric for Comparing and Evaluating Hierarchical and Non-hierarchical Annotation Schemes | | 0 |
| DTSim at SemEval-2016 Task 1: Semantic Similarity Model Including Multi-Level Alignment and Vector-Based Compositional Semantics | | 0 |
| DTSim at SemEval-2016 Task 2: Interpreting Similarity of Texts Based on Automated Chunking, Chunk Alignment and Semantic Relation Prediction | | 0 |
| Duluth: Word Sense Discrimination in the Service of Lexicography | | 0 |
| Boosting Named Entity Recognition with Neural Character Embeddings | | 0 |
| Dynamic Chunking for End-to-End Hierarchical Sequence Modeling | | 0 |
| EPIC: Efficient Position-Independent Caching for Serving Large Language Models | | 0 |

Benchmark Results

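| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | ACE | Exact Span F1 | 97.3 | | Unverified |
| 2 | BERT-CRF (Replicated in AdaSeq) | Exact Span F1 | 97.18 | | Unverified |
| 3 | ELMo + MAT + Multi-Task | Exact Span F1 | 97.04 | | Unverified |
| 4 | CVT+Multi-Task+Large | Exact Span F1 | 96.98 | | Unverified |
| 5 | ELMo + Multi-Task | Exact Span F1 | 96.83 | | Unverified |
| 6 | Flair | Exact Span F1 | 96.72 | | Unverified |
| 7 | SeqVAT | Exact Span F1 | 95.45 | | Unverified |
| 8 | Adversarial Training | Exact Span F1 | 95.25 | | Unverified |
| 9 | BiLSTM-CRF | Exact Span F1 | 95.18 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | ACE | F1 score | 97.3 | | Unverified |
| 2 | Flair embeddings | F1 score | 96.72 | | Unverified |
| 3 | JMT | F1 score | 95.77 | | Unverified |
| 4 | Low supervision | F1 score | 95.57 | | Unverified |
| 5 | IntNet + BiLSTM-CRF | F1 score | 95.29 | | Unverified |
| 6 | Suzuki and Isozaki | F1 score | 95.15 | | Unverified |
| 7 | NCRF++ | F1 score | 95.06 | | Unverified |
| 8 | BI-LSTM-CRF (Senna) (ours) | F1 score | 94.46 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | ACE | F1 | 95 | | Unverified |
| 2 | Wang et al., 2020 | F1 | 94.4 | | Unverified |
| 3 | AIN | F1 | 94.04 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Wang et al., 2020 | F1 | 92 | | Unverified |
| 2 | AIN | F1 | 91.71 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Def2Vec | AUC | 93.07 | | Unverified |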
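
The first benchmark reports Exact Span F1, but the page does not define the metric. The sketch below assumes the usual CoNLL-style reading: a predicted chunk counts as correct only when its label and both boundaries match a gold chunk, and F1 is the harmonic mean of the resulting precision and recall. The function name `exact_span_f1` and the `(label, start, end)` span representation are illustrative choices, not taken from any of the listed papers.

```python
from typing import Set, Tuple

Span = Tuple[str, int, int]  # (label, start, end), end exclusive


def exact_span_f1(gold: Set[Span], pred: Set[Span]) -> float:
    """F1 over spans; a prediction is correct only on an exact match."""
    if not gold or not pred:
        return 0.0
    true_pos = len(gold & pred)  # label and both boundaries must agree
    if true_pos == 0:
        return 0.0
    precision = true_pos / len(pred)
    recall = true_pos / len(gold)
    return 2 * precision * recall / (precision + recall)


# Made-up spans for illustration: the predicted VP ends one token early.
gold = {("NP", 0, 5), ("VP", 5, 7), ("NP", 7, 9)}
pred = {("NP", 0, 5), ("VP", 5, 6), ("NP", 7, 9)}
print(round(100 * exact_span_f1(gold, pred), 2))
```

Two of the three predicted spans match exactly, so precision and recall are both 2/3. Leaderboard figures are normally computed over a whole test set rather than one sentence; adding a sentence index to each span tuple would extend this sketch to that case.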