SOTAVerified

Chunking

Chunking, also known as shallow parsing, identifies continuous spans of tokens that form syntactic units such as noun phrases or verb phrases.

Example:

| Vinken | , | 61 | years | old | | --- | ---| --- | --- | --- | | B-NLP| I-NP | I-NP | I-NP | I-NP |

Papers

Showing 101150 of 447 papers

TitleStatusHype
ChunkRAG: Novel LLM-Chunk Filtering Method for RAG Systems0
Chunk Twice, Embed Once: A Systematic Study of Segmentation and Representation Trade-offs in Chemistry-Aware Retrieval-Augmented Generation0
ClearTK 2.0: Design Patterns for Machine Learning in UIMA0
ClearTK-TimeML: A minimalist approach to TempEval 20130
CLI-RAG: A Retrieval-Augmented Framework for Clinically Structured and Context Aware Text Generation with LLMs0
A Unified Framework for Structured Prediction: From Theory to Practice0
AILS-NTUA at SemEval-2025 Task 4: Parameter-Efficient Unlearning for Large Language Models using Data Chunking0
Combining Top-down and Bottom-up Search for Unsupervised Induction of Transduction Grammars0
A Retrieval-Augmented Generation Framework for Academic Literature Navigation in Data Science0
Concept-Guided Interpretability via Neural Chunking0
Building Trainable Taggers in a Web-based, UIMA-Supported NLP Workbench0
A Conditional Random Field-based Traditional Chinese Base Phrase Parser for SIGHAN Bake-off 2012 Evaluation0
A Reranking Model for Discourse Segmentation using Subtree Features0
A New HOPE: Domain-agnostic Automatic Evaluation of Text Chunking0
Coreference Resolution for the Basque Language with BART0
Counting What Counts: Decompounding for Keyphrase Extraction0
CronusVLA: Transferring Latent Motion Across Time for Multi-Frame Prediction in Manipulation0
Cross-lingual transfer parser from Hindi to Bengali using delexicalization and chunking0
Cross-View Training for Semi-Supervised Learning0
Crowd Prefers the Middle Path: A New IAA Metric for Crowdsourcing Reveals Turker Biases in Query Segmentation0
Bayes Test of Precision, Recall, and F1 Measure for Comparison of Two Natural Language Processing Models0
Bridging Industrial Expertise and XR with LLM-Powered Conversational Agents0
TOBUGraph: Knowledge Graph-Based Retrieval for Enhanced LLM Performance Beyond RAG0
Decoding with Finite-State Transducers on GPUs0
Evaluating distributed word representations for capturing semantics of biomedical concepts0
Deep Learning Transformer Architecture for Named Entity Recognition on Low Resourced Languages: State of the art results0
Evaluation of Domain-specific Word Embeddings using Knowledge Resources0
ExACT: An End-to-End Autonomous Excavator System Using Action Chunking With Transformers0
Breaking the Token Barrier: Chunking and Convolution for Efficient Long Text Classification with BERT0
A Recursive Recurrent Neural Network for Statistical Machine Translation0
A Concise Query Language with Search and Transform Operations for Corpora with Multiple Levels of Annotation0
Boundary-based MWE segmentation with text partitioning0
Bootstrapping a historical commodities lexicon with SKOS and DBpedia0
A recurrent connectionist model of melody perception : An exploration using TRACX20
A Boosting-based Algorithm for Classification of Semi-Structured Text using the Frequency of Substructures0
Boosting Named Entity Recognition with Neural Character Embeddings0
A Quantitative Comparative Study of Prosodic and Discourse Units, the Case of French and Taiwan Mandarin0
Enrichir et raisonner sur des espaces s\'emantiques pour l'attribution de mots-cl\'es (Enriching and reasoning on semantic spaces for keyword extraction) [in French]0
BLAZE: Cross-Language and Cross-Project Bug Localization via Dynamic Chunking and Hard Example Learning0
Domain Adaptation with Filtering for Named Entity Extraction of Japanese Anime-Related Words0
BIT-Xiaomi’s System for AutoSimTrans 20220
Apprentissage automatique d'un chunker pour le fran (Machine Learning of a chunker for French) [in French]0
Advanced System Integration: Analyzing OpenAPI Chunking for Retrieval-Augmented Generation0
Entailment: An Effective Metric for Comparing and Evaluating Hierarchical and Non-hierarchical Annotation Schemes0
DTSim at SemEval-2016 Task 1: Semantic Similarity Model Including Multi-Level Alignment and Vector-Based Compositional Semantics0
DTSim at SemEval-2016 Task 2: Interpreting Similarity of Texts Based on Automated Chunking, Chunk Alignment and Semantic Relation Prediction0
Duluth: Word Sense Discrimination in the Service of Lexicography0
Boosting Named Entity Recognition with Neural Character Embeddings0
Dynamic Chunking for End-to-End Hierarchical Sequence Modeling0
EPIC: Efficient Position-Independent Caching for Serving Large Language Models0
Show:102550
← PrevPage 3 of 9Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ACEExact Span F197.3Unverified
2BERT-CRF (Replicated in AdaSeq)Exact Span F197.18Unverified
3ELMo + MAT + Multi-TaskExact Span F197.04Unverified
4CVT+Multi-Task+LargeExact Span F196.98Unverified
5ELMo + Multi-TaskExact Span F196.83Unverified
6FlairExact Span F196.72Unverified
7SeqVATExact Span F195.45Unverified
8Adversarial TrainingExact Span F195.25Unverified
9BiLSTM-CRFExact Span F195.18Unverified
#ModelMetricClaimedVerifiedStatus
1ACEF1 score97.3Unverified
2Flair embeddingsF1 score96.72Unverified
3JMTF1 score95.77Unverified
4Low supervisionF1 score95.57Unverified
5IntNet + BiLSTM-CRFF1 score95.29Unverified
6Suzuki and IsozakiF1 score95.15Unverified
7NCRF++F1 score95.06Unverified
8BI-LSTM-CRF (Senna) (ours)F1 score94.46Unverified
#ModelMetricClaimedVerifiedStatus
1ACEF195Unverified
2Wang et al., 2020F194.4Unverified
3AINF194.04Unverified
#ModelMetricClaimedVerifiedStatus
1Wang et al., 2020F192Unverified
2AINF191.71Unverified
#ModelMetricClaimedVerifiedStatus
1Def2VecAUC93.07Unverified