SOTAVerified

Chunking

Chunking, also known as shallow parsing, identifies contiguous spans of tokens that form syntactic units such as noun phrases or verb phrases.

Example:

| Vinken | , | 61 | years | old |
| --- | --- | --- | --- | --- |
| B-NP | I-NP | I-NP | I-NP | I-NP |
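
The tags in the example follow the BIO scheme: `B-NP` opens a noun-phrase chunk and each `I-NP` continues it. As a minimal sketch of how such a tag sequence maps back to chunk spans, the hypothetical helper `bio_to_spans` below (the name and the handling of stray `I-` tags are assumptions, not part of any benchmark's official scorer) groups tags into `(label, start, end)` triples with an exclusive end index.

```python
from typing import List, Tuple


def bio_to_spans(tags: List[str]) -> List[Tuple[str, int, int]]:
    """Convert BIO tags into (label, start, end) spans; end is exclusive.

    Assumption: an I- tag with no matching open chunk starts a new chunk,
    which is one common lenient convention (not the only one).
    """
    spans: List[Tuple[str, int, int]] = []
    start, label = None, None
    for i, tag in enumerate(tags):
        prefix, _, kind = tag.partition("-")
        # The open chunk continues only on an I- tag of the same type.
        continues = prefix == "I" and label is not None and kind == label
        if continues:
            continue
        # Otherwise close the open chunk, if any.
        if label is not None:
            spans.append((label, start, i))
        # Open a new chunk on B- (or a stray I-); "O" leaves nothing open.
        if prefix in ("B", "I") and kind:
            start, label = i, kind
        else:
            start, label = None, None
    # Close a chunk that runs to the end of the sentence.
    if label is not None:
        spans.append((label, start, len(tags)))
    return spans


tokens = ["Vinken", ",", "61", "years", "old"]
tags = ["B-NP", "I-NP", "I-NP", "I-NP", "I-NP"]
for label, start, end in bio_to_spans(tags):
    print(label, tokens[start:end])
```

For the example sentence this yields a single `NP` span covering all five tokens. Span-level metrics such as exact-span F1 compare sets of triples like these between predicted and gold tag sequences.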

Papers

Showing 51–75 of 447 papers

| Title | Status | Hype |
| --- | --- | --- |
| NitiBench: A Comprehensive Studies of LLM Frameworks Capabilities for Thai Legal Question Answering | Code | 0 |
| Discovering Chunks in Neural Embeddings for Interpretability | | 0 |
| LLM-TA: An LLM-Enhanced Thematic Analysis Pipeline for Transcripts from Parents of Children with Congenital Heart Disease | Code | 0 |
| ACT-JEPA: Joint-Embedding Predictive Architecture Improves Policy Representation Learning | | 0 |
| Chat3GPP: An Open-Source Retrieval-Augmented Generation Framework for 3GPP Documents | Code | 1 |
| Evolution of diverse (and advanced) cognitive abilities through adaptive fine-tuning of learning and chunking mechanisms | | 0 |
| Passage Segmentation of Documents for Extractive Question Answering | | 0 |
| Enhancing Talent Employment Insights Through Feature Extraction with LLM Finetuning | | 0 |
| S2 Chunking: A Hybrid Framework for Document Segmentation Through Integrated Spatial and Semantic Analysis | Code | 1 |
| On LLM-Enhanced Mixed-Type Data Imputation with High-Order Message Passing | Code | 1 |
| CarbonChat: Large Language Model-Based Corporate Carbon Emission Analysis and Climate Knowledge Q&A System | | 0 |
| CAG: Chunked Augmented Generation for Google Chrome's Built-in Gemini Nano | Code | 0 |
| PCA-Featured Transformer for Jamming Detection in 5G UAV Networks | | 0 |
| A Retrieval-Augmented Generation Framework for Academic Literature Navigation in Data Science | | 0 |
| TOBUGraph: Knowledge Graph-Based Retrieval for Enhanced LLM Performance Beyond RAG | | 0 |
| Ltri-LLM: Streaming Long Context Inference for LLMs with Training-Free Dynamic Triangular Attention Pattern | | 0 |
| Advanced System Integration: Analyzing OpenAPI Chunking for Retrieval-Augmented Generation | | 0 |
| Attamba: Attending To Multi-Token States | Code | 1 |
| Performance Evaluation of Geospatial Images based on Zarr and Tiff | | 0 |
| Unlocking Legal Knowledge with Multi-Layered Embedding-Based Retrieval | | 0 |
| TeleOracle: Fine-Tuned Retrieval-Augmented Generation with Long-Context Support for Network | Code | 1 |
| LLM-Ref: Enhancing Reference Handling in Technical Writing with Large Language Models | | 0 |
| ChunkRAG: Novel LLM-Chunk Filtering Method for RAG Systems | | 0 |
| LongRAG: A Dual-Perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering | Code | 2 |
| ProveRAG: Provenance-Driven Vulnerability Analysis with Automated Retrieval-Augmented LLMs | Code | 0 |

Benchmark Results

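| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | ACE | Exact Span F1 | 97.3 | | Unverified |
| 2 | BERT-CRF (Replicated in AdaSeq) | Exact Span F1 | 97.18 | | Unverified |
| 3 | ELMo + MAT + Multi-Task | Exact Span F1 | 97.04 | | Unverified |
| 4 | CVT+Multi-Task+Large | Exact Span F1 | 96.98 | | Unverified |
| 5 | ELMo + Multi-Task | Exact Span F1 | 96.83 | | Unverified |
| 6 | Flair | Exact Span F1 | 96.72 | | Unverified |
| 7 | SeqVAT | Exact Span F1 | 95.45 | | Unverified |
| 8 | Adversarial Training | Exact Span F1 | 95.25 | | Unverified |
| 9 | BiLSTM-CRF | Exact Span F1 | 95.18 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | ACE | F1 score | 97.3 | | Unverified |
| 2 | Flair embeddings | F1 score | 96.72 | | Unverified |
| 3 | JMT | F1 score | 95.77 | | Unverified |
| 4 | Low supervision | F1 score | 95.57 | | Unverified |
| 5 | IntNet + BiLSTM-CRF | F1 score | 95.29 | | Unverified |
| 6 | Suzuki and Isozaki | F1 score | 95.15 | | Unverified |
| 7 | NCRF++ | F1 score | 95.06 | | Unverified |
| 8 | BI-LSTM-CRF (Senna) (ours) | F1 score | 94.46 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | ACE | F1 | 95 | | Unverified |
| 2 | Wang et al., 2020 | F1 | 94.4 | | Unverified |
| 3 | AIN | F1 | 94.04 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Wang et al., 2020 | F1 | 92 | | Unverified |
| 2 | AIN | F1 | 91.71 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Def2Vec | AUC | 93.07 | | Unverified |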