| AgentStealth: Reinforcing Large Language Model for Anonymizing User-generated Text | Jun 26, 2025 | Contrastive Learning, Language Modeling | Code Available | 0 |
| Learning Extrapolative Sequence Transformations from Markov Chains | May 26, 2025 | Text Anonymization | Code Available | 0 |
| A document processing pipeline for the construction of a dataset for topic modeling based on the judgments of the Italian Supreme Court | May 13, 2025 | Diversity, Document Layout Analysis | Unverified | 0 |
| Survey of Pseudonymization, Abstractive Summarization & Spell Checker for Hindi and Marathi | Dec 24, 2024 | Abstractive Text Summarization, Diversity | Unverified | 0 |
| Truthful Text Sanitization Guided by Inference Attacks | Dec 17, 2024 | Text Anonymization | Unverified | 0 |
| DIRI: Adversarial Patient Reidentification with Large Language Models for Evaluating Clinical Text Anonymization | Oct 22, 2024 | De-identification, Language Modeling | Unverified | 0 |
| Robust Utility-Preserving Text Anonymization Based on Large Language Models | Jul 16, 2024 | Text Anonymization | Code Available | 1 |
| Comparing Feature-based and Context-aware Approaches to PII Generalization Level Prediction | Jul 3, 2024 | Ensemble Learning, Feature Selection | Unverified | 0 |
| IncogniText: Privacy-enhancing Conditional Text Anonymization via LLM-based Private Attribute Randomization | Jul 3, 2024 | Attribute, Text Anonymization | Unverified | 0 |
| Unlocking the Potential of Large Language Models for Clinical Text Anonymization: A Comparative Study | May 29, 2024 | Text Anonymization | Unverified | 0 |