| Neural Retrievers are Biased Towards LLM-Generated Content | Oct 31, 2023 | Information RetrievalRetrieval | CodeCode Available | 1 |
| Semantic Text Compression for Classification | Sep 19, 2023 | ClassificationDecoder | —Unverified | 0 |
| EntropyRank: Unsupervised Keyphrase Extraction via Side-Information Optimization for Language Model-based Text Compression | Aug 25, 2023 | Keyphrase ExtractionLanguage Modeling | —Unverified | 0 |
| Approximating Human-Like Few-shot Learning with GPT-based Compression | Aug 14, 2023 | Data CompressionFew-Shot Learning | —Unverified | 0 |
| Gzip versus bag-of-words for text classification | Jul 27, 2023 | Classificationtext-classification | CodeCode Available | 0 |
| LLMZip: Lossless Text Compression using Large Language Models | Jun 6, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Optimal alphabet for single text compression | Jan 13, 2022 | Text Compression | —Unverified | 0 |
| Contextualized Semantic Distance between Highly Overlapped Texts | Oct 4, 2021 | Domain AdaptationLanguage Modeling | CodeCode Available | 0 |
| Text Compression-aided Transformer Encoding | Feb 11, 2021 | Text Compression | —Unverified | 0 |
| Machine Translation with Unsupervised Length-Constraints | Apr 7, 2020 | DecoderMachine Translation | —Unverified | 0 |