| Fine-mixing: Mitigating Backdoors in Fine-tuned Language Models | Oct 18, 2022 | Language Modelling, Sentence | Code Available | 8 |
| Segment Any Text: A Universal Approach for Robust, Efficient and Adaptable Sentence Segmentation | Jun 24, 2024 | parameter-efficient fine-tuning, Sentence | Code Available | 7 |
| Large Concept Models: Language Modeling in a Sentence Representation Space | Dec 11, 2024 | Language Modeling, Language Modelling | Code Available | 7 |
| Interactive Prompt Debugging with Sequence Salience | Apr 11, 2024 | Sentence, text-classification | Code Available | 7 |
| AutoTrain: No-code training for state-of-the-art models | Oct 21, 2024 | Classification, image-classification | Code Available | 7 |
| Factuality Enhanced Language Models for Open-Ended Text Generation | Jun 9, 2022 | Misconceptions, Sentence | Code Available | 5 |
| KBLaM: Knowledge Base augmented Language Model | Oct 14, 2024 | 8k, GPU | Code Available | 5 |
| Speech Segmentation Optimization using Segmented Bilingual Speech Corpus for End-to-end Speech Translation | Mar 29, 2022 | Binary Classification, Segmentation | Code Available | 4 |
| What Makes Good In-Context Examples for GPT-3? | Jan 17, 2021 | Few-Shot Learning, Natural Language Understanding | Code Available | 4 |
| LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA | Sep 4, 2024 | Question Answering, Sentence | Code Available | 4 |
| Efficient Few-Shot Learning Without Prompts | Sep 22, 2022 | Few-Shot Learning, Few-Shot Text Classification | Code Available | 4 |
| 2D Matryoshka Sentence Embeddings | Feb 22, 2024 | RAG, Representation Learning | Code Available | 4 |
| Zero-shot Entity Linking with Less Data | Jul 1, 2022 | Entity Linking, Multi-Task Learning | Code Available | 3 |
| Bridging Language and Items for Retrieval and Recommendation | Mar 6, 2024 | Retrieval, Sentence | Code Available | 3 |
| Where's the Point? Self-Supervised Multilingual Punctuation-Agnostic Sentence Segmentation | May 30, 2023 | Machine Translation, Segmentation | Code Available | 3 |
| Language Models are Few-Shot Learners | May 28, 2020 | answerability prediction, Articles | Code Available | 3 |
| PubMed 200k RCT: a Dataset for Sequential Sentence Classification in Medical Abstracts | Oct 17, 2017 | General Classification, Sentence | Code Available | 3 |
| ERNIE-M: Enhanced Multilingual Representation by Aligning Cross-lingual Semantics with Monolingual Corpora | Dec 31, 2020 | Sentence, Translation | Code Available | 3 |
| Cyber-Attack Technique Classification Using Two-Stage Trained Large Language Models | Nov 27, 2024 | Classification, Sentence | Code Available | 3 |
| Diffusion-LM Improves Controllable Text Generation | May 27, 2022 | Language Modeling, Language Modelling | Code Available | 3 |
| RefChecker: Reference-based Fine-grained Hallucination Checker and Benchmark for Large Language Models | May 23, 2024 | Hallucination, Sentence | Code Available | 3 |
| Compositional Entailment Learning for Hyperbolic Vision-Language Models | Oct 9, 2024 | Language Modelling, Representation Learning | Code Available | 2 |
| CLUE: A Chinese Language Understanding Evaluation Benchmark | Apr 13, 2020 | General Classification, Machine Reading Comprehension | Code Available | 2 |
| Compositional Visual Generation with Composable Diffusion Models | Jun 3, 2022 | Sentence | Code Available | 2 |
| CampNet: Context-Aware Mask Prediction for End-to-End Text-Based Speech Editing | Feb 21, 2022 | Few-Shot Learning, Sentence | Code Available | 2 |