| Fine-mixing: Mitigating Backdoors in Fine-tuned Language Models | Oct 18, 2022 | Language Modelling, Sentence | Code Available | 8 |
| Large Concept Models: Language Modeling in a Sentence Representation Space | Dec 11, 2024 | Language Modeling, Language Modelling | Code Available | 7 |
| AutoTrain: No-code training for state-of-the-art models | Oct 21, 2024 | Classification, image-classification | Code Available | 7 |
| Segment Any Text: A Universal Approach for Robust, Efficient and Adaptable Sentence Segmentation | Jun 24, 2024 | parameter-efficient fine-tuning, Sentence | Code Available | 7 |
| Interactive Prompt Debugging with Sequence Salience | Apr 11, 2024 | Sentence, text-classification | Code Available | 7 |
| KBLaM: Knowledge Base augmented Language Model | Oct 14, 2024 | 8k, GPU | Code Available | 5 |
| Factuality Enhanced Language Models for Open-Ended Text Generation | Jun 9, 2022 | Misconceptions, Sentence | Code Available | 5 |
| LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA | Sep 4, 2024 | Question Answering, Sentence | Code Available | 4 |
| 2D Matryoshka Sentence Embeddings | Feb 22, 2024 | RAG, Representation Learning | Code Available | 4 |
| Efficient Few-Shot Learning Without Prompts | Sep 22, 2022 | Few-Shot Learning, Few-Shot Text Classification | Code Available | 4 |
| Speech Segmentation Optimization using Segmented Bilingual Speech Corpus for End-to-end Speech Translation | Mar 29, 2022 | Binary Classification, Segmentation | Code Available | 4 |
| What Makes Good In-Context Examples for GPT-3? | Jan 17, 2021 | Few-Shot Learning, Natural Language Understanding | Code Available | 4 |
| Cyber-Attack Technique Classification Using Two-Stage Trained Large Language Models | Nov 27, 2024 | Classification, Sentence | Code Available | 3 |
| RefChecker: Reference-based Fine-grained Hallucination Checker and Benchmark for Large Language Models | May 23, 2024 | Hallucination, Sentence | Code Available | 3 |
| Bridging Language and Items for Retrieval and Recommendation | Mar 6, 2024 | Retrieval, Sentence | Code Available | 3 |
| Where's the Point? Self-Supervised Multilingual Punctuation-Agnostic Sentence Segmentation | May 30, 2023 | Machine Translation, Segmentation | Code Available | 3 |
| Zero-shot Entity Linking with Less Data | Jul 1, 2022 | Entity Linking, Multi-Task Learning | Code Available | 3 |
| Diffusion-LM Improves Controllable Text Generation | May 27, 2022 | Language Modeling, Language Modelling | Code Available | 3 |
| ERNIE-M: Enhanced Multilingual Representation by Aligning Cross-lingual Semantics with Monolingual Corpora | Dec 31, 2020 | Sentence, Translation | Code Available | 3 |
| Language Models are Few-Shot Learners | May 28, 2020 | answerability prediction, Articles | Code Available | 3 |
| PubMed 200k RCT: a Dataset for Sequential Sentence Classification in Medical Abstracts | Oct 17, 2017 | General Classification, Sentence | Code Available | 3 |
| Thought Anchors: Which LLM Reasoning Steps Matter? | Jun 23, 2025 | counterfactual, Sentence | Code Available | 2 |
| DiscoSG: Towards Discourse-Level Text Scene Graph Parsing through Iterative Graph Refinement | Jun 18, 2025 | Graph Generation, Hallucination | Code Available | 2 |
| Refining Sentence Embedding Model through Ranking Sentences Generation with Large Language Models | Feb 19, 2025 | Contrastive Learning, Sentence | Code Available | 2 |
| Enhancing Retrieval-Augmented Generation: A Study of Best Practices | Jan 13, 2025 | In-Context Learning, RAG | Code Available | 2 |
| DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory | Oct 10, 2024 | Document Translation, Machine Translation | Code Available | 2 |
| Compositional Entailment Learning for Hyperbolic Vision-Language Models | Oct 9, 2024 | Language Modelling, Representation Learning | Code Available | 2 |
| A Simple yet Effective Training-free Prompt-free Approach to Chinese Spelling Correction Based on Large Language Models | Oct 5, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models | Oct 4, 2024 | Dense Video Captioning, Sentence | Code Available | 2 |
| beeFormer: Bridging the Gap Between Semantic and Interaction Similarity in Recommender Systems | Sep 16, 2024 | Collaborative Filtering, Recommendation Systems | Code Available | 2 |
| AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction | Sep 3, 2024 | Relation, Relation Extraction | Code Available | 2 |
| BEYOND DIALOGUE: A Profile-Dialogue Alignment Framework Towards General Role-Playing Language Model | Aug 20, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning | Jul 9, 2024 | Image Generation, Sentence | Code Available | 2 |
| One Thousand and One Pairs: A "novel" challenge for long-context language models | Jun 24, 2024 | Retrieval, Sentence | Code Available | 2 |
| ANAH: Analytical Annotation of Hallucinations in Large Language Models | May 30, 2024 | Generative Question Answering, Hallucination | Code Available | 2 |
| Mammo-CLIP: A Vision Language Foundation Model to Enhance Data Efficiency and Robustness in Mammography | May 20, 2024 | Breast Cancer Detection, Diversity | Code Available | 2 |
| SHiNe: Semantic Hierarchy Nexus for Open-vocabulary Object Detection | May 16, 2024 | object-detection, Object Detection | Code Available | 2 |
| Learning representations of learning representations | Apr 12, 2024 | Sentence | Code Available | 2 |
| Decoupling Static and Hierarchical Motion Perception for Referring Video Segmentation | Apr 4, 2024 | Contrastive Learning, Referring Expression | Code Available | 2 |
| ARAGOG: Advanced RAG Output Grading | Apr 1, 2024 | Document Embedding, Language Modeling | Code Available | 2 |
| DreamLIP: Language-Image Pre-training with Long Captions | Mar 25, 2024 | Contrastive Learning, Image-text Retrieval | Code Available | 2 |
| AutoRE: Document-Level Relation Extraction with Large Language Models | Mar 21, 2024 | Document-level Relation Extraction, Relation | Code Available | 2 |
| DRAGIN: Dynamic Retrieval Augmented Generation based on the Information Needs of Large Language Models | Mar 15, 2024 | RAG, Retrieval | Code Available | 2 |
| What Was Your Prompt? A Remote Keylogging Attack on AI Assistants | Mar 14, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Generative Pretrained Structured Transformers: Unsupervised Syntactic Language Models at Scale | Mar 13, 2024 | Constituency Grammar Induction, Language Modeling | Code Available | 2 |
| MeaCap: Memory-Augmented Zero-shot Image Captioning | Mar 6, 2024 | Caption Generation, Image Captioning | Code Available | 2 |
| MiM-ISTD: Mamba-in-Mamba for Efficient Infrared Small Target Detection | Mar 4, 2024 | GPU, Mamba | Code Available | 2 |
| VNLP: Turkish NLP Package | Mar 2, 2024 | Morphological Analysis, named-entity-recognition | Code Available | 2 |
| Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective | Feb 22, 2024 | Hallucination, Sentence | Code Available | 2 |
| Advancing Large Language Models to Capture Varied Speaking Styles and Respond Properly in Spoken Conversations | Feb 20, 2024 | Sentence | Code Available | 2 |