| AttributionBench: How Hard is Automatic Attribution Evaluation? | Feb 23, 2024 | Binary Classification, Language Modeling | Code Available | 1 | 5 |
| Adaptive Attention Span in Computer Vision | Apr 18, 2020 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| CDLM: Cross-Document Language Modeling | Jan 2, 2021 | Citation Recommendation, Coreference Resolution | Code Available | 1 | 5 |
| Annotation-Efficient Preference Optimization for Language Model Alignment | May 22, 2024 | Diversity, Language Modeling | Code Available | 1 | 5 |
| Faster Causal Attention Over Large Sequences Through Sparse Flash Attention | Jun 1, 2023 | 16k, 8k | Code Available | 1 | 5 |
| Incorporating Large Language Models into Production Systems for Enhanced Task Automation and Flexibility | Jul 11, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Attribution Analysis Meets Model Editing: Advancing Knowledge Correction in Vision Language Models with VisEdit | Aug 19, 2024 | Decoder, Language Modeling | Code Available | 1 | 5 |
| RARR: Researching and Revising What Language Models Say, Using Language Models | Oct 17, 2022 | Few-Shot Learning, Language Modeling | Code Available | 1 | 5 |
| Fast-R2D2: A Pretrained Recursive Neural Network based on Pruned CKY for Grammar Induction and Text Representation | Mar 1, 2022 | Constituency Grammar Induction, Language Modeling | Code Available | 1 | 5 |
| Measuring Progress in Dictionary Learning for Language Model Interpretability with Board Game Models | Jul 31, 2024 | Dictionary Learning, Language Modeling | Code Available | 1 | 5 |
| Cross-Align: Modeling Deep Cross-lingual Interactions for Word Alignment | Oct 9, 2022 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Adaptive Attacks Break Defenses Against Indirect Prompt Injection Attacks on LLM Agents | Feb 27, 2025 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Cross-Care: Assessing the Healthcare Implications of Pre-training Data on Language Model Bias | May 9, 2024 | Data Visualization, Language Modeling | Code Available | 1 | 5 |
| VILA: Improving Structured Content Extraction from Scientific PDFs Using Visual Layout Groups | Jun 1, 2021 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Critic-Guided Decoding for Controlled Text Generation | Dec 21, 2022 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Fauno: The Italian Large Language Model that will leave you senza parole! | Jun 26, 2023 | GPU, Language Modeling | Code Available | 1 | 5 |
| FATA-Trans: Field And Time-Aware Transformer for Sequential Tabular Data | Oct 20, 2023 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Fast Vocabulary Transfer for Language Model Compression | Feb 15, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| In-Context Learning for Few-Shot Dialogue State Tracking | Mar 16, 2022 | Dialogue State Tracking, Few-Shot Learning | Code Available | 1 | 5 |
| CriticEval: Evaluating Large Language Model as Critic | Feb 21, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| In-context Autoencoder for Context Compression in a Large Language Model | Jul 13, 2023 | GPU, Language Modeling | Code Available | 1 | 5 |
| Few-Shot Fine-Grained Entity Typing with Automatic Label Interpretation and Instance Generation | Jun 28, 2022 | Entity Typing, Language Modeling | Code Available | 1 | 5 |
| Bot or Human? Detecting ChatGPT Imposters with A Single Question | May 10, 2023 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| AfroLM: A Self-Active Learning-based Multilingual Pretrained Language Model for 23 African Languages | Nov 7, 2022 | Active Learning, Language Modeling | Code Available | 1 | 5 |
| In-Context Learning with Many Demonstration Examples | Feb 9, 2023 | 16k, 8k | Code Available | 1 | 5 |
| AttentionRank: Unsupervised Keyphrase Extraction using Self and Cross Attentions | Nov 1, 2021 | Keyphrase Extraction, Language Modeling | Code Available | 1 | 5 |
| CRE-LLM: A Domain-Specific Chinese Relation Extraction Framework with Fine-tuned Large Language Model | Apr 28, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| CreoPep: A Universal Deep Learning Framework for Target-Specific Peptide Design and Optimization | May 5, 2025 | Diversity, Language Modeling | Code Available | 1 | 5 |
| CREAM: Consistency Regularized Self-Rewarding Language Models | Oct 16, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Generating Query Focused Summaries from Query-Free Resources | Dec 29, 2020 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| CTRLEval: An Unsupervised Reference-Free Metric for Evaluating Controlled Text Generation | Apr 2, 2022 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning | Aug 8, 2023 | In-Context Learning, Language Modeling | Code Available | 1 | 5 |
| FFAA: Multimodal Large Language Model based Explainable Open-World Face Forgery Analysis Assistant | Aug 19, 2024 | Descriptive, Face Swapping | Code Available | 1 | 5 |
| In-context Pretraining: Language Modeling Beyond Document Boundaries | Oct 16, 2023 | In-Context Learning, Language Modeling | Code Available | 1 | 5 |
| A Fully Differentiable Beam Search Decoder | Feb 16, 2019 | Decoder, Language Modeling | Code Available | 1 | 5 |
| Brain-to-Text Benchmark '24: Lessons Learned | Dec 23, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| FiLM: Fill-in Language Models for Any-Order Generation | Oct 15, 2023 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Filtering Noisy Parallel Corpus using Transformers with Proxy Task Learning | Nov 1, 2020 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Brazilian Portuguese Speech Recognition Using Wav2vec 2.0 | Jul 23, 2021 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| BreakGPT: A Large Language Model with Multi-stage Structure for Financial Breakout Detection | Feb 12, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| TV-SAM: Increasing Zero-Shot Segmentation Performance on Multimodal Medical Images Using GPT-4 Generated Descriptive Prompts Without Human Annotation | Feb 24, 2024 | Descriptive, Language Modeling | Code Available | 1 | 5 |
| MotionLM: Multi-Agent Motion Forecasting as Language Modeling | Sep 28, 2023 | Autonomous Vehicles, Language Modeling | Code Available | 1 | 5 |
| InforMask: Unsupervised Informative Masking for Language Model Pretraining | Oct 21, 2022 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Inverse Materials Design by Large Language Model-Assisted Generative Framework | Feb 25, 2025 | Diversity, Language Modeling | Code Available | 1 | 5 |
| CPT: Efficient Deep Neural Network Training via Cyclic Precision | Jan 25, 2021 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Aligning Large Language Models through Synthetic Feedback | May 23, 2023 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Improving Transformer Optimization Through Better Initialization | Jan 1, 2020 | Decoder, Language Modeling | Code Available | 1 | 5 |
| FineRec: Exploring Fine-grained Sequential Recommendation | Apr 19, 2024 | Attribute, Diversity | Code Available | 1 | 5 |
| Breaking the HISCO Barrier: Automatic Occupational Standardization with OccCANINE | Feb 21, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation | Sep 13, 2021 | Decoder, Denoising | Code Available | 1 | 5 |