| Pay Less Attention with Lightweight and Dynamic Convolutions | Jan 29, 2019 | Abstractive Text SummarizationLanguage Modeling | CodeCode Available | 1 | 5 |
| LEMMA: Towards LVLM-Enhanced Multimodal Misinformation Detection with External Knowledge Augmentation | Feb 19, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Fill in the BLANC: Human-free quality estimation of document summaries | Feb 23, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| AudioBERT: Audio Knowledge Augmented Language Model | Sep 12, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| PeptideBERT: A Language Model based on Transformers for Peptide Property Prediction | Aug 28, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| "Yes, My LoRD." Guiding Language Model Extraction with Locality Reinforced Distillation | Sep 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| FFAA: Multimodal Large Language Model based Explainable Open-World Face Forgery Analysis Assistant | Aug 19, 2024 | DescriptiveFace Swapping | CodeCode Available | 1 | 5 |
| Permutation Equivariant Models for Compositional Generalization in Language | May 1, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| PermuteFormer: Efficient Relative Position Encoding for Long Sequences | Sep 6, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| UniTAB: Unifying Text and Box Outputs for Grounded Vision-Language Modeling | Nov 23, 2021 | Image CaptioningImage Description | CodeCode Available | 1 | 5 |
| Filtering Noisy Parallel Corpus using Transformers with Proxy Task Learning | Nov 1, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Confidence Estimation for Attention-based Sequence-to-sequence Models for Speech Recognition | Oct 22, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 | 5 |
| Fine-grained Audible Video Description | Mar 27, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Learning to Prompt for Open-Vocabulary Object Detection with Vision-Language Model | Mar 28, 2022 | image-classificationImage Classification | CodeCode Available | 1 | 5 |
| CrowdCLIP: Unsupervised Crowd Counting via Vision-Language Model | Apr 9, 2023 | Cross-Part Crowd CountingCrowd Counting | CodeCode Available | 1 | 5 |
| Finding Universal Grammatical Relations in Multilingual BERT | May 9, 2020 | Cross-Lingual TransferLanguage Modeling | CodeCode Available | 1 | 5 |
| Configurable Safety Tuning of Language Models with Synthetic Preference Data | Mar 30, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| 3UR-LLM: An End-to-End Multimodal Large Language Model for 3D Scene Understanding | Jan 14, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Learning to engineer protein flexibility | Dec 24, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Learning To Retrieve Prompts for In-Context Learning | Dec 16, 2021 | In-Context LearningLanguage Modeling | CodeCode Available | 1 | 5 |
| ConfliBERT: A Language Model for Political Conflict | Dec 19, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Length Generalization of Causal Transformers without Position Encoding | Apr 18, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Conformal Alignment: Knowing When to Trust Foundation Models with Guarantees | May 16, 2024 | Decision MakingInformativeness | CodeCode Available | 1 | 5 |
| Less is More: Pretrain a Strong Siamese Encoder for Dense Text Retrieval Using a Weak Decoder | Nov 1, 2021 | DecoderLanguage Modeling | CodeCode Available | 1 | 5 |
| L-GreCo: Layerwise-Adaptive Gradient Compression for Efficient and Accurate Deep Learning | Oct 31, 2022 | image-classificationImage Classification | CodeCode Available | 1 | 5 |