| LexMAE: Lexicon-Bottlenecked Pretraining for Large-Scale Retrieval | Aug 31, 2022 | CPUDecoder | CodeCode Available | 1 | 5 |
| Cross-Platform Video Person ReID: A New Benchmark Dataset and Adaptation Approach | Aug 14, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Few-shot Reranking for Multi-hop QA via Language Model Prompting | May 25, 2022 | Language ModelingOpen-Domain Question Answering | CodeCode Available | 1 | 5 |
| Cross-Thought for Sentence Encoder Pre-training | Oct 7, 2020 | Information RetrievalLanguage Modeling | CodeCode Available | 1 | 5 |
| Cross-model Control: Improving Multiple Large Language Models in One-time Training | Oct 23, 2024 | Instruction FollowingLanguage Modeling | CodeCode Available | 1 | 5 |
| ArcGPT: A Large Language Model Tailored for Real-world Archival Applications | Jul 27, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Compress to Impress: Unleashing the Potential of Compressive Memory in Real-World Long-Term Conversations | Feb 19, 2024 | ChatbotLanguage Modeling | CodeCode Available | 1 | 5 |
| RARR: Researching and Revising What Language Models Say, Using Language Models | Oct 17, 2022 | Few-Shot LearningLanguage Modeling | CodeCode Available | 1 | 5 |
| Attribution Analysis Meets Model Editing: Advancing Knowledge Correction in Vision Language Models with VisEdit | Aug 19, 2024 | DecoderLanguage Modeling | CodeCode Available | 1 | 5 |
| Parameter-Efficient Conversational Recommender System as a Language Processing Task | Jan 25, 2024 | Dialogue GenerationKnowledge Graphs | CodeCode Available | 1 | 5 |
| Cascade Speculative Drafting for Even Faster LLM Inference | Dec 18, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| FATA-Trans: Field And Time-Aware Transformer for Sequential Tabular Data | Oct 20, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| AttributionBench: How Hard is Automatic Attribution Evaluation? | Feb 23, 2024 | Binary ClassificationLanguage Modeling | CodeCode Available | 1 | 5 |
| Caution for the Environment: Multimodal Agents are Susceptible to Environmental Distractions | Aug 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Cross-View Language Modeling: Towards Unified Cross-Lingual Cross-Modal Pre-training | Jun 1, 2022 | Contrastive LearningCross-Lingual Transfer | CodeCode Available | 1 | 5 |
| ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text Translation | May 24, 2023 | GPULanguage Modeling | CodeCode Available | 1 | 5 |
| FedJudge: Federated Legal Large Language Model | Sep 15, 2023 | Continual LearningFederated Learning | CodeCode Available | 1 | 5 |
| Parsing as Pretraining | Feb 5, 2020 | Dependency ParsingLanguage Modeling | CodeCode Available | 1 | 5 |
| LESA: Linguistic Encapsulation and Semantic Amalgamation Based Generalised Claim Detection from Online Content | Jan 28, 2021 | Argument MiningLanguage Modeling | CodeCode Available | 1 | 5 |
| Cascaded Head-colliding Attention | May 31, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Legilimens: Practical and Unified Content Moderation for Large Language Model Services | Aug 28, 2024 | Data AugmentationLanguage Modeling | CodeCode Available | 1 | 5 |
| LEIA: Facilitating Cross-lingual Knowledge Transfer in Language Models with Entity-based Data Augmentation | Feb 18, 2024 | Cross-Lingual TransferData Augmentation | CodeCode Available | 1 | 5 |
| Align-KD: Distilling Cross-Modal Alignment Knowledge for Mobile Vision-Language Model | Dec 2, 2024 | cross-modal alignmentKnowledge Distillation | CodeCode Available | 1 | 5 |
| Learning to Speak from Text: Zero-Shot Multilingual Text-to-Speech with Unsupervised Text Pretraining | Jan 30, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| A Good Prompt Is Worth Millions of Parameters: Low-resource Prompt-based Learning for Vision-Language Models | Oct 16, 2021 | Image CaptioningLanguage Modeling | CodeCode Available | 1 | 5 |
| Pay Less Attention with Lightweight and Dynamic Convolutions | Jan 29, 2019 | Abstractive Text SummarizationLanguage Modeling | CodeCode Available | 1 | 5 |
| LEMMA: Towards LVLM-Enhanced Multimodal Misinformation Detection with External Knowledge Augmentation | Feb 19, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Fill in the BLANC: Human-free quality estimation of document summaries | Feb 23, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| AudioBERT: Audio Knowledge Augmented Language Model | Sep 12, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| PeptideBERT: A Language Model based on Transformers for Peptide Property Prediction | Aug 28, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| "Yes, My LoRD." Guiding Language Model Extraction with Locality Reinforced Distillation | Sep 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| FFAA: Multimodal Large Language Model based Explainable Open-World Face Forgery Analysis Assistant | Aug 19, 2024 | DescriptiveFace Swapping | CodeCode Available | 1 | 5 |
| Permutation Equivariant Models for Compositional Generalization in Language | May 1, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| PermuteFormer: Efficient Relative Position Encoding for Long Sequences | Sep 6, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| UniTAB: Unifying Text and Box Outputs for Grounded Vision-Language Modeling | Nov 23, 2021 | Image CaptioningImage Description | CodeCode Available | 1 | 5 |
| Filtering Noisy Parallel Corpus using Transformers with Proxy Task Learning | Nov 1, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Confidence Estimation for Attention-based Sequence-to-sequence Models for Speech Recognition | Oct 22, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 | 5 |
| Fine-grained Audible Video Description | Mar 27, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Learning to Prompt for Open-Vocabulary Object Detection with Vision-Language Model | Mar 28, 2022 | image-classificationImage Classification | CodeCode Available | 1 | 5 |
| CrowdCLIP: Unsupervised Crowd Counting via Vision-Language Model | Apr 9, 2023 | Cross-Part Crowd CountingCrowd Counting | CodeCode Available | 1 | 5 |
| Finding Universal Grammatical Relations in Multilingual BERT | May 9, 2020 | Cross-Lingual TransferLanguage Modeling | CodeCode Available | 1 | 5 |
| Configurable Safety Tuning of Language Models with Synthetic Preference Data | Mar 30, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| 3UR-LLM: An End-to-End Multimodal Large Language Model for 3D Scene Understanding | Jan 14, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Learning to engineer protein flexibility | Dec 24, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Learning To Retrieve Prompts for In-Context Learning | Dec 16, 2021 | In-Context LearningLanguage Modeling | CodeCode Available | 1 | 5 |
| ConfliBERT: A Language Model for Political Conflict | Dec 19, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Length Generalization of Causal Transformers without Position Encoding | Apr 18, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Conformal Alignment: Knowing When to Trust Foundation Models with Guarantees | May 16, 2024 | Decision MakingInformativeness | CodeCode Available | 1 | 5 |
| Less is More: Pretrain a Strong Siamese Encoder for Dense Text Retrieval Using a Weak Decoder | Nov 1, 2021 | DecoderLanguage Modeling | CodeCode Available | 1 | 5 |
| L-GreCo: Layerwise-Adaptive Gradient Compression for Efficient and Accurate Deep Learning | Oct 31, 2022 | image-classificationImage Classification | CodeCode Available | 1 | 5 |