| Merging Feed-Forward Sublayers for Compressed Transformers | Jan 10, 2025 | image-classification, Image Classification | Code Available | 1 |
| Merging Text Transformer Models from Different Initializations | Mar 1, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| 3UR-LLM: An End-to-End Multimodal Large Language Model for 3D Scene Understanding | Jan 14, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| Meta-Adapter: An Online Few-shot Learner for Vision-Language Model | Nov 7, 2023 | Few-Shot Learning, image-classification | Code Available | 1 |
| MetaIE: Distilling a Meta Model from LLM for All Kinds of Information Extraction Tasks | Mar 30, 2024 | All, Language Modeling | Code Available | 1 |
| MetaLA: Unified Optimal Linear Approximation to Softmax Attention Map | Nov 16, 2024 | image-classification, Image Classification | Code Available | 1 |
| Controllable Dialogue Simulation with In-Context Learning | Oct 9, 2022 | Data Augmentation, In-Context Learning | Code Available | 1 |
| Efficient conformer: Progressive downsampling and grouped attention for automatic speech recognition | Aug 31, 2021 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| Empower Entity Set Expansion via Language Model Probing | Apr 29, 2020 | Language Modeling, Language Modelling | Code Available | 1 |
| CTRAN: CNN-Transformer-based Network for Natural Language Understanding | Mar 19, 2023 | Decoder, Intent Detection | Code Available | 1 |
| MF-LLM: Simulating Population Decision Dynamics via a Mean-Field Large Language Model Framework | Apr 30, 2025 | Decision Making, Language Modeling | Code Available | 1 |
| MG-MotionLLM: A Unified Framework for Motion Comprehension and Generation across Multiple Granularities | Apr 3, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| CTRL: A Conditional Transformer Language Model for Controllable Generation | Sep 11, 2019 | Language Modeling, Language Modelling | Code Available | 1 |
| ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context | May 7, 2020 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| Effective Attention Sheds Light On Interpretability | May 18, 2021 | Language Modeling, Language Modelling | Code Available | 1 |
| AraGPT2: Pre-Trained Transformer for Arabic Language Generation | Dec 31, 2020 | Articles, Language Modeling | Code Available | 1 |
| Effective Batching for Recurrent Neural Network Grammars | May 31, 2021 | GPU, Language Modeling | Code Available | 1 |
| Effective Human-AI Teams via Learned Natural Language Rules and Onboarding | Nov 2, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| AraELECTRA: Pre-Training Text Discriminators for Arabic Language Understanding | Dec 31, 2020 | Language Modeling, Language Modelling | Code Available | 1 |
| ECG-Byte: A Tokenizer for End-to-End Generative Electrocardiogram Language Modeling | Dec 18, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| EDA Corpus: A Large Language Model Dataset for Enhanced Interaction with OpenROAD | May 4, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| EarthMarker: A Visual Prompting Multi-modal Large Language Model for Remote Sensing | Jul 18, 2024 | Instruction Following, Language Modeling | Code Available | 1 |
| DziriBERT: a Pre-trained Language Model for the Algerian Dialect | Sep 25, 2021 | Language Modeling, Language Modelling | Code Available | 1 |
| EasyJudge: an Easy-to-use Tool for Comprehensive Response Evaluation of LLMs | Oct 13, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Dynamic Grained Encoder for Vision Transformers | Jan 10, 2023 | image-classification, Image Classification | Code Available | 1 |
| Mixer-TTS: non-autoregressive, fast and compact text-to-speech model conditioned on language model embeddings | Oct 7, 2021 | Language Modeling, Language Modelling | Code Available | 1 |
| AutoDIR: Automatic All-in-One Image Restoration with Latent Diffusion | Oct 16, 2023 | All, Image Quality Assessment | Code Available | 1 |
| Contrastive Learning for Prompt-Based Few-Shot Language Learners | May 3, 2022 | Contrastive Learning, In-Context Learning | Code Available | 1 |
| A Dynamic LLM-Powered Agent Network for Task-Oriented Agent Collaboration | Oct 3, 2023 | Arithmetic Reasoning, Code Generation | Code Available | 1 |
| Contextual information integration for stance detection via cross-attention | Nov 3, 2022 | Language Modeling, Language Modelling | Code Available | 1 |
| Contextualization Distillation from Large Language Model for Knowledge Graph Completion | Jan 28, 2024 | Articles, Knowledge Graph Completion | Code Available | 1 |
| MMBERT: Multimodal BERT Pretraining for Improved Medical VQA | Apr 3, 2021 | Language Modeling, Language Modelling | Code Available | 1 |
| 3D Visual Illusion Depth Estimation | May 19, 2025 | Common Sense Reasoning, Depth Estimation | Code Available | 1 |
| Contrastive Distillation on Intermediate Representations for Language Model Compression | Sep 29, 2020 | Knowledge Distillation, Language Modeling | Code Available | 1 |
| Dynamic Programming in Rank Space: Scaling Structured Inference with Low-Rank HMMs and PCFGs | May 1, 2022 | Constituency Grammar Induction, Language Modeling | Code Available | 1 |
| ECAMP: Entity-centered Context-aware Medical Vision Language Pre-training | Dec 20, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| Contextualized Perturbation for Textual Adversarial Attack | Sep 16, 2020 | Adversarial Attack, Language Modeling | Code Available | 1 |
| MMS-LLaMA: Efficient LLM-based Audio-Visual Speech Recognition with Minimal Multimodal Speech Tokens | Mar 14, 2025 | Audio-Visual Speech Recognition, Computational Efficiency | Code Available | 1 |
| Effective Sequence-to-Sequence Dialogue State Tracking | Aug 31, 2021 | Dialogue State Tracking, Language Modeling | Code Available | 1 |
| DUnE: Dataset for Unified Editing | Nov 27, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| Aggretriever: A Simple Approach to Aggregate Textual Representations for Robust Dense Passage Retrieval | Jul 31, 2022 | Knowledge Distillation, Language Modeling | Code Available | 1 |
| ModaVerse: Efficiently Transforming Modalities with LLMs | Jan 12, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Modelling Suspense in Short Stories as Uncertainty Reduction over Neural Representation | Apr 30, 2020 | Language Modeling, Language Modelling | Code Available | 1 |
| DuplexMamba: Enhancing Real-time Speech Conversations with Duplex and Streaming Capabilities | Feb 16, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| ConTEXTual Net: A Multimodal Vision-Language Model for Segmentation of Pneumothorax | Mar 2, 2023 | Descriptive, Image Captioning | Code Available | 1 |
| Contrastive Alignment of Vision to Language Through Parameter-Efficient Transfer Learning | Mar 21, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| ArabicMMLU: Assessing Massive Multitask Language Understanding in Arabic | Feb 20, 2024 | ArabicMMLU, Language Model Evaluation | Code Available | 1 |
| MedualTime: A Dual-Adapter Language Model for Medical Time Series-Text Multimodal Learning | Jun 7, 2024 | Contrastive Learning, Language Modeling | Code Available | 1 |
| DuSSS: Dual Semantic Similarity-Supervised Vision-Language Model for Semi-Supervised Medical Image Segmentation | Dec 17, 2024 | Contrastive Learning, Image Segmentation | Code Available | 1 |
| Dual-Alignment Pre-training for Cross-lingual Sentence Embedding | May 16, 2023 | Language Modeling, Language Modelling | Code Available | 1 |