MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models Feb 2, 2024 Language Modelling Large Language Model
Code Code Available 15 Dealing with Typos for BERT-based Passage Retrieval and Ranking Aug 27, 2021 Information Retrieval Language Modeling
Code Code Available 15 MAgIC: Investigation of Large Language Model Powered Multi-Agent in Cognition, Adaptability, Rationality and Collaboration Nov 14, 2023 Benchmarking Language Modeling
Code Code Available 15 LasUIE: Unifying Information Extraction with Latent Adaptive Structure-aware Generative Language Model Apr 13, 2023 Language Modeling Language Modelling
Code Code Available 15 Counterfactual Token Generation in Large Language Models Sep 25, 2024 Bias Detection counterfactual
Code Code Available 15 The Woman Worked as a Babysitter: On Biases in Language Generation Sep 3, 2019 Language Modeling Language Modelling
Code Code Available 15 SentenceMIM: A Latent Variable Language Model Feb 18, 2020 Language Modeling Language Modelling
Code Code Available 15 Machine learning as a model for cultural learning: Teaching an algorithm what it means to be fat Mar 24, 2020 Articles Cultural Vocal Bursts Intensity Prediction
Code Code Available 15 MAGIC: Generating Self-Correction Guideline for In-Context Text-to-SQL Jun 18, 2024 Language Modeling Language Modelling
Code Code Available 15 Latin BERT: A Contextual Language Model for Classical Philology Sep 21, 2020 Language Modeling Language Modelling
Code Code Available 15 M^3GPT: An Advanced Multimodal, Multitask Framework for Motion Comprehension and Generation May 25, 2024 Language Modeling Language Modelling
Code Code Available 15 M-ABSA: A Multilingual Dataset for Aspect-Based Sentiment Analysis Feb 17, 2025 Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA)
Code Code Available 15 Automatic Label Sequence Generation for Prompting Sequence-to-sequence Models Sep 20, 2022 Few-Shot Learning Language Modeling
Code Code Available 15 Data-to-Text Generation with Iterative Text Editing Nov 3, 2020 Data-to-Text Generation Domain Adaptation
Code Code Available 15 Debiasing Methods in Natural Language Understanding Make Bias More Accessible Sep 9, 2021 Language Modeling Language Modelling
Code Code Available 15 Making AI Less "Thirsty": Uncovering and Addressing the Secret Water Footprint of AI Models Apr 6, 2023 Language Modeling Language Modelling
Code Code Available 15 Luna: Linear Unified Nested Attention Jun 3, 2021 Language Modeling Language Modelling
Code Code Available 15 Learning distributed representations of graphs with Geo2DR Mar 12, 2020 GPU Graph Classification
Code Code Available 15 Lumen: Unleashing Versatile Vision-Centric Capabilities of Large Multimodal Models Mar 12, 2024 Concept Alignment Instruction Following
Code Code Available 15 CoVR-2: Automatic Data Construction for Composed Video Retrieval Aug 28, 2023 Composed Image Retrieval (CoIR) Composed Video Retrieval (CoVR)
Code Code Available 15 LeaPformer: Enabling Linear Transformers for Autoregressive and Simultaneous Tasks via Learned Proportions May 18, 2024 Language Modeling Language Modelling
Code Code Available 15 LXMERT: Learning Cross-Modality Encoder Representations from Transformers Aug 20, 2019 Language Modeling Language Modelling
Code Code Available 15 BERT got a Date: Introducing Transformers to Temporal Tagging Sep 30, 2021 Classification Decoder
Code Code Available 15 CPLLM: Clinical Prediction with Large Language Models Sep 20, 2023 Disease Prediction Language Modeling
Code Code Available 15 LUMA: A Benchmark Dataset for Learning from Uncertain and Multimodal Data Jun 14, 2024 Benchmarking Decision Making
Code Code Available 15 CPM: A Large-scale Generative Chinese Pre-trained Language Model Dec 1, 2020 Cloze Test Language Modeling
Code Code Available 15 LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT Jun 29, 2023 Automatic Lyrics Transcription Language Modeling
Code Code Available 15 BERT Goes Shopping: Comparing Distributional Models for Product Representations Dec 17, 2020 Language Modelling Product Recommendation
Code Code Available 15 MGeo: Multi-Modal Geographic Pre-Training Method Jan 11, 2023 Language Modelling
Code Code Available 15 CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation Sep 13, 2021 Decoder Denoising
Code Code Available 15 An Empirical Study of Metrics to Measure Representational Harms in Pre-Trained Language Models Jan 22, 2023 Language Modeling Language Modelling
Code Code Available 15 A Multimodal In-Context Tuning Approach for E-Commerce Product Description Generation Feb 21, 2024 Diversity In-Context Learning
Code Code Available 15 Data Movement Is All You Need: A Case Study on Optimizing Transformers Jun 30, 2020 All Language Modelling
Code Code Available 15 LUKE: Deep Contextualized Entity Representations with Entity-aware Self-attention Oct 2, 2020 Common Sense Reasoning Entity Typing
Code Code Available 15 M2D2: A Massively Multi-domain Language Modeling Dataset Oct 13, 2022 Domain Adaptation Domain Generalization
Code Code Available 15 Data Efficient Masked Language Modeling for Vision and Language Sep 5, 2021 Language Modeling Language Modelling
Code Code Available 15 LQ-LoRA: Low-rank Plus Quantized Matrix Decomposition for Efficient Language Model Finetuning Nov 20, 2023 GPU Language Modeling
Code Code Available 15 CrAM: A Compression-Aware Minimizer Jul 28, 2022 GPU Image Classification
Code Code Available 15 Low-Rank Adapting Models for Sparse Autoencoders Jan 31, 2025 Language Modeling Language Modelling
Code Code Available 15 Learning Compact Metrics for MT Oct 12, 2021 Cross-Lingual Transfer Language Modeling
Code Code Available 15 Automatic Prompt Augmentation and Selection with Chain-of-Thought from Labeled Data Feb 24, 2023 Arithmetic Reasoning Language Modelling
Code Code Available 15 TOD-BERT: Pre-trained Natural Language Understanding for Task-Oriented Dialogue Apr 15, 2020 Dialogue State Tracking Intent Detection
Code Code Available 15 LSBert: A Simple Framework for Lexical Simplification Jun 25, 2020 Language Modeling Language Modelling
Code Code Available 15 LoRAShear: Efficient Large Language Model Structured Pruning and Knowledge Recovery Oct 24, 2023 GPU Language Modeling
Code Code Available 15 Data Augmentation using Pre-trained Transformer Models Mar 4, 2020 Data Augmentation Diversity
Code Code Available 15 Learning Domain Invariant Prompt for Vision-Language Models Dec 8, 2022 Domain Generalization Language Modelling
Code Code Available 15 Learning Fine-Grained Visual Understanding for Video Question Answering via Decoupling Spatial-Temporal Modeling Oct 8, 2022 Language Modeling Language Modelling
Code Code Available 15 Top1 Solution of QQ Browser 2021 Ai Algorithm Competition Track 1 : Multimodal Video Similarity Oct 30, 2021 Language Modeling Language Modelling
Code Code Available 15 Sample Efficient Reinforcement Learning via Large Vision Language Model Distillation May 16, 2025 Decision Making Language Modeling
Code Code Available 15 Lossless Acceleration of Large Language Model via Adaptive N-gram Parallel Decoding Apr 10, 2024 GPU Language Modeling
Code Code Available 15