| Kosmos-2: Grounding Multimodal Large Language Models to the World | Jun 26, 2023 | Image CaptioningIn-Context Learning | CodeCode Available | 1 |
| K-PLUG: KNOWLEDGE-INJECTED PRE-TRAINED LANGUAGE MODEL FOR NATURAL LANGUAGE UNDERSTANDING AND GENERATION | Jan 1, 2021 | ChatbotDecoder | CodeCode Available | 1 |
| Instruction Following without Instruction Tuning | Sep 21, 2024 | Instruction FollowingLanguage Modeling | CodeCode Available | 1 |
| Kungfupanda at SemEval-2020 Task 12: BERT-Based Multi-Task Learning for Offensive Language Detection | Apr 28, 2020 | Abuse DetectionLanguage Modeling | CodeCode Available | 1 |
| Efficient conformer: Progressive downsampling and grouped attention for automatic speech recognition | Aug 31, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| L2MAC: Large Language Model Automatic Computer for Extensive Code Generation | Oct 2, 2023 | Code GenerationLanguage Modeling | CodeCode Available | 1 |
| CBR-RAG: Case-Based Reasoning for Retrieval Augmented Generation in LLMs for Legal Question Answering | Apr 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| LabTOP: A Unified Model for Lab Test Outcome Prediction on Electronic Health Records | Feb 20, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| LAMBERT: Layout-Aware (Language) Modeling for information extraction | Feb 19, 2020 | Key Information ExtractionLanguage Modeling | CodeCode Available | 1 |
| Efficient Content-Based Sparse Attention with Routing Transformers | Mar 12, 2020 | Image GenerationLanguage Modeling | CodeCode Available | 1 |
| Effective Use of Graph Convolution Network and Contextual Sub-Tree forCommodity News Event Extraction | Sep 27, 2021 | Event ExtractionLanguage Modeling | CodeCode Available | 1 |
| LAMP: Leveraging Language Prompts for Multi-person Pose Estimation | Jul 21, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| CB-Conformer: Contextual biasing Conformer for biased word recognition | Apr 19, 2023 | Automatic Speech RecognitionLanguage Modeling | CodeCode Available | 1 |
| LANCE: Stress-testing Visual Models by Generating Language-guided Counterfactual Images | May 30, 2023 | counterfactualLanguage Modeling | CodeCode Available | 1 |
| Language Conditioned Traffic Generation | Jul 16, 2023 | DecoderLanguage Modeling | CodeCode Available | 1 |
| Language-enhanced RNR-Map: Querying Renderable Neural Radiance Field maps with natural language | Aug 17, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Language Guided Visual Question Answering: Elevate Your Multimodal Language Model Using Knowledge-Enriched Prompts | Oct 31, 2023 | Image CaptioningLanguage Modeling | CodeCode Available | 1 |
| COMET: Learning Cardinality Constrained Mixture of Experts with Trees and Local Search | Jun 5, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Effective Use of Graph Convolution Network and Contextual Sub-Tree for Commodity News Event Extraction | Nov 1, 2021 | Event ExtractionLanguage Modeling | CodeCode Available | 1 |
| A Reference-less Quality Metric for Automatic Speech Recognition via Contrastive-Learning of a Multi-Language Model with Self-Supervision | Jun 21, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| Effective Sequence-to-Sequence Dialogue State Tracking | Aug 31, 2021 | Dialogue State TrackingLanguage Modeling | CodeCode Available | 1 |
| CommitBERT: Commit Message Generation Using Pre-Trained Programming Language Model | Aug 1, 2021 | DecoderLanguage Modeling | CodeCode Available | 1 |
| Effect of Pre-Training Scale on Intra- and Inter-Domain Full and Few-Shot Transfer Learning for Natural and Medical X-Ray Chest Images | May 31, 2021 | Few-Shot LearningImage Classification | CodeCode Available | 1 |
| Efficient Long Sequence Modeling via State Space Augmented Transformer | Dec 15, 2022 | Computational EfficiencyDecoder | CodeCode Available | 1 |
| Common Sense Enhanced Knowledge-based Recommendation with Large Language Model | Mar 27, 2024 | Common Sense ReasoningKnowledge Graphs | CodeCode Available | 1 |
| Continued Pretraining for Better Zero- and Few-Shot Promptability | Oct 19, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| ConTEXTual Net: A Multimodal Vision-Language Model for Segmentation of Pneumothorax | Mar 2, 2023 | DescriptiveImage Captioning | CodeCode Available | 1 |
| Language modeling via stochastic processes | Mar 21, 2022 | Contrastive LearningLanguage Modeling | CodeCode Available | 1 |
| Language Modeling with Gated Convolutional Networks | Dec 23, 2016 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Causal Structure Learning Supervised by Large Language Model | Nov 20, 2023 | Causal DiscoveryCausal Inference | CodeCode Available | 1 |
| A Refer-and-Ground Multimodal Large Language Model for Biomedicine | Jun 26, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Language Models are Universal Embedders | Oct 12, 2023 | Code SearchLanguage Modeling | CodeCode Available | 1 |
| Effective Attention Sheds Light On Interpretability | May 18, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Are Emergent Abilities in Large Language Models just In-Context Learning? | Sep 4, 2023 | In-Context LearningInstruction Following | CodeCode Available | 1 |
| CausalLM is not optimal for in-context learning | Aug 14, 2023 | In-Context LearningLanguage Modeling | CodeCode Available | 1 |
| Language Models Encode the Value of Numbers Linearly | Jan 8, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Effective Batching for Recurrent Neural Network Grammars | May 31, 2021 | GPULanguage Modeling | CodeCode Available | 1 |
| LARCH: Large Language Model-based Automatic Readme Creation with Heuristics | Aug 6, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Causal Language Modeling Can Elicit Search and Reasoning Capabilities on Logic Puzzles | Sep 16, 2024 | Causal Language ModelingLanguage Modeling | CodeCode Available | 1 |
| Compressed Context Memory For Online Language Model Interaction | Dec 6, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Effective Human-AI Teams via Learned Natural Language Rules and Onboarding | Nov 2, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| EMMA: Efficient Visual Alignment in Multi-Modal LLMs | Oct 2, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Evaluating Language Model Context Windows: A "Working Memory" Test and Inference-time Correction | Jul 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Fine-Tuning Discrete Diffusion Models with Policy Gradient Methods | Feb 3, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Dynamic Programming in Rank Space: Scaling Structured Inference with Low-Rank HMMs and PCFGs | May 1, 2022 | Constituency Grammar InductionLanguage Modeling | CodeCode Available | 1 |
| EconAgent: Large Language Model-Empowered Agents for Simulating Macroeconomic Activities | Oct 16, 2023 | Decision MakingLanguage Modeling | CodeCode Available | 1 |
| DziriBERT: a Pre-trained Language Model for the Algerian Dialect | Sep 25, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| EarthMarker: A Visual Prompting Multi-modal Large Language Model for Remote Sensing | Jul 18, 2024 | Instruction FollowingLanguage Modeling | CodeCode Available | 1 |
| Causal Distillation for Language Models | Dec 5, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Causal Discovery with Language Models as Imperfect Experts | Jul 5, 2023 | Causal DiscoveryDecision Making | CodeCode Available | 1 |