| Modular Retrieval for Generalization and Interpretation | Mar 23, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Contrastive Alignment of Vision to Language Through Parameter-Efficient Transfer Learning | Mar 21, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Neural Implicit Vision-Language Feature Fields | Mar 20, 2023 | Image SegmentationLanguage Modeling | CodeCode Available | 1 |
| Reinforcement Learning Friendly Vision-Language Model for Minecraft | Mar 19, 2023 | Contrastive LearningLanguage Modeling | CodeCode Available | 1 |
| CTRAN: CNN-Transformer-based Network for Natural Language Understanding | Mar 19, 2023 | DecoderIntent Detection | CodeCode Available | 1 |
| Trained on 100 million words and still in shape: BERT meets British National Corpus | Mar 17, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| CHAMPAGNE: Learning Real-world Conversation from Large-Scale Web Videos | Mar 17, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Jump to Conclusions: Short-Cutting Transformers With Linear Transformations | Mar 16, 2023 | Decision MakingLanguage Modeling | CodeCode Available | 1 |
| TypeT5: Seq2seq Type Inference using Static Analysis | Mar 16, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Large Language Model Is Not a Good Few-shot Information Extractor, but a Good Reranker for Hard Samples! | Mar 15, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| NL4Opt Competition: Formulating Optimization Problems Based on Their Natural Language Descriptions | Mar 14, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Can ChatGPT Replace Traditional KBQA Models? An In-depth Analysis of the Question Answering Performance of the GPT LLM Family | Mar 14, 2023 | Knowledge Base Question AnsweringLanguage Modeling | CodeCode Available | 1 |
| A comprehensive evaluation of ChatGPT's zero-shot Text-to-SQL capability | Mar 12, 2023 | Code GenerationLanguage Modeling | CodeCode Available | 1 |
| Iterative Few-shot Semantic Segmentation from Image Label Text | Mar 10, 2023 | Few-Shot Semantic SegmentationLanguage Modeling | CodeCode Available | 1 |
| Open-Ended Medical Visual Question Answering Through Prefix Tuning of Language Models | Mar 10, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| A Multi-Grained Self-Interpretable Symbolic-Neural Model For Single/Multi-Labeled Text Classification | Mar 6, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| FAME-ViL: Multi-Tasking Vision-Language Model for Heterogeneous Fashion Tasks | Mar 4, 2023 | Cross-Modal RetrievalImage Captioning | CodeCode Available | 1 |
| ConZIC: Controllable Zero-shot Image Captioning by Sampling-Based Polishing | Mar 4, 2023 | DiversityImage Captioning | CodeCode Available | 1 |
| Prismer: A Vision-Language Model with Multi-Task Experts | Mar 4, 2023 | Few-Shot LearningImage Captioning | CodeCode Available | 1 |
| Investigating the Translation Performance of a Large Multilingual Language Model: the Case of BLOOM | Mar 3, 2023 | Cross-Lingual TransferLanguage Modeling | CodeCode Available | 1 |
| ConTEXTual Net: A Multimodal Vision-Language Model for Segmentation of Pneumothorax | Mar 2, 2023 | DescriptiveImage Captioning | CodeCode Available | 1 |
| GLM-Dialog: Noise-tolerant Pre-training for Knowledge-grounded Dialogue Generation | Feb 28, 2023 | Dialogue EvaluationDialogue Generation | CodeCode Available | 1 |
| BrainBERT: Self-supervised representation learning for intracranial recordings | Feb 28, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Pretraining De-Biased Language Model with Large-scale Click Logs for Document Ranking | Feb 27, 2023 | Document RankingInformation Retrieval | CodeCode Available | 1 |
| The ROOTS Search Tool: Data Transparency for LLMs | Feb 27, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Federated Learning for ASR based on Wav2vec 2.0 | Feb 20, 2023 | Federated LearningLanguage Modeling | CodeCode Available | 1 |
| SwitchPrompt: Learning Domain-Specific Gated Soft Prompts for Classification in Low-Resource Domains | Feb 14, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Guiding Pretraining in Reinforcement Learning with Large Language Models | Feb 13, 2023 | Common Sense ReasoningLanguage Modeling | CodeCode Available | 1 |
| The Wisdom of Hindsight Makes Language Models Better Instruction Followers | Feb 10, 2023 | Decision MakingLanguage Modeling | CodeCode Available | 1 |
| In-Context Learning with Many Demonstration Examples | Feb 9, 2023 | 16k8k | CodeCode Available | 1 |
| UDApter -- Efficient Domain Adaptation Using Adapters | Feb 7, 2023 | Domain AdaptationLanguage Modeling | CodeCode Available | 1 |
| Representation Deficiency in Masked Language Modeling | Feb 4, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| GLADIS: A General and Large Acronym Disambiguation Benchmark | Feb 3, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Bioformer: an efficient transformer language model for biomedical text mining | Feb 3, 2023 | ArticlesDocument Classification | CodeCode Available | 1 |
| Large Language Models Can Be Easily Distracted by Irrelevant Context | Jan 31, 2023 | Arithmetic ReasoningLanguage Modeling | CodeCode Available | 1 |
| Learning to Speak from Text: Zero-Shot Multilingual Text-to-Speech with Unsupervised Text Pretraining | Jan 30, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Call for Papers -- The BabyLM Challenge: Sample-efficient pretraining on a developmentally plausible corpus | Jan 27, 2023 | Language AcquisitionLanguage Modeling | CodeCode Available | 1 |
| SWARM Parallelism: Training Large Models Can Be Surprisingly Communication-Efficient | Jan 27, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Large Language Models Are Latent Variable Models: Explaining and Finding Good Demonstrations for In-Context Learning | Jan 27, 2023 | Few-Shot LearningGSM8K | CodeCode Available | 1 |
| Prompt-Based Editing for Text Style Transfer | Jan 27, 2023 | ClassificationLanguage Modeling | CodeCode Available | 1 |
| Domain-Agnostic Molecular Generation with Chemical Feedback | Jan 26, 2023 | Drug DesignLanguage Modeling | CodeCode Available | 1 |
| GPU-based Private Information Retrieval for On-Device Machine Learning Inference | Jan 26, 2023 | CPUGPU | CodeCode Available | 1 |
| ViDeBERTa: A powerful pre-trained language model for Vietnamese | Jan 25, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| ExaRanker: Explanation-Augmented Neural Ranker | Jan 25, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Lexi: Self-Supervised Learning of the UI Language | Jan 23, 2023 | Image RetrievalLanguage Modeling | CodeCode Available | 1 |
| DiffSDS: A language diffusion model for protein backbone inpainting under geometric conditions and constraints | Jan 22, 2023 | DecoderDenoising | CodeCode Available | 1 |
| An Empirical Study of Metrics to Measure Representational Harms in Pre-Trained Language Models | Jan 22, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Debiasing the Cloze Task in Sequential Recommendation with Bidirectional Transformers | Jan 22, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Batch Prompting: Efficient Inference with Large Language Model APIs | Jan 19, 2023 | Arithmetic ReasoningIn-Context Learning | CodeCode Available | 1 |
| CLIP the Gap: A Single Domain Generalization Approach for Object Detection | Jan 13, 2023 | Domain Generalizationimage-classification | CodeCode Available | 1 |