| Efficient Hierarchical Domain Adaptation for Pretrained Language Models | Dec 16, 2021 | Domain AdaptationLanguage Modeling | CodeCode Available | 1 |
| Value Retrieval with Arbitrary Queries for Form-like Documents | Dec 15, 2021 | document understandingForm | CodeCode Available | 1 |
| Improving Conversational Recommendation Systems' Quality with Context-Aware Item Meta Information | Dec 15, 2021 | Conversational RecommendationLanguage Modeling | CodeCode Available | 1 |
| Deciphering antibody affinity maturation with language models and weakly supervised learning | Dec 14, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Step-unrolled Denoising Autoencoders for Text Generation | Dec 13, 2021 | DenoisingLanguage Modeling | CodeCode Available | 1 |
| MAGMA -- Multimodal Augmentation of Generative Models through Adapter-based Finetuning | Dec 9, 2021 | In-Context LearningLanguage Modeling | CodeCode Available | 1 |
| MLP Architectures for Vision-and-Language Modeling: An Empirical Study | Dec 8, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Zero-Shot Recommendation as Language Modeling | Dec 8, 2021 | Collaborative FilteringLanguage Modeling | CodeCode Available | 1 |
| Quantifying Adaptability in Pre-trained Language Models with 500 Tasks | Dec 6, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Causal Distillation for Language Models | Dec 5, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Siamese BERT-based Model for Web Search Relevance Ranking Evaluated on a New Czech Dataset | Dec 3, 2021 | Document RankingLanguage Modeling | CodeCode Available | 1 |
| InfoLM: A New Metric to Evaluate Summarization & Data2Text Generation | Dec 2, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Pixelated Butterfly: Simple and Efficient Sparse training for Neural Network Models | Nov 30, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| ZeroCap: Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic | Nov 29, 2021 | Contrastive LearningDescriptive | CodeCode Available | 1 |
| A Simple Long-Tailed Recognition Baseline via Vision-Language Model | Nov 29, 2021 | Contrastive LearningLanguage Modeling | CodeCode Available | 1 |
| Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-Language Model | Nov 26, 2021 | Image ManipulationLanguage Modeling | CodeCode Available | 1 |
| UniTAB: Unifying Text and Box Outputs for Grounded Vision-Language Modeling | Nov 23, 2021 | Image CaptioningImage Description | CodeCode Available | 1 |
| Enhancing Multilingual Language Model with Massive Multilingual Knowledge Triples | Nov 22, 2021 | Knowledge GraphsLanguage Modeling | CodeCode Available | 1 |
| RoBERTuito: a pre-trained language model for social media text in Spanish | Nov 18, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| iBOT: Image BERT Pre-Training with Online Tokenizer | Nov 15, 2021 | image-classificationImage Classification | CodeCode Available | 1 |
| NLP From Scratch Without Large-Scale Pretraining: A Simple and Efficient Framework | Nov 7, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Tip-Adapter: Training-free CLIP-Adapter for Better Vision-Language Modeling | Nov 6, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| A Pilot Study for BERT Language Modelling and Morphological Analysis for Ancient and Medieval Greek | Nov 1, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| AESOP: Paraphrase Generation with Adaptive Syntactic Control | Nov 1, 2021 | Data AugmentationLanguage Modeling | CodeCode Available | 1 |
| With a Little Help from my Temporal Context: Multimodal Egocentric Action Recognition | Nov 1, 2021 | Action RecognitionLanguage Modeling | CodeCode Available | 1 |
| A Model of Cross-Lingual Knowledge-Grounded Response Generation for Open-Domain Dialogue Systems | Nov 1, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Less is More: Pretrain a Strong Siamese Encoder for Dense Text Retrieval Using a Weak Decoder | Nov 1, 2021 | DecoderLanguage Modeling | CodeCode Available | 1 |
| Small Data? No Problem! Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages | Nov 1, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| AttentionRank: Unsupervised Keyphrase Extraction using Self and Cross Attentions | Nov 1, 2021 | Keyphrase ExtractionLanguage Modeling | CodeCode Available | 1 |
| Effective Use of Graph Convolution Network and Contextual Sub-Tree for Commodity News Event Extraction | Nov 1, 2021 | Event ExtractionLanguage Modeling | CodeCode Available | 1 |
| TSDAE: Using Transformer-based Sequential Denoising Auto-Encoderfor Unsupervised Sentence Embedding Learning | Nov 1, 2021 | DenoisingDomain Adaptation | CodeCode Available | 1 |
| Efficiently Modeling Long Sequences with Structured State Spaces | Oct 31, 2021 | Data AugmentationLanguage Modeling | CodeCode Available | 1 |
| Top1 Solution of QQ Browser 2021 Ai Algorithm Competition Track 1 : Multimodal Video Similarity | Oct 30, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Scatterbrain: Unifying Sparse and Low-rank Attention Approximation | Oct 28, 2021 | Image GenerationLanguage Modeling | CodeCode Available | 1 |
| ÚFAL at MultiLexNorm 2021: Improving Multilingual Lexical Normalization by Fine-tuning ByT5 | Oct 28, 2021 | Dependency ParsingLanguage Modeling | CodeCode Available | 1 |
| Discovering Non-monotonic Autoregressive Orderings with Variational Inference | Oct 27, 2021 | DecoderImage Captioning | CodeCode Available | 1 |
| Deciphering the Language of Nature: A transformer-based language model for deleterious mutations in proteins | Oct 27, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Hierarchical Transformers Are More Efficient Language Models | Oct 26, 2021 | Image GenerationLanguage Modeling | CodeCode Available | 1 |
| AVocaDo: Strategy for Adapting Vocabulary to Downstream Domain | Oct 26, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Spanish Legalese Language Model and Corpora | Oct 23, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| ClimateBert: A Pretrained Language Model for Climate-Related Text | Oct 22, 2021 | ArticlesFact Checking | CodeCode Available | 1 |
| LMSOC: An Approach for Socially Sensitive Pretraining | Oct 20, 2021 | Cloze TestGraph Representation Learning | CodeCode Available | 1 |
| Training Deep Neural Networks with Adaptive Momentum Inspired by the Quadratic Optimization | Oct 18, 2021 | BIG-bench Machine Learningimage-classification | CodeCode Available | 1 |
| GNN-LM: Language Modeling based on Global Contexts via GNN | Oct 17, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models | Oct 16, 2021 | counterfactualData Augmentation | CodeCode Available | 1 |
| Improving Transformers with Probabilistic Attention Keys | Oct 16, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Hydra: A System for Large Multi-Model Deep Learning | Oct 16, 2021 | Deep LearningGPU | CodeCode Available | 1 |
| Invariant Language Modeling | Oct 16, 2021 | Domain GeneralizationLanguage Modeling | CodeCode Available | 1 |
| A Good Prompt Is Worth Millions of Parameters: Low-resource Prompt-based Learning for Vision-Language Models | Oct 16, 2021 | Image CaptioningLanguage Modeling | CodeCode Available | 1 |
| Coherence boosting: When your pretrained language model is not paying enough attention | Oct 15, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |