| REVEAL: Retrieval-Augmented Visual-Language Pre-Training with Multi-Source Multimodal Knowledge Memory | Dec 10, 2022 | Image CaptioningLanguage Modeling | CodeCode Available | 0 |
| Structured information extraction from complex scientific text with fine-tuned large language models | Dec 10, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Uniform Masking Prevails in Vision-Language Pretraining | Dec 10, 2022 | Image-text matchingLanguage Modeling | —Unverified | 0 |
| A Unified Knowledge Graph Augmentation Service for Boosting Domain-specific NLP Tasks | Dec 10, 2022 | Knowledge GraphsLanguage Modeling | —Unverified | 0 |
| Artificial Text Detection with Multiple Training Strategies | Dec 10, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Elixir: Train a Large Language Model on a Small GPU Cluster | Dec 10, 2022 | CPUGPU | CodeCode Available | 7 |
| From Cloze to Comprehension: Retrofitting Pre-trained Masked Language Model to Pre-trained Machine Reader | Dec 9, 2022 | ClassificationExtractive Question-Answering | CodeCode Available | 0 |
| Structured Like a Language Model: Analysing AI as an Automated Subject | Dec 8, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| SpeechLMScore: Evaluating speech generation using speech language model | Dec 8, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Implicit causality in GPT-2: a case study | Dec 8, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Generative Approach for Script Event Prediction via Contrastive Fine-tuning | Dec 7, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| G-MAP: General Memory-Augmented Pre-trained Language Model for Domain Tasks | Dec 7, 2022 | General KnowledgeLanguage Modeling | CodeCode Available | 0 |
| Pre-Training With Scientific Text Improves Educational Question Generation | Dec 7, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Pivotal Role of Language Modeling in Recommender Systems: Enriching Task-specific and Task-agnostic Representation Learning | Dec 7, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Self-Supervised Audio-Visual Speech Representations Learning By Multimodal Self-Distillation | Dec 6, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CySecBERT: A Domain-Adapted Language Model for the Cybersecurity Domain | Dec 6, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ADIR: Adaptive Diffusion for Image Reconstruction | Dec 6, 2022 | DeblurringDenoising | —Unverified | 0 |
| PØDA: Prompt-driven Zero-shot Domain Adaptation | Dec 6, 2022 | Domain Adaptationimage-classification | CodeCode Available | 1 |
| M-VADER: A Model for Diffusion with Multimodal Context | Dec 6, 2022 | DecoderImage Generation | —Unverified | 0 |
| Meta-Learning Fast Weight Language Models | Dec 5, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| I2MVFormer: Large Language Model Generated Multi-View Document Supervision for Zero-Shot Image Classification | Dec 5, 2022 | Classificationimage-classification | —Unverified | 0 |
| Fast and accurate factorized neural transducer for text adaption of end-to-end speech recognition models | Dec 5, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Legal Prompt Engineering for Multilingual Legal Judgement Prediction | Dec 5, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Building Metadata Inference Using a Transducer Based Language Model | Dec 5, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MiLMo:Minority Multilingual Pre-trained Language Model | Dec 4, 2022 | ClassificationLanguage Modeling | —Unverified | 0 |
| Toward Efficient Language Model Pretraining and Downstream Adaptation via Self-Evolution: A Case Study on SuperGLUE | Dec 4, 2022 | Common Sense Reasoningcoreference-resolution | —Unverified | 0 |
| Cross-lingual Similarity of Multilingual Representations Revisited | Dec 4, 2022 | Causal Language ModelingCross-Lingual Transfer | CodeCode Available | 0 |
| Global memory transformer for processing long documents | Dec 3, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| PartSLIP: Low-Shot Part Segmentation for 3D Point Clouds via Pretrained Image-Language Models | Dec 3, 2022 | 3D Part SegmentationLanguage Modeling | CodeCode Available | 1 |
| Exploring Stochastic Autoregressive Image Modeling for Visual Representation | Dec 3, 2022 | DecoderLanguage Modeling | CodeCode Available | 1 |
| Compound Tokens: Channel Fusion for Vision-Language Representation Learning | Dec 2, 2022 | DecoderLanguage Modeling | —Unverified | 0 |
| Systematic Analysis for Pretrained Language Model Priming for Parameter-Efficient Fine-tuning | Dec 2, 2022 | Domain GeneralizationLanguage Modeling | —Unverified | 0 |
| Legal Prompting: Teaching a Language Model to Think Like a Lawyer | Dec 2, 2022 | Common Sense ReasoningLanguage Modeling | —Unverified | 0 |
| Faster Adaptive Federated Learning | Dec 2, 2022 | Federated Learningimage-classification | —Unverified | 0 |
| Nonparametric Masked Language Modeling | Dec 2, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| CliMedBERT: A Pre-trained Language Model for Climate and Health-related Text | Dec 1, 2022 | Fact CheckingLanguage Modeling | —Unverified | 0 |
| Language Model Pre-training on True Negatives | Dec 1, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Extensible Prompts for Language Models on Zero-shot Language Style Customization | Dec 1, 2022 | DescriptiveLanguage Modeling | —Unverified | 0 |
| Adapted Multimodal BERT with Layer-wise Fusion for Sentiment Analysis | Dec 1, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Fast Inference from Transformers via Speculative Decoding | Nov 30, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 5 |
| BudgetLongformer: Can we Cheaply Pretrain a SotA Legal Language Model From Scratch? | Nov 30, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| KRLS: Improving End-to-End Response Generation in Task Oriented Dialog with Reinforced Keywords Learning | Nov 30, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| xTrimoABFold: De novo Antibody Structure Prediction without MSA | Nov 30, 2022 | Computational EfficiencyDrug Design | —Unverified | 0 |
| Improving astroBERT using Semantic Textual Similarity | Nov 29, 2022 | AstronomyLanguage Modeling | —Unverified | 0 |
| Better Transcription of UK Supreme Court Hearings | Nov 29, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Coder Reviewer Reranking for Code Generation | Nov 29, 2022 | Code GenerationLanguage Modeling | CodeCode Available | 1 |
| Composition based oxidation state prediction of materials using deep learning | Nov 29, 2022 | Deep LearningLanguage Modeling | CodeCode Available | 1 |
| Syntactic Substitutability as Unsupervised Dependency Syntax | Nov 29, 2022 | Dependency ParsingLanguage Modeling | CodeCode Available | 0 |
| Contrastive Novelty-Augmented Learning: Anticipating Outliers with Large Language Models | Nov 28, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Large Pre-Trained Models with Extra-Large Vocabularies: A Contrastive Analysis of Hebrew BERT Models and a New One to Outperform Them All | Nov 28, 2022 | AllLanguage Modeling | —Unverified | 0 |