| Towards Universal Fake Image Detectors that Generalize Across Generative Models | Feb 20, 2023 | Classification, Language Modeling | Code Available | 2 |
| BBT-Fin: Comprehensive Construction of Chinese Financial Domain Pre-trained Language Model, Corpus and Benchmark | Feb 18, 2023 | Language Modeling, Language Modelling | Code Available | 2 |
| Simple Hardware-Efficient Long Convolutions for Sequence Modeling | Feb 13, 2023 | GPU, image-classification | Code Available | 2 |
| RESDSQL: Decoupling Schema Linking and Skeleton Parsing for Text-to-SQL | Feb 12, 2023 | Decoder, Language Modeling | Code Available | 2 |
| Accelerating Large Language Model Decoding with Speculative Sampling | Feb 2, 2023 | Language Modeling, Language Modelling | Code Available | 2 |
| In-Context Retrieval-Augmented Language Models | Jan 31, 2023 | Language Modeling, Language Modelling | Code Available | 2 |
| Grounding Language Models to Images for Multimodal Inputs and Outputs | Jan 31, 2023 | Image Retrieval, In-Context Learning | Code Available | 2 |
| Editing Language Model-based Knowledge Graph Embeddings | Jan 25, 2023 | EDIT Task, knowledge editing | Code Available | 2 |
| Adapting a Language Model While Preserving its General Knowledge | Jan 21, 2023 | Continual Learning, General Knowledge | Code Available | 2 |
| Hungry Hungry Hippos: Towards Language Modeling with State Space Models | Dec 28, 2022 | 8k, Coreference Resolution | Code Available | 2 |
| SODA: Million-scale Dialogue Distillation with Social Commonsense Contextualization | Dec 20, 2022 | Dialogue Generation, Language Modeling | Code Available | 2 |
| A Length-Extrapolatable Transformer | Dec 20, 2022 | Language Modeling, Language Modelling | Code Available | 2 |
| Precise Zero-Shot Dense Retrieval without Relevance Labels | Dec 20, 2022 | Fact Verification, Instruction Following | Code Available | 2 |
| DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models | Nov 28, 2022 | Denoising, Language Modeling | Code Available | 2 |
| CLIP-ReID: Exploiting Vision-Language Model for Image Re-Identification without Concrete Text Labels | Nov 25, 2022 | image-classification, Image Classification | Code Available | 2 |
| Ignore Previous Prompt: Attack Techniques For Language Models | Nov 17, 2022 | Adversarial Attack, Adversarial Text | Code Available | 2 |
| LERT: A Linguistically-motivated Pre-trained Language Model | Nov 10, 2022 | Language Modeling, Language Modelling | Code Available | 2 |
| When Language Model Meets Private Library | Oct 31, 2022 | Code Generation, Language Modeling | Code Available | 2 |
| Contrastive Decoding: Open-ended Text Generation as Optimization | Oct 27, 2022 | Language Modeling, Language Modelling | Code Available | 2 |
| Retrieval Oriented Masking Pre-training Language Model for Dense Passage Retrieval | Oct 27, 2022 | Language Modeling, Language Modelling | Code Available | 2 |
| Contrastive Search Is What You Need For Neural Text Generation | Oct 25, 2022 | Contrastive Learning, Language Modeling | Code Available | 2 |
| TabLLM: Few-shot Classification of Tabular Data with Large Language Models | Oct 19, 2022 | Classification, Deep Learning | Code Available | 2 |
| Deep Bidirectional Language-Knowledge Graph Pretraining | Oct 17, 2022 | Common Sense Reasoning, Knowledge Graphs | Code Available | 2 |
| Re3: Generating Longer Stories With Recursive Reprompting and Revision | Oct 13, 2022 | Language Modeling, Language Modelling | Code Available | 2 |
| Mass-Editing Memory in a Transformer | Oct 13, 2022 | Language Modeling, Language Modelling | Code Available | 2 |
| Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts | Oct 7, 2022 | Articles, Language Modeling | Code Available | 2 |
| Pix2Struct: Screenshot Parsing as Pretraining for Visual Language Understanding | Oct 7, 2022 | Chart Question Answering, Diversity | Code Available | 2 |
| LambdaKG: A Library for Pre-trained Language Model-Based Knowledge Graph Embeddings | Oct 1, 2022 | Graph Representation Learning, Knowledge Graph Completion | Code Available | 2 |
| Mega: Moving Average Equipped Gated Attention | Sep 21, 2022 | Image Classification, Inductive Bias | Code Available | 2 |
| Generate rather than Retrieve: Large Language Models are Strong Context Generators | Sep 21, 2022 | Language Modeling, Language Modelling | Code Available | 2 |
| T-NER: An All-Round Python Library for Transformer-based Named Entity Recognition | Sep 9, 2022 | All, Domain Generalization | Code Available | 2 |
| Atlas: Few-shot Learning with Retrieval Augmented Language Models | Aug 5, 2022 | Fact Checking, Few-Shot Learning | Code Available | 2 |
| AlexaTM 20B: Few-Shot Learning Using a Large-Scale Multilingual Seq2Seq Model | Aug 2, 2022 | Causal Language Modeling, Common Sense Reasoning | Code Available | 2 |
| Language Model Cascades | Jul 21, 2022 | Few-Shot Learning, Language Modeling | Code Available | 2 |
| Recurrent Memory Transformer | Jul 14, 2022 | Language Modeling, Language Modelling | Code Available | 2 |
| Scene Text Recognition with Permuted Autoregressive Sequence Models | Jul 14, 2022 | Language Modeling, Language Modelling | Code Available | 2 |
| LM-Nav: Robotic Navigation with Large Pre-Trained Models of Language, Vision, and Action | Jul 10, 2022 | Instruction Following, Language Modeling | Code Available | 2 |
| Accurate RNA 3D structure prediction using a language model-based deep learning approach | Jul 4, 2022 | Language Modeling, Language Modelling | Code Available | 2 |
| Egocentric Video-Language Pretraining @ Ego4D Challenge 2022 | Jul 4, 2022 | Language Modeling, Language Modelling | Code Available | 2 |
| Egocentric Video-Language Pretraining @ EPIC-KITCHENS-100 Multi-Instance Retrieval Challenge 2022 | Jul 4, 2022 | Language Modeling, Language Modelling | Code Available | 2 |
| BigBIO: A Framework for Data-Centric Biomedical Natural Language Processing | Jun 30, 2022 | Diversity, Language Model Evaluation | Code Available | 2 |
| Solving Quantitative Reasoning Problems with Language Models | Jun 29, 2022 | Arithmetic Reasoning, Language Modeling | Code Available | 2 |
| TEVR: Improving Speech Recognition by Token Entropy Variance Reduction | Jun 25, 2022 | Automatic Speech Recognition (ASR), Language Modeling | Code Available | 2 |
| Mining Error Templates for Grammatical Error Correction | Jun 23, 2022 | Grammatical Error Correction, Language Modeling | Code Available | 2 |
| GODEL: Large-Scale Pre-Training for Goal-Directed Dialog | Jun 22, 2022 | Language Modeling, Language Modelling | Code Available | 2 |
| Revealing Single Frame Bias for Video-and-Language Learning | Jun 7, 2022 | Action Recognition, Fine-grained Action Recognition | Code Available | 2 |
| GIT: A Generative Image-to-text Transformer for Vision and Language | May 27, 2022 | Decoder, Image Captioning | Code Available | 2 |
| RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-Encoder | May 24, 2022 | Decoder, Information Retrieval | Code Available | 2 |
| A Generalist Agent | May 12, 2022 | Language Modeling, Language Modelling | Code Available | 2 |
| Symphony Generation with Permutation Invariant Language Model | May 10, 2022 | Audio Generation, Decoder | Code Available | 2 |