| Towards Universal Fake Image Detectors that Generalize Across Generative Models | Feb 20, 2023 | ClassificationLanguage Modeling | CodeCode Available | 2 |
| BBT-Fin: Comprehensive Construction of Chinese Financial Domain Pre-trained Language Model, Corpus and Benchmark | Feb 18, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Simple Hardware-Efficient Long Convolutions for Sequence Modeling | Feb 13, 2023 | GPUimage-classification | CodeCode Available | 2 |
| RESDSQL: Decoupling Schema Linking and Skeleton Parsing for Text-to-SQL | Feb 12, 2023 | DecoderLanguage Modeling | CodeCode Available | 2 |
| Accelerating Large Language Model Decoding with Speculative Sampling | Feb 2, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| In-Context Retrieval-Augmented Language Models | Jan 31, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Grounding Language Models to Images for Multimodal Inputs and Outputs | Jan 31, 2023 | Image RetrievalIn-Context Learning | CodeCode Available | 2 |
| Editing Language Model-based Knowledge Graph Embeddings | Jan 25, 2023 | EDIT Taskknowledge editing | CodeCode Available | 2 |
| Adapting a Language Model While Preserving its General Knowledge | Jan 21, 2023 | Continual LearningGeneral Knowledge | CodeCode Available | 2 |
| Hungry Hungry Hippos: Towards Language Modeling with State Space Models | Dec 28, 2022 | 8kCoreference Resolution | CodeCode Available | 2 |
| SODA: Million-scale Dialogue Distillation with Social Commonsense Contextualization | Dec 20, 2022 | Dialogue GenerationLanguage Modeling | CodeCode Available | 2 |
| Precise Zero-Shot Dense Retrieval without Relevance Labels | Dec 20, 2022 | Fact VerificationInstruction Following | CodeCode Available | 2 |
| A Length-Extrapolatable Transformer | Dec 20, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models | Nov 28, 2022 | DenoisingLanguage Modeling | CodeCode Available | 2 |
| CLIP-ReID: Exploiting Vision-Language Model for Image Re-Identification without Concrete Text Labels | Nov 25, 2022 | image-classificationImage Classification | CodeCode Available | 2 |
| Ignore Previous Prompt: Attack Techniques For Language Models | Nov 17, 2022 | Adversarial AttackAdversarial Text | CodeCode Available | 2 |
| LERT: A Linguistically-motivated Pre-trained Language Model | Nov 10, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| When Language Model Meets Private Library | Oct 31, 2022 | Code GenerationLanguage Modeling | CodeCode Available | 2 |
| Retrieval Oriented Masking Pre-training Language Model for Dense Passage Retrieval | Oct 27, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Contrastive Decoding: Open-ended Text Generation as Optimization | Oct 27, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Contrastive Search Is What You Need For Neural Text Generation | Oct 25, 2022 | Contrastive LearningLanguage Modeling | CodeCode Available | 2 |
| TabLLM: Few-shot Classification of Tabular Data with Large Language Models | Oct 19, 2022 | ClassificationDeep Learning | CodeCode Available | 2 |
| Deep Bidirectional Language-Knowledge Graph Pretraining | Oct 17, 2022 | Common Sense ReasoningKnowledge Graphs | CodeCode Available | 2 |
| Re3: Generating Longer Stories With Recursive Reprompting and Revision | Oct 13, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Mass-Editing Memory in a Transformer | Oct 13, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 2 |