| Fine-tuning language models to find agreement among humans with diverse preferences | Nov 28, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Inter-KD: Intermediate Knowledge Distillation for CTC-Based Automatic Speech Recognition | Nov 28, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models | Nov 28, 2022 | DenoisingLanguage Modeling | CodeCode Available | 2 |
| Revisiting Distance Metric Learning for Few-Shot Natural Language Classification | Nov 28, 2022 | Few-Shot LearningLanguage Modeling | —Unverified | 0 |
| Detect-Localize-Repair: A Unified Framework for Learning to Debug with CodeT5 | Nov 27, 2022 | Bug fixingLanguage Modeling | —Unverified | 0 |
| SKDBERT: Compressing BERT via Stochastic Knowledge Distillation | Nov 26, 2022 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| CLIP-ReID: Exploiting Vision-Language Model for Image Re-Identification without Concrete Text Labels | Nov 25, 2022 | image-classificationImage Classification | CodeCode Available | 2 |
| Comparison Study Between Token Classification and Sequence Classification In Text Classification | Nov 25, 2022 | ClassificationLanguage Modeling | —Unverified | 0 |
| Seeing What You Miss: Vision-Language Pre-training with Semantic Completion Learning | Nov 24, 2022 | cross-modal alignmentImage-text Retrieval | CodeCode Available | 1 |
| Question Answering and Question Generation for Finnish | Nov 24, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Self-supervised vision-language pretraining for Medical visual question answering | Nov 24, 2022 | Contrastive LearningImage-text matching | CodeCode Available | 1 |
| Unified Multimodal Model with Unlikelihood Training for Visual Dialog | Nov 23, 2022 | Answer GenerationChatbot | CodeCode Available | 1 |
| Open-vocabulary Attribute Detection | Nov 23, 2022 | AttributeLanguage Modeling | CodeCode Available | 1 |
| Word-Level Representation From Bytes For Language Modeling | Nov 23, 2022 | Cross-Lingual Transferimage-classification | —Unverified | 0 |
| TorchScale: Transformers at Scale | Nov 23, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MarianCG: a code generation transformer model inspired by machine translation | Nov 22, 2022 | Code GenerationCode Translation | CodeCode Available | 1 |
| Retrieval-Augmented Multimodal Language Modeling | Nov 22, 2022 | Caption GenerationImage Captioning | —Unverified | 0 |
| Human-level play in the game of Diplomacy by combining language models with strategic reasoning | Nov 22, 2022 | AI AgentLanguage Modeling | CodeCode Available | 3 |
| Knowledge Prompting for Few-shot Action Recognition | Nov 22, 2022 | Action RecognitionAction Recognition In Videos | —Unverified | 0 |
| Converge to the Truth: Factual Error Correction via Iterative Constrained Editing | Nov 22, 2022 | Fact VerificationLanguage Modeling | CodeCode Available | 0 |
| HyperTuning: Toward Adapting Large Language Models without Back-propagation | Nov 22, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Coreference Resolution through a seq2seq Transition-Based System | Nov 22, 2022 | coreference-resolutionCoreference Resolution | —Unverified | 0 |
| Validating Large Language Models with ReLM | Nov 21, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| You Need Multiple Exiting: Dynamic Early Exiting for Accelerating Unified Vision Language Model | Nov 21, 2022 | DecoderLanguage Modeling | —Unverified | 0 |
| Enhancing Crisis-Related Tweet Classification with Entity-Masked Language Modeling and Multi-Task Learning | Nov 21, 2022 | Hierarchical Multi-label ClassificationLanguage Modeling | CodeCode Available | 0 |
| ClipCrop: Conditioned Cropping Driven by Vision-Language Model | Nov 21, 2022 | DecoderImage Cropping | —Unverified | 0 |
| AF Adapter: Continual Pretraining for Building Chinese Biomedical Language Model | Nov 21, 2022 | Continual PretrainingLanguage Modeling | CodeCode Available | 0 |
| Language in a Bottle: Language Model Guided Concept Bottlenecks for Interpretable Image Classification | Nov 21, 2022 | image-classificationImage Classification | CodeCode Available | 1 |
| Deanthropomorphising NLP: Can a Language Model Be Conscious? | Nov 21, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention | Nov 21, 2022 | Cross-Modal RetrievalLanguage Modeling | CodeCode Available | 1 |
| Multi-Level Knowledge Distillation for Out-of-Distribution Detection in Text | Nov 21, 2022 | Knowledge DistillationLanguage Modeling | CodeCode Available | 1 |
| Leveraging per Image-Token Consistency for Vision-Language Pre-training | Nov 20, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Embracing Ambiguity: Improving Similarity-oriented Tasks with Contextual Synonym Knowledge | Nov 20, 2022 | Entity LinkingLanguage Modeling | —Unverified | 0 |
| Modeling Fine-grained Information via Knowledge-aware Hierarchical Graph for Zero-shot Entity Retrieval | Nov 20, 2022 | Entity RetrievalGraph Attention | CodeCode Available | 0 |
| ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting | Nov 19, 2022 | BlockingLanguage Modeling | CodeCode Available | 1 |
| Knowledge Graph Generation From Text | Nov 18, 2022 | Graph GenerationJoint Entity and Relation Extraction | CodeCode Available | 1 |
| 3d human motion generation from the text via gesture action classification and the autoregressive model | Nov 18, 2022 | Action ClassificationAction Recognition | —Unverified | 0 |
| GENIUS: Sketch-based Language Model Pre-training via Extreme and Selective Masking for Text Generation and Augmentation | Nov 18, 2022 | Conditional Text GenerationData Augmentation | CodeCode Available | 1 |
| Metadata Might Make Language Models Better | Nov 18, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Protein language model rescue mutations highlight variant effects and structure in clinically relevant genes | Nov 18, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| CAPE: Corrective Actions from Precondition Errors using Large Language Models | Nov 17, 2022 | Common Sense ReasoningLanguage Modeling | —Unverified | 0 |
| LongFNT: Long-form Speech Recognition with Factorized Neural Transducer | Nov 17, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| InstructPix2Pix: Learning to Follow Image Editing Instructions | Nov 17, 2022 | Image Editing | CodeCode Available | 5 |
| Ignore Previous Prompt: Attack Techniques For Language Models | Nov 17, 2022 | Adversarial AttackAdversarial Text | CodeCode Available | 2 |
| Galactica: A Large Language Model for Science | Nov 16, 2022 | AnachronismsBias Detection | CodeCode Available | 4 |
| Prompting PaLM for Translation: Assessing Strategies and Performance | Nov 16, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| TSMind: Alibaba and Soochow University's Submission to the WMT22 Translation Suggestion Task | Nov 16, 2022 | Data AugmentationLanguage Modeling | —Unverified | 0 |
| Towards Computationally Verifiable Semantic Grounding for Language Models | Nov 16, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Reasoning Circuits: Few-shot Multihop Question Generation with Structured Rationales | Nov 15, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Relationship of the language distance to English ability of a country | Nov 15, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |