| Just One Byte (per gradient): A Note on Low-Bandwidth Decentralized Language Model Finetuning Using Shared Randomness | Jun 16, 2023 | Distributed OptimizationLanguage Modeling | CodeCode Available | 1 |
| AD-AutoGPT: An Autonomous GPT for Alzheimer's Disease Infodemiology | Jun 16, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ClinicalGPT: Large Language Models Finetuned with Diverse Medical Data and Comprehensive Evaluation | Jun 16, 2023 | DiagnosticLanguage Modeling | —Unverified | 0 |
| FALL-E: A Foley Sound Synthesis Model and Strategies | Jun 16, 2023 | DiversityLanguage Modeling | CodeCode Available | 1 |
| Learning to Summarize and Answer Questions about a Virtual Robot's Past Actions | Jun 16, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Process Knowledge-infused Learning for Clinician-friendly Explanations | Jun 16, 2023 | DiagnosticExplainable Artificial Intelligence (XAI) | —Unverified | 0 |
| Inspire creativity with ORIBA: Transform Artists' Original Characters into Chatbots through Large Language Model | Jun 16, 2023 | ChatbotLanguage Modeling | —Unverified | 0 |
| CMLM-CSE: Based on Conditional MLM Contrastive Learning for Sentence Embeddings | Jun 16, 2023 | Contrastive LearningLanguage Modeling | —Unverified | 0 |
| ChessGPT: Bridging Policy Learning and Language Modeling | Jun 15, 2023 | Decision MakingLanguage Modeling | CodeCode Available | 1 |
| Exploring the MIT Mathematics and EECS Curriculum Using Large Language Models | Jun 15, 2023 | Electrical EngineeringFew-Shot Learning | —Unverified | 0 |
| Block-State Transformers | Jun 15, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Distillation Strategies for Discriminative Speech Recognition Rescoring | Jun 15, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Pushing the Limits of Unsupervised Unit Discovery for SSL Speech Representation | Jun 15, 2023 | Automatic Speech RecognitionClustering | CodeCode Available | 1 |
| Personalized Image Enhancement Featuring Masked Style Modeling | Jun 15, 2023 | Image EnhancementLanguage Modeling | CodeCode Available | 0 |
| Mapping Researcher Activity based on Publication Data by means of Transformers | Jun 15, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Can ChatGPT pass the Vietnamese National High School Graduation Examination? | Jun 15, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Neural models for Factual Inconsistency Classification with Explanations | Jun 15, 2023 | 8kClassification | CodeCode Available | 0 |
| Macaw-LLM: Multi-Modal Language Modeling with Image, Audio, Video, and Text Integration | Jun 15, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Generate to Understand for Representation | Jun 14, 2023 | Contrastive LearningGPU | CodeCode Available | 1 |
| Revealing the structure of language model capabilities | Jun 14, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| CLIPXPlore: Coupled CLIP and Shape Spaces for 3D Shape Exploration | Jun 14, 2023 | AttributeLanguage Modeling | —Unverified | 0 |
| Recipes for Sequential Pre-training of Multilingual Encoder and Seq2Seq Models | Jun 14, 2023 | DecoderLanguage Modeling | —Unverified | 0 |
| World-to-Words: Grounded Open Vocabulary Acquisition through Fast Mapping in Vision-Language Models | Jun 14, 2023 | Grounded Open Vocabulary AcquisitionLanguage Modeling | CodeCode Available | 1 |
| Radiology-GPT: A Large Language Model for Radiology | Jun 14, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Large-scale Language Model Rescoring on Long-form Data | Jun 13, 2023 | FormLanguage Modeling | —Unverified | 0 |
| AVIS: Autonomous Visual Information Seeking with Large Language Model Agent | Jun 13, 2023 | Decision MakingLanguage Modeling | —Unverified | 0 |
| PauseSpeech: Natural Speech Synthesis via Pre-trained Language Model and Pause-based Prosody Modeling | Jun 13, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| INT2.1: Towards Fine-Tunable Quantized Large Language Models with Error Correction through Low-Rank Adaptation | Jun 13, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| Tokenization with Factorized Subword Encoding | Jun 13, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| WebGLM: Towards An Efficient Web-Enhanced Question Answering System with Human Preferences | Jun 13, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| NoCoLA: The Norwegian Corpus of Linguistic Acceptability | Jun 13, 2023 | Binary ClassificationDiagnostic | CodeCode Available | 0 |
| XrayGPT: Chest Radiographs Summarization using Medical Vision-Language Models | Jun 13, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Augmenting Language Models with Long-Term Memory | Jun 12, 2023 | FormIn-Context Learning | —Unverified | 0 |
| EriBERTa: A Bilingual Pre-Trained Language Model for Clinical Natural Language Processing | Jun 12, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Waffling around for Performance: Visual Classification with Random Words and Broad Concepts | Jun 12, 2023 | ClassificationLanguage Modeling | CodeCode Available | 1 |
| Large language models and (non-)linguistic recursion | Jun 12, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Weakly supervised information extraction from inscrutable handwritten document images | Jun 12, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| InstructP2P: Learning to Edit 3D Point Clouds with Text Instructions | Jun 12, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Valley: Video Assistant with Large Language model Enhanced abilitY | Jun 12, 2023 | Action RecognitionInstruction Following | CodeCode Available | 2 |
| Gradient Ascent Post-training Enhances Language Model Generalization | Jun 12, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| QUERT: Continual Pre-training of Language Model for Query Understanding in Travel Domain Search | Jun 11, 2023 | Domain AdaptationLanguage Modeling | CodeCode Available | 1 |
| GKD: A General Knowledge Distillation Framework for Large-scale Pre-trained Language Model | Jun 11, 2023 | General KnowledgeKnowledge Distillation | CodeCode Available | 1 |
| RoBERTweet: A BERT Language Model for Romanian Tweets | Jun 11, 2023 | Language IdentificationLanguage Modeling | —Unverified | 0 |
| Are Intermediate Layers and Labels Really Necessary? A General Language Model Distillation Method | Jun 11, 2023 | Knowledge DistillationLanguage Modeling | CodeCode Available | 1 |
| Language-Guided Traffic Simulation via Scene-Level Diffusion | Jun 10, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Improving Non-autoregressive Translation Quality with Pretrained Language Model, Embedding Distillation and Upsampling Strategy for CTC | Jun 10, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Large Language Models Are Semi-Parametric Reinforcement Learning Agents | Jun 9, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| 14 Examples of How LLMs Can Transform Materials Science and Chemistry: A Reflection on a Large Language Model Hackathon | Jun 9, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Language Models Can Learn Exceptions to Syntactic Rules | Jun 9, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Towards a Robust Detection of Language Model Generated Text: Is ChatGPT that Easy to Detect? | Jun 9, 2023 | Adversarial TextLanguage Modeling | —Unverified | 0 |