| Composing Parameter-Efficient Modules with Arithmetic Operations | Jun 26, 2023 | Language Modeling | Code Available | 1 |
| DesCo: Learning Object Recognition with Rich Language Descriptions | Jun 24, 2023 | Language Modeling | Code Available | 1 |
| Implementing contextual biasing in GPU decoder for online ASR | Jun 23, 2023 | CPU, Decoder | Code Available | 1 |
| Bring Your Own Data! Self-Supervised Evaluation for Large Language Models | Jun 23, 2023 | Chatbot, Language Modeling | Code Available | 1 |
| Generative Multimodal Entity Linking | Jun 22, 2023 | Entity Linking, In-Context Learning | Code Available | 1 |
| OphGLM: Training an Ophthalmology Large Language-and-Vision Assistant based on Instructions and Dialogue | Jun 21, 2023 | Instruction Following, Language Modeling | Code Available | 1 |
| NoRefER: a Referenceless Quality Metric for Automatic Speech Recognition via Semi-Supervised Language Model Fine-Tuning with Contrastive Learning | Jun 21, 2023 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| Mass-Producing Failures of Multimodal Systems with Language Models | Jun 21, 2023 | Language Modeling | Code Available | 1 |
| A Reference-less Quality Metric for Automatic Speech Recognition via Contrastive-Learning of a Multi-Language Model with Self-Supervision | Jun 21, 2023 | Automatic Speech Recognition (ASR) | Code Available | 1 |
| Sparse Modular Activation for Efficient Sequence Modeling | Jun 19, 2023 | Chunking, Language Modeling | Code Available | 1 |
| LLMVA-GEBC: Large Language Model with Video Adapter for Generic Event Boundary Captioning | Jun 17, 2023 | Boundary Captioning, Language Modeling | Code Available | 1 |
| Just One Byte (per gradient): A Note on Low-Bandwidth Decentralized Language Model Finetuning Using Shared Randomness | Jun 16, 2023 | Distributed Optimization, Language Modeling | Code Available | 1 |
| FALL-E: A Foley Sound Synthesis Model and Strategies | Jun 16, 2023 | Diversity, Language Modeling | Code Available | 1 |
| Conformal Language Modeling | Jun 16, 2023 | Conformal Prediction, Language Modeling | Code Available | 1 |
| Pushing the Limits of Unsupervised Unit Discovery for SSL Speech Representation | Jun 15, 2023 | Automatic Speech Recognition, Clustering | Code Available | 1 |
| ChessGPT: Bridging Policy Learning and Language Modeling | Jun 15, 2023 | Decision Making, Language Modeling | Code Available | 1 |
| World-to-Words: Grounded Open Vocabulary Acquisition through Fast Mapping in Vision-Language Models | Jun 14, 2023 | Grounded Open Vocabulary Acquisition, Language Modeling | Code Available | 1 |
| Generate to Understand for Representation | Jun 14, 2023 | Contrastive Learning, GPU | Code Available | 1 |
| Tokenization with Factorized Subword Encoding | Jun 13, 2023 | Language Modeling | Code Available | 1 |
| Gradient Ascent Post-training Enhances Language Model Generalization | Jun 12, 2023 | Language Modeling | Code Available | 1 |
| Waffling around for Performance: Visual Classification with Random Words and Broad Concepts | Jun 12, 2023 | Classification, Language Modeling | Code Available | 1 |
| QUERT: Continual Pre-training of Language Model for Query Understanding in Travel Domain Search | Jun 11, 2023 | Domain Adaptation, Language Modeling | Code Available | 1 |
| GKD: A General Knowledge Distillation Framework for Large-scale Pre-trained Language Model | Jun 11, 2023 | General Knowledge, Knowledge Distillation | Code Available | 1 |
| Are Intermediate Layers and Labels Really Necessary? A General Language Model Distillation Method | Jun 11, 2023 | Knowledge Distillation, Language Modeling | Code Available | 1 |
| Large Language Models Are Semi-Parametric Reinforcement Learning Agents | Jun 9, 2023 | Language Modeling | Code Available | 1 |