| Cost-Effective Hyperparameter Optimization for Large Language Model Generation Inference | Mar 8, 2023 | Hyperparameter OptimizationLanguage Modeling | CodeCode Available | 4 |
| Magnushammer: A Transformer-Based Approach to Premise Selection | Mar 8, 2023 | Automated Theorem ProvingLanguage Modeling | —Unverified | 0 |
| Making a Computational Attorney | Mar 7, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Speak Foreign Languages with Your Own Voice: Cross-Lingual Neural Codec Language Modeling | Mar 7, 2023 | In-Context LearningLanguage Modeling | CodeCode Available | 5 |
| The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset | Mar 7, 2023 | EthicsLanguage Modeling | —Unverified | 0 |
| Preparing the Vuk'uzenzele and ZA-gov-multilingual South African multilingual corpora | Mar 7, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| ChatGPT: Beginning of an End of Manual Linguistic Data Annotation? Use Case of Automatic Genre Identification | Mar 7, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ChatGPT is on the Horizon: Could a Large Language Model be Suitable for Intelligent Traffic Safety Research and Applications? | Mar 6, 2023 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Data Portraits: Recording Foundation Model Training Data | Mar 6, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| FoundationTTS: Text-to-Speech for ASR Customization with Generative Language Model | Mar 6, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| OpenICL: An Open-Source Framework for In-context Learning | Mar 6, 2023 | In-Context LearningLanguage Modeling | CodeCode Available | 2 |
| Spelling convention sensitivity in neural language models | Mar 6, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| PaLM-E: An Embodied Multimodal Language Model | Mar 6, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Model-Agnostic Meta-Learning for Natural Language Understanding Tasks in Finance | Mar 6, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Multi-Grained Self-Interpretable Symbolic-Neural Model For Single/Multi-Labeled Text Classification | Mar 6, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Could a Large Language Model be Conscious? | Mar 4, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ConZIC: Controllable Zero-shot Image Captioning by Sampling-Based Polishing | Mar 4, 2023 | DiversityImage Captioning | CodeCode Available | 1 |
| FAME-ViL: Multi-Tasking Vision-Language Model for Heterogeneous Fashion Tasks | Mar 4, 2023 | Cross-Modal RetrievalImage Captioning | CodeCode Available | 1 |
| Prismer: A Vision-Language Model with Multi-Task Experts | Mar 4, 2023 | Few-Shot LearningImage Captioning | CodeCode Available | 1 |
| End-to-End Speech Recognition: A Survey | Mar 3, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Investigating the Translation Performance of a Large Multilingual Language Model: the Case of BLOOM | Mar 3, 2023 | Cross-Lingual TransferLanguage Modeling | CodeCode Available | 1 |
| Will Affective Computing Emerge from Foundation Models and General AI? A First Evaluation on ChatGPT | Mar 3, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| RePreM: Representation Pre-training with Masked Model for Reinforcement Learning | Mar 3, 2023 | Data AugmentationLanguage Modeling | —Unverified | 0 |
| Population-based Evaluation in Repeated Rock-Paper-Scissors as a Benchmark for Multiagent Reinforcement Learning | Mar 2, 2023 | Decision MakingLanguage Modeling | —Unverified | 0 |
| ConTEXTual Net: A Multimodal Vision-Language Model for Segmentation of Pneumothorax | Mar 2, 2023 | DescriptiveImage Captioning | CodeCode Available | 1 |
| BenchDirect: A Directed Language Model for Compiler Benchmarks | Mar 2, 2023 | Active LearningCPU | —Unverified | 0 |
| How will Language Modelers like ChatGPT Affect Occupations and Industries? | Mar 2, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Variance-reduced Clipping for Non-convex Optimization | Mar 2, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Semiparametric Language Models Are Scalable Continual Learners | Mar 2, 2023 | Continual LearningLanguage Modeling | —Unverified | 0 |
| N-best T5: Robust ASR Error Correction using Multiple Input Hypotheses and Constrained Decoding Space | Mar 1, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Domain-adapted large language models for classifying nuclear medicine reports | Mar 1, 2023 | Domain AdaptationLanguage Modeling | —Unverified | 0 |
| Almanac: Retrieval-Augmented Language Models for Clinical Medicine | Mar 1, 2023 | Decision MakingDialogue Generation | —Unverified | 0 |
| Grounded Decoding: Guiding Text Generation with Grounded Models for Embodied Agents | Mar 1, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| StrucTexTv2: Masked Visual-Textual Prediction for Document Image Pre-training | Mar 1, 2023 | Document Image Classificationimage-classification | CodeCode Available | 0 |
| SpeechPrompt v2: Prompt Tuning for Speech Classification Tasks | Mar 1, 2023 | ClassificationLanguage Modeling | —Unverified | 0 |
| Weighted Sampling for Masked Language Modeling | Feb 28, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Information-Restricted Neural Language Models Reveal Different Brain Regions' Sensitivity to Semantics, Syntax and Context | Feb 28, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| BrainBERT: Self-supervised representation learning for intracranial recordings | Feb 28, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| GLM-Dialog: Noise-tolerant Pre-training for Knowledge-grounded Dialogue Generation | Feb 28, 2023 | Dialogue EvaluationDialogue Generation | CodeCode Available | 1 |
| Efficient Masked Autoencoders with Self-Consistency | Feb 28, 2023 | image-classificationImage Classification | —Unverified | 0 |
| Language Is Not All You Need: Aligning Perception with Language Models | Feb 27, 2023 | AllImage Captioning | —Unverified | 0 |
| Pretraining De-Biased Language Model with Large-scale Click Logs for Document Ranking | Feb 27, 2023 | Document RankingInformation Retrieval | CodeCode Available | 1 |
| Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning | Feb 27, 2023 | Dense Video CaptioningLanguage Modeling | CodeCode Available | 2 |
| SpikeGPT: Generative Pre-trained Language Model with Spiking Neural Networks | Feb 27, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| The ROOTS Search Tool: Data Transparency for LLMs | Feb 27, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Duration-aware pause insertion using pre-trained language model for multi-speaker text-to-speech | Feb 27, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Choice Fusion as Knowledge for Zero-Shot Dialogue State Tracking | Feb 25, 2023 | DecoderDialogue State Tracking | CodeCode Available | 0 |
| Topic-Selective Graph Network for Topic-Focused Summarization | Feb 25, 2023 | ARCLanguage Modeling | —Unverified | 0 |
| Toward Fairness in Text Generation via Mutual Information Minimization based on Importance Sampling | Feb 25, 2023 | FairnessLanguage Modeling | —Unverified | 0 |
| Leveraging Large Language Model and Story-Based Gamification in Intelligent Tutoring System to Scaffold Introductory Programming Courses: A Design-Based Research Study | Feb 25, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |