| Factual Consistency Oriented Speech Recognition | Feb 24, 2023 | Automatic Speech Recognition (ASR) | Unverified | 0 |
| NoPPA: Non-Parametric Pairwise Attention Random Walk Model for Sentence Representation | Feb 24, 2023 | Language Modeling | Code Available | 0 |
| An Independent Evaluation of ChatGPT on Mathematical Word Problems (MWP) | Feb 23, 2023 | Language Modeling | Code Available | 0 |
| Language Model Crossover: Variation through Few-Shot Prompting | Feb 23, 2023 | In-Context Learning, Language Modeling | Code Available | 2 |
| Generative Sentiment Transfer via Adaptive Masking | Feb 23, 2023 | Language Modeling | Unverified | 0 |
| On the Generalization Ability of Retrieval-Enhanced Transformers | Feb 23, 2023 | Language Modeling | Code Available | 0 |
| EVJVQA Challenge: Multilingual Visual Question Answering | Feb 23, 2023 | Language Modeling | Unverified | 0 |
| What makes a language easy to deep-learn? Deep neural networks and humans similarly benefit from compositional structure | Feb 23, 2023 | Language Modeling | Code Available | 0 |
| BadGPT: Exploring Security Vulnerabilities of ChatGPT via Backdoor Attacks to InstructGPT | Feb 21, 2023 | Backdoor Attack, Language Modeling | Unverified | 0 |
| kNN-Adapter: Efficient Domain Adaptation for Black-Box Language Models | Feb 21, 2023 | Domain Adaptation, Language Modeling | Unverified | 0 |
| Hyena Hierarchy: Towards Larger Convolutional Language Models | Feb 21, 2023 | 2k, 8k | Code Available | 2 |
| Federated Learning for ASR based on Wav2vec 2.0 | Feb 20, 2023 | Federated Learning, Language Modeling | Code Available | 1 |
| Can discrete information extraction prompts generalize across language models? | Feb 20, 2023 | Language Modeling | Code Available | 0 |
| Towards Universal Fake Image Detectors that Generalize Across Generative Models | Feb 20, 2023 | Classification, Language Modeling | Code Available | 2 |
| Bag of Tricks for Effective Language Model Pretraining and Downstream Adaptation: A Case Study on GLUE | Feb 18, 2023 | Contrastive Learning, Denoising | Unverified | 0 |
| BBT-Fin: Comprehensive Construction of Chinese Financial Domain Pre-trained Language Model, Corpus and Benchmark | Feb 18, 2023 | Language Modeling | Code Available | 2 |
| Prompting Large Language Models With the Socratic Method | Feb 17, 2023 | counterfactual, Counterfactual Reasoning | Unverified | 0 |
| Privately Customizing Prefinetuning to Better Match User Data in Federated Learning | Feb 17, 2023 | Federated Learning, Language Modeling | Unverified | 0 |
| Massively Multilingual Shallow Fusion with Large Language Models | Feb 17, 2023 | Automatic Speech Recognition (ASR) | Unverified | 0 |
| Multiperiodic Processes: Ergodic Sources with a Sublinear Entropy | Feb 17, 2023 | Language Modeling | Unverified | 0 |
| GPT4MIA: Utilizing Generative Pre-trained Transformer (GPT-3) as A Plug-and-Play Transductive Model for Medical Image Analysis | Feb 17, 2023 | Image Classification | Unverified | 0 |
| Entry Separation using a Mixed Visual and Textual Language Model: Application to 19th century French Trade Directories | Feb 17, 2023 | Language Modeling | Code Available | 0 |
| Bridge the Gap between Language models and Tabular Understanding | Feb 16, 2023 | Contrastive Learning, Language Modeling | Unverified | 0 |
| JEIT: Joint End-to-End Model and Internal Language Model Training for Speech Recognition | Feb 16, 2023 | Language Modeling | Unverified | 0 |
| Adaptable End-to-End ASR Models using Replaceable Internal LMs and Residual Softmax | Feb 16, 2023 | Automatic Speech Recognition (ASR) | Unverified | 0 |