| Factual Consistency Oriented Speech Recognition | Feb 24, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| NoPPA: Non-Parametric Pairwise Attention Random Walk Model for Sentence Representation | Feb 24, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| An Independent Evaluation of ChatGPT on Mathematical Word Problems (MWP) | Feb 23, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Language Model Crossover: Variation through Few-Shot Prompting | Feb 23, 2023 | In-Context LearningLanguage Modeling | CodeCode Available | 2 |
| Generative Sentiment Transfer via Adaptive Masking | Feb 23, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| On the Generalization Ability of Retrieval-Enhanced Transformers | Feb 23, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| EVJVQA Challenge: Multilingual Visual Question Answering | Feb 23, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| What makes a language easy to deep-learn? Deep neural networks and humans similarly benefit from compositional structure | Feb 23, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| BadGPT: Exploring Security Vulnerabilities of ChatGPT via Backdoor Attacks to InstructGPT | Feb 21, 2023 | Backdoor AttackLanguage Modeling | —Unverified | 0 |
| kNN-Adapter: Efficient Domain Adaptation for Black-Box Language Models | Feb 21, 2023 | Domain AdaptationLanguage Modeling | —Unverified | 0 |
| Hyena Hierarchy: Towards Larger Convolutional Language Models | Feb 21, 2023 | 2k8k | CodeCode Available | 2 |
| Federated Learning for ASR based on Wav2vec 2.0 | Feb 20, 2023 | Federated LearningLanguage Modeling | CodeCode Available | 1 |
| Can discrete information extraction prompts generalize across language models? | Feb 20, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Towards Universal Fake Image Detectors that Generalize Across Generative Models | Feb 20, 2023 | ClassificationLanguage Modeling | CodeCode Available | 2 |
| Bag of Tricks for Effective Language Model Pretraining and Downstream Adaptation: A Case Study on GLUE | Feb 18, 2023 | Contrastive LearningDenoising | —Unverified | 0 |
| BBT-Fin: Comprehensive Construction of Chinese Financial Domain Pre-trained Language Model, Corpus and Benchmark | Feb 18, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Prompting Large Language Models With the Socratic Method | Feb 17, 2023 | counterfactualCounterfactual Reasoning | —Unverified | 0 |
| Privately Customizing Prefinetuning to Better Match User Data in Federated Learning | Feb 17, 2023 | Federated LearningLanguage Modeling | —Unverified | 0 |
| Massively Multilingual Shallow Fusion with Large Language Models | Feb 17, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Multiperiodic Processes: Ergodic Sources with a Sublinear Entropy | Feb 17, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| GPT4MIA: Utilizing Generative Pre-trained Transformer (GPT-3) as A Plug-and-Play Transductive Model for Medical Image Analysis | Feb 17, 2023 | image-classificationImage Classification | —Unverified | 0 |
| Entry Separation using a Mixed Visual and Textual Language Model: Application to 19th century French Trade Directories | Feb 17, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Bridge the Gap between Language models and Tabular Understanding | Feb 16, 2023 | Contrastive LearningLanguage Modeling | —Unverified | 0 |
| JEIT: Joint End-to-End Model and Internal Language Model Training for Speech Recognition | Feb 16, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Adaptable End-to-End ASR Models using Replaceable Internal LMs and Residual Softmax | Feb 16, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| What A Situated Language-Using Agent Must be Able to Do: A Top-Down Analysis | Feb 16, 2023 | Incremental LearningLanguage Modeling | —Unverified | 0 |
| Role of Bias Terms in Dot-Product Attention | Feb 16, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Reanalyzing L2 Preposition Learning with Bayesian Mixed Effects and a Pretrained Language Model | Feb 16, 2023 | DiversityLanguage Modeling | CodeCode Available | 0 |
| Learning to Initialize: Can Meta Learning Improve Cross-task Generalization in Prompt Tuning? | Feb 16, 2023 | Few-Shot LearningLanguage Modeling | —Unverified | 0 |
| Augmented Language Models: a Survey | Feb 15, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| BLIAM: Literature-based Data Synthesis for Synergistic Drug Combination Prediction | Feb 14, 2023 | Data AugmentationLanguage Modeling | —Unverified | 0 |
| AdapterSoup: Weight Averaging to Improve Generalization of Pretrained Language Models | Feb 14, 2023 | ClusteringLanguage Modeling | —Unverified | 0 |
| AI Chat Assistants can Improve Conversations about Divisive Topics | Feb 14, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Modeling Complex Event Scenarios via Simple Entity-focused Questions | Feb 14, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| SwitchPrompt: Learning Domain-Specific Gated Soft Prompts for Classification in Low-Resource Domains | Feb 14, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Language Model Analysis for Ontology Subsumption Inference | Feb 14, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Symbolic Discovery of Optimization Algorithms | Feb 13, 2023 | Contrastive Learningimage-classification | CodeCode Available | 0 |
| Simple Hardware-Efficient Long Convolutions for Sequence Modeling | Feb 13, 2023 | GPUimage-classification | CodeCode Available | 2 |
| Targeted Attack on GPT-Neo for the SATML Language Model Data Extraction Challenge | Feb 13, 2023 | Inference AttackLanguage Modeling | —Unverified | 0 |
| Diminished Diversity-of-Thought in a Standard Large Language Model | Feb 13, 2023 | DiversityLanguage Modeling | —Unverified | 0 |
| Guiding Pretraining in Reinforcement Learning with Large Language Models | Feb 13, 2023 | Common Sense ReasoningLanguage Modeling | CodeCode Available | 1 |
| Towards Agile Text Classifiers for Everyone | Feb 13, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Predicting Class Distribution Shift for Reliable Domain Adaptive Object Detection | Feb 13, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Semantic Importance-Aware Communications Using Pre-trained Language Models | Feb 12, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SemanticAC: Semantics-Assisted Framework for Audio Classification | Feb 12, 2023 | Audio ClassificationClassification | —Unverified | 0 |
| RESDSQL: Decoupling Schema Linking and Skeleton Parsing for Text-to-SQL | Feb 12, 2023 | DecoderLanguage Modeling | CodeCode Available | 2 |
| A Brief Report on LawGPT 1.0: A Virtual Legal Assistant Based on GPT-3 | Feb 11, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Differentiable Outlier Detection Enable Robust Deep Multimodal Analysis | Feb 11, 2023 | Image-text RetrievalKnowledge Graphs | CodeCode Available | 0 |
| Adversarial Transformer Language Models for Contextual Commonsense Inference | Feb 10, 2023 | Knowledge GraphsLanguage Modeling | —Unverified | 0 |
| The Wisdom of Hindsight Makes Language Models Better Instruction Followers | Feb 10, 2023 | Decision MakingLanguage Modeling | CodeCode Available | 1 |