| Vocabulary-free Image Classification | Jun 1, 2023 | Classificationimage-classification | CodeCode Available | 1 |
| Graph-Level Embedding for Time-Evolving Graphs | Jun 1, 2023 | Anomaly DetectionGraph Representation Learning | —Unverified | 0 |
| Faster Causal Attention Over Large Sequences Through Sparse Flash Attention | Jun 1, 2023 | 16k8k | CodeCode Available | 1 |
| Interpretable Math Word Problem Solution Generation Via Step-by-step Planning | Jun 1, 2023 | GSM8KLanguage Modeling | —Unverified | 0 |
| AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration | Jun 1, 2023 | Autonomous DrivingCloud Computing | CodeCode Available | 6 |
| Exposing Attention Glitches with Flip-Flop Language Modeling | Jun 1, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CapText: Large Language Model-based Caption Generation From Image Context and Description | Jun 1, 2023 | Caption GenerationImage to text | —Unverified | 0 |
| How Generative Spoken Language Modeling Encodes Noisy Speech: Investigation from Phonetics to Syntactics | Jun 1, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Understanding Augmentation-based Self-Supervised Representation Learning via RKHS Approximation and Regression | Jun 1, 2023 | Contrastive LearningData Augmentation | —Unverified | 0 |
| Training-free Neural Architecture Search for RNNs and Transformers | Jun 1, 2023 | image-classificationImage Classification | CodeCode Available | 1 |
| Structure-Aware Language Model Pretraining Improves Dense Retrieval on Structured Data | May 31, 2023 | Code SearchLanguage Modeling | CodeCode Available | 1 |
| An Invariant Learning Characterization of Controlled Text Generation | May 31, 2023 | AttributeLanguage Modeling | CodeCode Available | 0 |
| IDAS: Intent Discovery with Abstractive Summarization | May 31, 2023 | Abstractive Text SummarizationDescriptive | CodeCode Available | 1 |
| Adverbs, Surprisingly | May 31, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Human or Not? A Gamified Approach to the Turing Test | May 31, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Neuron to Graph: Interpreting Language Model Neurons at Scale | May 31, 2023 | GPULanguage Modeling | CodeCode Available | 0 |
| LMCap: Few-shot Multilingual Image Captioning by Retrieval Augmented Language Model Prompting | May 31, 2023 | DecoderImage Captioning | CodeCode Available | 0 |
| Speaking the Language of Your Listener: Audience-Aware Adaptation via Plug-and-Play Theory of Mind | May 31, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Red Teaming Language Model Detectors with Language Models | May 31, 2023 | Adversarial RobustnessLanguage Modeling | CodeCode Available | 1 |
| Likelihood-Based Diffusion Language Models | May 30, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| LANCE: Stress-testing Visual Models by Generating Language-guided Counterfactual Images | May 30, 2023 | counterfactualLanguage Modeling | CodeCode Available | 1 |
| GPT4Tools: Teaching Large Language Model to Use Tools via Self-instruction | May 30, 2023 | Image GenerationInstruction Following | CodeCode Available | 2 |
| GPT4GEO: How a Language Model Sees the World's Geography | May 30, 2023 | Disaster ResponseLanguage Modeling | —Unverified | 0 |
| Blockwise Parallel Transformer for Large Context Models | May 30, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| KEYword based Sampling (KEYS) for Large Language Models | May 30, 2023 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |