| DnA-Eval: Enhancing Large Language Model Evaluation through Decomposition and Aggregation | May 24, 2024 | Language Model Evaluation, Language Modeling | Unverified | 0 |
| Decoding at the Speed of Thought: Harnessing Parallel Decoding of Lexical Units for LLMs | May 24, 2024 | Code Generation, Language Modeling | Code Available | 0 |
| iREPO: implicit Reward Pairwise Difference based Empirical Preference Optimization | May 24, 2024 | Language Model Evaluation, Language Modeling | Unverified | 0 |
| Sparse Matrix in Large Language Model Fine-tuning | May 24, 2024 | GPU, Language Modeling | Code Available | 1 |
| Composed Image Retrieval for Remote Sensing | May 24, 2024 | Composed Image Retrieval (CoIR), Descriptive | Code Available | 2 |
| Off-the-shelf ChatGPT is a Good Few-shot Human Motion Predictor | May 24, 2024 | Human motion prediction, In-Context Learning | Unverified | 0 |
| Scaling Laws for Discriminative Classification in Large Language Models | May 24, 2024 | Hallucination, Language Modeling | Unverified | 0 |
| Sparse maximal update parameterization: A holistic approach to sparse training dynamics | May 24, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Emergence of a High-Dimensional Abstraction Phase in Language Transformers | May 24, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| SEP: Self-Enhanced Prompt Tuning for Visual-Language Model | May 24, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Learning Beyond Pattern Matching? Assaying Mathematical Understanding in LLMs | May 24, 2024 | In-Context Learning, Language Modeling | Unverified | 0 |
| GECKO: Generative Language Model for English, Code and Korean | May 24, 2024 | kmmlu, Language Modeling | Unverified | 0 |
| Inverse-RLignment: Large Language Model Alignment from Demonstrations through Inverse Reinforcement Learning | May 24, 2024 | Decision Making, Language Modeling | Unverified | 0 |
| Aya 23: Open Weight Releases to Further Multilingual Progress | May 23, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Contrastive and Consistency Learning for Neural Noisy-Channel Model in Spoken Language Understanding | May 23, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| AutoCoder: Enhancing Code Large Language Model with AIEV-Instruct | May 23, 2024 | Class-level Code Generation, Code Completion | Code Available | 4 |
| Extracting Prompts by Inverting LLM Outputs | May 23, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Lessons from the Trenches on Reproducible Evaluation of Language Models | May 23, 2024 | Language Model Evaluation, Language Modeling | Unverified | 0 |
| BiMix: A Bivariate Data Mixing Law for Language Model Pretraining | May 23, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Efficient Medical Question Answering with Knowledge-Augmented Question Generation | May 23, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Not All Language Model Features Are Linear | May 23, 2024 | All, Language Modeling | Code Available | 2 |
| Distributed Speculative Inference (DSI): Speculation Parallelism for Provably Faster Lossless Language Model Inference | May 23, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Visual Echoes: A Simple Unified Transformer for Audio-Visual Generation | May 23, 2024 | Audio Generation, Denoising | Unverified | 0 |
| From Text to Pixel: Advancing Long-Context Understanding in MLLMs | May 23, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Large language models can be zero-shot anomaly detectors for time series? | May 23, 2024 | Anomaly Detection, Language Modeling | Code Available | 2 |