| M-RAG: Reinforcing Large Language Model Performance through Retrieval-Augmented Generation with Multiple Partitions | May 26, 2024 | Dialogue GenerationLanguage Modeling | —Unverified | 0 |
| AdaFisher: Adaptive Second Order Optimization via Fisher Information | May 26, 2024 | Computational Efficiencyimage-classification | CodeCode Available | 2 |
| Chain of Tools: Large Language Model is an Automatic Multi-tool Learner | May 26, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| gzip Predicts Data-dependent Scaling Laws | May 26, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| LoQT: Low-Rank Adapters for Quantized Pretraining | May 26, 2024 | GPULanguage Modeling | CodeCode Available | 2 |
| Towards Multi-Task Multi-Modal Models: A Video Generative Perspective | May 26, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Survey of Multimodal Large Language Model from A Data-centric Perspective | May 26, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Synthesizing Programmatic Reinforcement Learning Policies with Large Language Model Guided Search | May 26, 2024 | Common Sense ReasoningLanguage Modeling | —Unverified | 0 |
| Semantic Importance-Aware Communications with Semantic Correction Using Large Language Models | May 25, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Theoretical Analysis of Weak-to-Strong Generalization | May 25, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| How Well Do Deep Learning Models Capture Human Concepts? The Case of the Typicality Effect | May 25, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| M^3GPT: An Advanced Multimodal, Multitask Framework for Motion Comprehension and Generation | May 25, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| MoEUT: Mixture-of-Experts Universal Transformers | May 25, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| A transfer learning framework for weak-to-strong generalization | May 25, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Evolutionary Large Language Model for Automated Feature Transformation | May 25, 2024 | Efficient ExplorationEvolutionary Algorithms | CodeCode Available | 1 |
| Finetuning Large Language Model for Personalized Ranking | May 25, 2024 | Explainable RecommendationLanguage Modeling | CodeCode Available | 1 |
| Large Language Model Pruning | May 24, 2024 | HallucinationLanguage Modeling | —Unverified | 0 |
| Large Language Model (LLM) for Standard Cell Layout Design Optimization | May 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Large Language Model Sentinel: LLM Agent for Adversarial Purification | May 24, 2024 | Adversarial DefenseAdversarial Purification | —Unverified | 0 |
| ART: Automatic Red-teaming for Text-to-Image Models to Protect Benign Users | May 24, 2024 | DiversityLanguage Modeling | CodeCode Available | 1 |
| Enhancing Augmentative and Alternative Communication with Card Prediction and Colourful Semantics | May 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LM4LV: A Frozen Large Language Model for Low-level Vision Tasks | May 24, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| RAEE: A Robust Retrieval-Augmented Early Exiting Framework for Efficient Inference | May 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| DeTikZify: Synthesizing Graphics Programs for Scientific Figures and Sketches with TikZ | May 24, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 5 |
| Synergizing In-context Learning with Hints for End-to-end Task-oriented Dialog Systems | May 24, 2024 | In-Context LearningLanguage Modeling | —Unverified | 0 |
| DnA-Eval: Enhancing Large Language Model Evaluation through Decomposition and Aggregation | May 24, 2024 | Language Model EvaluationLanguage Modeling | —Unverified | 0 |
| Decoding at the Speed of Thought: Harnessing Parallel Decoding of Lexical Units for LLMs | May 24, 2024 | Code GenerationLanguage Modeling | CodeCode Available | 0 |
| iREPO: implicit Reward Pairwise Difference based Empirical Preference Optimization | May 24, 2024 | Language Model EvaluationLanguage Modeling | —Unverified | 0 |
| Sparse Matrix in Large Language Model Fine-tuning | May 24, 2024 | GPULanguage Modeling | CodeCode Available | 1 |
| Composed Image Retrieval for Remote Sensing | May 24, 2024 | Composed Image Retrieval (CoIR)Descriptive | CodeCode Available | 2 |
| Off-the-shelf ChatGPT is a Good Few-shot Human Motion Predictor | May 24, 2024 | Human motion predictionIn-Context Learning | —Unverified | 0 |
| Scaling Laws for Discriminative Classification in Large Language Models | May 24, 2024 | HallucinationLanguage Modeling | —Unverified | 0 |
| Sparse maximal update parameterization: A holistic approach to sparse training dynamics | May 24, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Emergence of a High-Dimensional Abstraction Phase in Language Transformers | May 24, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| SEP: Self-Enhanced Prompt Tuning for Visual-Language Model | May 24, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Learning Beyond Pattern Matching? Assaying Mathematical Understanding in LLMs | May 24, 2024 | In-Context LearningLanguage Modeling | —Unverified | 0 |
| GECKO: Generative Language Model for English, Code and Korean | May 24, 2024 | kmmluLanguage Modeling | —Unverified | 0 |
| Inverse-RLignment: Large Language Model Alignment from Demonstrations through Inverse Reinforcement Learning | May 24, 2024 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Aya 23: Open Weight Releases to Further Multilingual Progress | May 23, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Contrastive and Consistency Learning for Neural Noisy-Channel Model in Spoken Language Understanding | May 23, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| AutoCoder: Enhancing Code Large Language Model with AIEV-Instruct | May 23, 2024 | Class-level Code GenerationCode Completion | CodeCode Available | 4 |
| Extracting Prompts by Inverting LLM Outputs | May 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Lessons from the Trenches on Reproducible Evaluation of Language Models | May 23, 2024 | Language Model EvaluationLanguage Modeling | —Unverified | 0 |
| BiMix: A Bivariate Data Mixing Law for Language Model Pretraining | May 23, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Efficient Medical Question Answering with Knowledge-Augmented Question Generation | May 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Not All Language Model Features Are Linear | May 23, 2024 | AllLanguage Modeling | CodeCode Available | 2 |
| Distributed Speculative Inference (DSI): Speculation Parallelism for Provably Faster Lossless Language Model Inference | May 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Visual Echoes: A Simple Unified Transformer for Audio-Visual Generation | May 23, 2024 | Audio GenerationDenoising | —Unverified | 0 |
| From Text to Pixel: Advancing Long-Context Understanding in MLLMs | May 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Large language models can be zero-shot anomaly detectors for time series? | May 23, 2024 | Anomaly DetectionLanguage Modeling | CodeCode Available | 2 |