| Memoria: Resolving Fateful Forgetting Problem through Human-Inspired Memory Architecture | Oct 4, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Heterogeneous Federated Learning Using Knowledge Codistillation | Oct 4, 2023 | Federated Learningimage-classification | —Unverified | 0 |
| From Words to Watts: Benchmarking the Energy Costs of Large Language Model Inference | Oct 4, 2023 | BenchmarkingGPU | —Unverified | 0 |
| Zero Resource Code-switched Speech Benchmark Using Speech Utterance Pairs For Multiple Spoken Languages | Oct 4, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| HPC-GPT: Integrating Large Language Model for High-Performance Computing | Oct 3, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| An evolutionary model of personality traits related to cooperative behavior using a large language model | Oct 3, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Linear Recurrent Units for Sequential Recommendation | Oct 3, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| SEA: Sparse Linear Attention with Estimated Attention Mask | Oct 3, 2023 | Knowledge DistillationLanguage Modeling | CodeCode Available | 1 |
| Large Language Models for Test-Free Fault Localization | Oct 3, 2023 | Fault localizationLanguage Modeling | CodeCode Available | 1 |
| Self-Taught Optimizer (STOP): Recursively Self-Improving Code Generation | Oct 3, 2023 | Code GenerationLanguage Modeling | CodeCode Available | 1 |
| Dodo: Dynamic Contextual Compression for Decoder-only LMs | Oct 3, 2023 | DecoderLanguage Modeling | —Unverified | 0 |
| Ring Attention with Blockwise Transformers for Near-Infinite Context | Oct 3, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Can a student Large Language Model perform as well as it's teacher? | Oct 3, 2023 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns | Oct 3, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| A Dynamic LLM-Powered Agent Network for Task-Oriented Agent Collaboration | Oct 3, 2023 | Arithmetic ReasoningCode Generation | CodeCode Available | 1 |
| TWIZ-v2: The Wizard of Multimodal Conversational-Stimulus | Oct 3, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Towards End-to-End Embodied Decision Making via Multi-modal Large Language Model: Explorations with GPT4-Vision and Beyond | Oct 3, 2023 | Decision MakingLanguage Modeling | CodeCode Available | 1 |
| OceanGPT: A Large Language Model for Ocean Science Tasks | Oct 3, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Tuning Large language model for End-to-end Speech Translation | Oct 3, 2023 | de-enfr-en | —Unverified | 0 |
| Talk2BEV: Language-enhanced Bird's-eye View Maps for Autonomous Driving | Oct 3, 2023 | Autonomous DrivingDecision Making | CodeCode Available | 1 |
| Nugget: Neural Agglomerative Embeddings of Text | Oct 3, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| LoFT: Local Proxy Fine-tuning For Improving Transferability Of Adversarial Attacks Against Large Language Model | Oct 2, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| What's the Magic Word? A Control Theory of LLM Prompting | Oct 2, 2023 | Causal Language ModelingLanguage Modeling | CodeCode Available | 1 |
| PolySketchFormer: Fast Transformers via Sketching Polynomial Kernels | Oct 2, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Fool Your (Vision and) Language Model With Embarrassingly Simple Permutations | Oct 2, 2023 | In-Context LearningInstruction Following | CodeCode Available | 1 |
| DriveGPT4: Interpretable End-to-end Autonomous Driving via Large Language Model | Oct 2, 2023 | Autonomous DrivingLanguage Modeling | —Unverified | 0 |
| Language Model Decoding as Direct Metrics Optimization | Oct 2, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| L2MAC: Large Language Model Automatic Computer for Extensive Code Generation | Oct 2, 2023 | Code GenerationLanguage Modeling | CodeCode Available | 1 |
| Improving Emotional Expression and Cohesion in Image-Based Playlist Description and Music Topics: A Continuous Parameterization Approach | Oct 2, 2023 | continuous-controlContinuous Control | —Unverified | 0 |
| Error Norm Truncation: Robust Training in the Presence of Data Noise for Text Generation Models | Oct 2, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CAT-LM: Training Language Models on Aligned Code And Tests | Oct 2, 2023 | Code GenerationLanguage Modeling | CodeCode Available | 1 |
| Syllable-level lyrics generation from melody exploiting character-level language model | Oct 2, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Large Language Model-Powered Smart Contract Vulnerability Detection: New Perspectives | Oct 2, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Reasoning on Graphs: Faithful and Interpretable Large Language Model Reasoning | Oct 2, 2023 | Knowledge GraphsLanguage Modeling | CodeCode Available | 1 |
| GPT-Driver: Learning to Drive with GPT | Oct 2, 2023 | Autonomous DrivingAutonomous Vehicles | CodeCode Available | 2 |
| A Framework for Inference Inspired by Human Memory Mechanisms | Oct 1, 2023 | image-classificationImage Classification | CodeCode Available | 1 |
| Parameter-Efficient Tuning Helps Language Model Alignment | Oct 1, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Meta Semantic Template for Evaluation of Large Language Models | Oct 1, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Adaptive-Solver Framework for Dynamic Strategy Selection in Large Language Model Reasoning | Oct 1, 2023 | Computational EfficiencyLanguage Modeling | CodeCode Available | 0 |
| Comics for Everyone: Generating Accessible Text Descriptions for Comic Strips | Oct 1, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Source Attribution for Large Language Model-Generated Data | Oct 1, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| AutomaTikZ: Text-Guided Synthesis of Scientific Vector Graphics with TikZ | Sep 30, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| SLM: Bridge the thin gap between speech and text foundation models | Sep 30, 2023 | Instruction FollowingLanguage Modeling | —Unverified | 0 |
| Finding Pragmatic Differences Between Disciplines | Sep 30, 2023 | DiversityDocument Summarization | —Unverified | 0 |
| UPAR: A Kantian-Inspired Prompting Framework for Enhancing Large Language Model Capabilities | Sep 30, 2023 | Causal JudgmentGSM8K | —Unverified | 0 |
| Dynamic Demonstrations Controller for In-Context Learning | Sep 30, 2023 | In-Context LearningLanguage Modeling | CodeCode Available | 0 |
| From Language Modeling to Instruction Following: Understanding the Behavior Shift in LLMs after Instruction Tuning | Sep 30, 2023 | Instruction FollowingLanguage Modeling | CodeCode Available | 1 |
| LoRA ensembles for large language model fine-tuning | Sep 29, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Motif: Intrinsic Motivation from Artificial Intelligence Feedback | Sep 29, 2023 | Decision MakingLanguage Modeling | CodeCode Available | 1 |
| LLM-grounded Video Diffusion Models | Sep 29, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |