| Memoria: Resolving Fateful Forgetting Problem through Human-Inspired Memory Architecture | Oct 4, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Zero Resource Code-switched Speech Benchmark Using Speech Utterance Pairs For Multiple Spoken Languages | Oct 4, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Heterogeneous Federated Learning Using Knowledge Codistillation | Oct 4, 2023 | Federated Learningimage-classification | —Unverified | 0 |
| From Words to Watts: Benchmarking the Energy Costs of Large Language Model Inference | Oct 4, 2023 | BenchmarkingGPU | —Unverified | 0 |
| HPC-GPT: Integrating Large Language Model for High-Performance Computing | Oct 3, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| An evolutionary model of personality traits related to cooperative behavior using a large language model | Oct 3, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Linear Recurrent Units for Sequential Recommendation | Oct 3, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Dodo: Dynamic Contextual Compression for Decoder-only LMs | Oct 3, 2023 | DecoderLanguage Modeling | —Unverified | 0 |
| Large Language Models for Test-Free Fault Localization | Oct 3, 2023 | Fault localizationLanguage Modeling | CodeCode Available | 1 |
| Self-Taught Optimizer (STOP): Recursively Self-Improving Code Generation | Oct 3, 2023 | Code GenerationLanguage Modeling | CodeCode Available | 1 |
| SEA: Sparse Linear Attention with Estimated Attention Mask | Oct 3, 2023 | Knowledge DistillationLanguage Modeling | CodeCode Available | 1 |
| Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns | Oct 3, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Ring Attention with Blockwise Transformers for Near-Infinite Context | Oct 3, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Can a student Large Language Model perform as well as it's teacher? | Oct 3, 2023 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| OceanGPT: A Large Language Model for Ocean Science Tasks | Oct 3, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| A Dynamic LLM-Powered Agent Network for Task-Oriented Agent Collaboration | Oct 3, 2023 | Arithmetic ReasoningCode Generation | CodeCode Available | 1 |
| TWIZ-v2: The Wizard of Multimodal Conversational-Stimulus | Oct 3, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Tuning Large language model for End-to-end Speech Translation | Oct 3, 2023 | de-enfr-en | —Unverified | 0 |
| Talk2BEV: Language-enhanced Bird's-eye View Maps for Autonomous Driving | Oct 3, 2023 | Autonomous DrivingDecision Making | CodeCode Available | 1 |
| Towards End-to-End Embodied Decision Making via Multi-modal Large Language Model: Explorations with GPT4-Vision and Beyond | Oct 3, 2023 | Decision MakingLanguage Modeling | CodeCode Available | 1 |
| Nugget: Neural Agglomerative Embeddings of Text | Oct 3, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| LoFT: Local Proxy Fine-tuning For Improving Transferability Of Adversarial Attacks Against Large Language Model | Oct 2, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| What's the Magic Word? A Control Theory of LLM Prompting | Oct 2, 2023 | Causal Language ModelingLanguage Modeling | CodeCode Available | 1 |
| PolySketchFormer: Fast Transformers via Sketching Polynomial Kernels | Oct 2, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Fool Your (Vision and) Language Model With Embarrassingly Simple Permutations | Oct 2, 2023 | In-Context LearningInstruction Following | CodeCode Available | 1 |