| Efficient Length-Generalizable Attention via Causal Retrieval for Long-Context Language Modeling | Oct 2, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Mind Scramble: Unveiling Large Language Model Psychology Via Typoglycemia | Oct 2, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| ConServe: Harvesting GPUs for Low-Latency and High-Throughput Large Language Model Serving | Oct 2, 2024 | Benchmarking, Document Summarization | Unverified | 0 |
| When a language model is optimized for reasoning, does it still show embers of autoregression? An analysis of OpenAI o1 | Oct 2, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Spoken Grammar Assessment Using LLM | Oct 2, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Closed-Loop Long-Horizon Robotic Planning via Equilibrium Sequence Modeling | Oct 2, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Circuit Compositions: Exploring Modular Structures in Transformer-Based Language Models | Oct 2, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| OCC-MLLM: Empowering Multimodal Large Language Model For the Understanding of Occluded Objects | Oct 2, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Rethinking Misalignment in Vision-Language Model Adaptation from a Causal Perspective | Oct 1, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| End-to-End Speech Recognition with Pre-trained Masked Language Model | Oct 1, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |