| Autonomous Building Cyber-Physical Systems Using Decentralized Autonomous Organizations, Digital Twins, and Large Language Model | Oct 25, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Computational Bottlenecks of Training Small-scale Large Language Models | Oct 25, 2024 | GPULanguage Modeling | —Unverified | 0 |
| COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training | Oct 25, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| FedBaF: Federated Learning Aggregation Biased by a Foundation Model | Oct 24, 2024 | Federated LearningLanguage Modeling | —Unverified | 0 |
| AlignCap: Aligning Speech Emotion Captioning to Human Preferences | Oct 24, 2024 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| GCoder: Improving Large Language Model for Generalized Graph Problem Solving | Oct 24, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Scaling up Masked Diffusion Models on Text | Oct 24, 2024 | GSM8KLanguage Modeling | CodeCode Available | 3 |
| Interpretable Bilingual Multimodal Large Language Model for Diverse Biomedical Tasks | Oct 24, 2024 | image-classificationImage Classification | —Unverified | 0 |
| Bielik 7B v0.1: A Polish Language Model -- Development, Insights, and Evaluation | Oct 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Zero-shot Object Navigation with Vision-Language Models Reasoning | Oct 24, 2024 | Decision MakingLanguage Modeling | —Unverified | 0 |
| A Little Help Goes a Long Way: Efficient LLM Training by Leveraging Small LMs | Oct 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LOGO -- Long cOntext aliGnment via efficient preference Optimization | Oct 24, 2024 | GPULanguage Modeling | CodeCode Available | 1 |
| Ferret-UI 2: Mastering Universal User Interface Understanding Across Platforms | Oct 24, 2024 | DiversityLanguage Modeling | —Unverified | 0 |
| Structure Language Models for Protein Conformation Generation | Oct 24, 2024 | Drug DiscoveryLanguage Modeling | —Unverified | 0 |
| Taipan: Efficient and Expressive State Space Language Models with Selective Attention | Oct 24, 2024 | Computational EfficiencyLanguage Modeling | —Unverified | 0 |
| Provably Robust Watermarks for Open-Source Language Models | Oct 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Generalizations across filler-gap dependencies in neural language models | Oct 23, 2024 | Language AcquisitionLanguage Modeling | CodeCode Available | 0 |
| CoreInfer: Accelerating Large Language Model Inference with Semantics-Inspired Adaptive Sparse Activation | Oct 23, 2024 | GPULanguage Modeling | —Unverified | 0 |
| LEGO: Language Model Building Blocks | Oct 23, 2024 | Federated LearningLanguage Modeling | —Unverified | 0 |
| MojoBench: Language Modeling and Benchmarks for Mojo | Oct 23, 2024 | Code GenerationHumanEval | —Unverified | 0 |
| Scaling Diffusion Language Models via Adaptation from Autoregressive Models | Oct 23, 2024 | In-Context LearningLanguage Modeling | CodeCode Available | 3 |
| Cross-model Control: Improving Multiple Large Language Models in One-time Training | Oct 23, 2024 | Instruction FollowingLanguage Modeling | CodeCode Available | 1 |
| GraphTeam: Facilitating Large Language Model-based Graph Analysis via Multi-Agent Collaboration | Oct 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Lightweight Neural App Control | Oct 23, 2024 | Decision MakingLanguage Modeling | —Unverified | 0 |
| LMLPA: Language Model Linguistic Personality Assessment | Oct 23, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |