| Nexus: Specialization meets Adaptability for Efficiently Training Mixture of Experts | Aug 28, 2024 | Mixture-of-Experts | —Unverified | 0 |
| Parameter-Efficient Quantized Mixture-of-Experts Meets Vision-Language Instruction Tuning for Semiconductor Electron Micrograph Analysis | Aug 27, 2024 | Instruction FollowingLanguage Modeling | —Unverified | 0 |
| Advancing Enterprise Spatio-Temporal Forecasting Applications: Data Mining Meets Instruction Tuning of Language Models For Multi-modal Time Series Analysis in Low-Resource Settings | Aug 24, 2024 | Decision MakingMixture-of-Experts | —Unverified | 0 |
| The Ultimate Guide to Fine-Tuning LLMs from Basics to Breakthroughs: An Exhaustive Review of Technologies, Research, Best Practices, Applied Research Challenges and Opportunities | Aug 23, 2024 | Computational EfficiencyInference Optimization | —Unverified | 0 |
| La-SoftMoE CLIP for Unified Physical-Digital Face Attack Detection | Aug 23, 2024 | Mixture-of-Experts | —Unverified | 0 |
| Multi-Treatment Multi-Task Uplift Modeling for Enhancing User Growth | Aug 23, 2024 | Causal InferenceMixture-of-Experts | —Unverified | 0 |
| DutyTTE: Deciphering Uncertainty in Origin-Destination Travel Time Estimation | Aug 23, 2024 | Deep Reinforcement LearningMixture-of-Experts | CodeCode Available | 0 |
| SQL-GEN: Bridging the Dialect Gap for Text-to-SQL Via Synthetic Data And Model Merging | Aug 22, 2024 | DiversityMixture-of-Experts | —Unverified | 0 |
| Jamba-1.5: Hybrid Transformer-Mamba Models at Scale | Aug 22, 2024 | ChatbotInstruction Following | CodeCode Available | 5 |
| Improving Factuality in Large Language Models via Decoding-Time Hallucinatory and Truthful Comparators | Aug 22, 2024 | HallucinationMixture-of-Experts | CodeCode Available | 0 |