| Modeling Complex Mathematical Reasoning via Large Language Model based MathAgent | Dec 14, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Unbiased organism-agnostic and highly sensitive signal peptide predictor with deep protein language model | Dec 14, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| TAP4LLM: Table Provider on Sampling, Augmenting, and Packing Semi-structured Data for Large Language Model Reasoning | Dec 14, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Helping or Herding? Reward Model Ensembles Mitigate but do not Eliminate Reward Hacking | Dec 14, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| ViLA: Efficient Video-Language Alignment for Video Question Answering | Dec 13, 2023 | cross-modal alignmentLanguage Modeling | CodeCode Available | 1 |
| SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention | Dec 13, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| On Diversified Preferences of Large Language Model Alignment | Dec 12, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| READ: Recurrent Adapter with Partial Video-Language Alignment for Parameter-Efficient Transfer Learning in Low-Resource Video-Language Modeling | Dec 12, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Hallucination Augmented Contrastive Learning for Multimodal Large Language Model | Dec 12, 2023 | Contrastive LearningHallucination | CodeCode Available | 1 |
| Gated Linear Attention Transformers with Hardware-Efficient Training | Dec 11, 2023 | 2kLanguage Modeling | CodeCode Available | 1 |