| REST: Retrieval-Based Speculative Decoding | Nov 14, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Tamil-Llama: A New Tamil Language Model Based on Llama 2 | Nov 10, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| BeLLM: Backward Dependency Enhanced Large Language Model for Sentence Embeddings | Nov 9, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| On the Road with GPT-4V(ision): Early Explorations of Visual-Language Model on Autonomous Driving | Nov 9, 2023 | Autonomous DrivingCommon Sense Reasoning | CodeCode Available | 2 |
| Large Trajectory Models are Scalable Motion Predictors and Planners | Oct 30, 2023 | Autonomous DrivingLanguage Modeling | CodeCode Available | 2 |
| Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution | Oct 25, 2023 | DenoisingLanguage Modeling | CodeCode Available | 2 |
| DISC-FinLLM: A Chinese Financial Large Language Model based on Multiple Experts Fine-tuning | Oct 23, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| PromptCBLUE: A Chinese Prompt Tuning Benchmark for the Medical Domain | Oct 22, 2023 | Dialogue GenerationDialogue Understanding | CodeCode Available | 2 |
| Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture | Oct 18, 2023 | 4kimage-classification | CodeCode Available | 2 |
| BitNet: Scaling 1-bit Transformers for Large Language Models | Oct 17, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |