| Large Language Model Agent for Hyper-Parameter Optimization | Feb 2, 2024 | AutoML, Hyperparameter Optimization | Unverified | 0 |
| What Will My Model Forget? Forecasting Forgotten Examples in Language Model Refinement | Feb 2, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| A Dynamical Model of Neural Scaling Laws | Feb 2, 2024 | model | Unverified | 0 |
| KTO: Model Alignment as Prospect Theoretic Optimization | Feb 2, 2024 | Attribute, model | Code Available | 4 |
| A Probabilistic Model Behind Self-Supervised Learning | Feb 2, 2024 | model, Representation Learning | Code Available | 0 |
| Need a Small Specialized Language Model? Plan Early! | Feb 2, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| The Information of Large Language Model Geometry | Feb 1, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Towards Efficient Exact Optimization of Language Model Alignment | Feb 1, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| CroissantLLM: A Truly Bilingual French-English Language Model | Feb 1, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| EuroPED-NN: Uncertainty aware surrogate model | Feb 1, 2024 | model | Unverified | 0 |
| Masked Conditional Diffusion Model for Enhancing Deepfake Detection | Feb 1, 2024 | Data Augmentation, DeepFake Detection | Unverified | 0 |
| Diffusion Model Compression for Image-to-Image Translation | Jan 31, 2024 | Conditional Image Generation, Denoising | Unverified | 0 |
| Improving QA Model Performance with Cartographic Inoculation | Jan 30, 2024 | model | Unverified | 0 |
| CaMU: Disentangling Causal Effects in Deep Model Unlearning | Jan 30, 2024 | Machine Unlearning, model | Code Available | 0 |
| Dynamical System Identification, Model Selection and Model Uncertainty Quantification by Bayesian Inference | Jan 30, 2024 | Bayesian Inference, model | Unverified | 0 |
| Gradient-Based Language Model Red Teaming | Jan 30, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Engineering A Large Language Model From Scratch | Jan 30, 2024 | Deep Learning, Language Modeling | Unverified | 0 |
| Diffusion model for relational inference | Jan 30, 2024 | Imputation, model | Code Available | 0 |
| CFTM: Continuous time fractional topic model | Jan 29, 2024 | Articles, Dynamic Topic Modeling | Unverified | 0 |
| New Foggy Object Detecting Model | Jan 27, 2024 | model, Object | Unverified | 0 |
| MaLLaM -- Malaysia Large Language Model | Jan 26, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| ChemDFM: A Large Language Foundation Model for Chemistry | Jan 26, 2024 | Form, model | Code Available | 2 |
| Hierarchical Continual Reinforcement Learning via Large Language Model | Jan 25, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| MoE-Infinity: Efficient MoE Inference on Personal Machines with Sparsity-Aware Expert Cache | Jan 25, 2024 | GPU, model | Code Available | 3 |
| Accelerating Retrieval-Augmented Language Model Serving with Speculation | Jan 25, 2024 | Language Modeling, Language Modelling | Unverified | 0 |