| Confronting Reward Model Overoptimization with Constrained RLHF | Oct 6, 2023 | model | Code Available | 1 |
| Continuous-Time Model-Based Reinforcement Learning | Feb 9, 2021 | model, Model-based Reinforcement Learning | Code Available | 1 |
| A Recurrent Latent Variable Model for Sequential Data | Jun 7, 2015 | model | Code Available | 1 |
| Collaborative Large Language Model for Recommender Systems | Nov 2, 2023 | Hallucination, Language Modeling | Code Available | 1 |
| COMBO: Conservative Offline Model-Based Policy Optimization | Feb 16, 2021 | model, Offline RL | Code Available | 1 |
| CNN Model & Tuning for Global Road Damage Detection | Mar 17, 2021 | Avg, GPU | Code Available | 1 |
| Large Language Model Unlearning | Oct 14, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| Large-Vocabulary 3D Diffusion Model with Transformer | Sep 14, 2023 | 3D Generation, Diversity | Code Available | 1 |
| clusterBMA: Bayesian model averaging for clustering | Sep 9, 2022 | Clustering, Electroencephalogram (EEG) | Code Available | 1 |
| CLIP model is an Efficient Continual Learner | Oct 6, 2022 | Continual Learning, Incremental Learning | Code Available | 1 |