| BLADE: Benchmarking Language Model Agents for Data-Driven Science | Aug 19, 2024 | Benchmarking, Decision Making | Code Available | 1 |
| CelebA-Spoof Challenge 2020 on Face Anti-Spoofing: Methods and Results | Feb 25, 2021 | Face Anti-Spoofing, valid | Code Available | 1 |
| Certified Deductive Reasoning with Language Models | Jun 6, 2023 | Logical Reasoning, valid | Code Available | 1 |
| Characterizing information loss in a chaotic double pendulum with the Information Bottleneck | Oct 25, 2022 | valid | Code Available | 1 |
| Chronocept: Instilling a Sense of Time in Machines | May 12, 2025 | Fact Checking, RAG | Code Available | 1 |
| Classification under Nuisance Parameters and Generalized Label Shift in Likelihood-Free Inference | Feb 8, 2024 | Domain Adaptation, Uncertainty Quantification | Code Available | 1 |
| CNN-based Approaches For Cross-Subject Classification in Motor Imagery: From The State-of-The-Art to DynamicNet | May 17, 2021 | Brain Computer Interface, Deep Learning | Code Available | 1 |
| CoCoA-MT: A Dataset and Benchmark for Contrastive Controlled MT with Application to Formality | May 9, 2022 | Machine Translation, Sentence | Code Available | 1 |
| Conditional Measurement Density Estimation in Sequential Monte Carlo via Normalizing Flow | Mar 16, 2022 | Density Estimation, valid | Code Available | 1 |
| Benchmarking structure-based three-dimensional molecular generative models using GenBench3D: ligand conformation quality matters | Jul 5, 2024 | Benchmarking, valid | Code Available | 1 |