| CoSQA+: Pioneering the Multi-Choice Code Search Benchmark with Test-Driven Agents | Jun 17, 2024 | Code GenerationCode Search | CodeCode Available | 0 | 5 |
| Increasing Probability Mass on Answer Choices Does Not Always Improve Accuracy | May 24, 2023 | In-Context LearningMultiple-choice | CodeCode Available | 0 | 5 |
| EquivaMap: Leveraging LLMs for Automatic Equivalence Checking of Optimization Formulations | Feb 20, 2025 | Combinatorial Optimizationvalid | CodeCode Available | 0 | 5 |
| A PAC-Bayes Analysis of Adversarial Robustness | Feb 19, 2021 | Adversarial RobustnessGeneralization Bounds | CodeCode Available | 0 | 5 |
| Model Generalization: A Sharpness Aware Optimization Perspective | Aug 14, 2022 | modelvalid | CodeCode Available | 0 | 5 |
| Enhancing reliability in prediction intervals using point forecasters: Heteroscedastic Quantile Regression and Width-Adaptive Conformal Inference | Jun 21, 2024 | PredictionPrediction Intervals | CodeCode Available | 0 | 5 |
| Endogenous Macrodynamics in Algorithmic Recourse | Aug 16, 2023 | counterfactualvalid | CodeCode Available | 0 | 5 |
| Instrumental Variable Estimation for Compositional Treatments | Jun 21, 2021 | Diversityvalid | CodeCode Available | 0 | 5 |
| Employing self-supervised learning models for cross-linguistic child speech maturity classification | Jun 10, 2025 | Self-Supervised Learningvalid | CodeCode Available | 0 | 5 |
| EmRel: Joint Representation of Entities and Embedded Relations for Multi-triple Extraction | Jul 1, 2022 | Document-level Relation ExtractionJoint Entity and Relation Extraction | CodeCode Available | 0 | 5 |