| Adaptive Experimentation at Scale: A Computational Framework for Flexible Batches | Mar 21, 2023 | BenchmarkingThompson Sampling | —Unverified | 0 |
| DeID-GPT: Zero-shot Medical Text De-Identification by GPT-4 | Mar 20, 2023 | BenchmarkingDe-identification | CodeCode Available | 1 |
| A Multi-Task Deep Learning Approach for Sensor-based Human Activity Recognition and Segmentation | Mar 20, 2023 | Activity RecognitionBenchmarking | —Unverified | 0 |
| Benchmarking Robustness of 3D Object Detection to Common Corruptions in Autonomous Driving | Mar 20, 2023 | 3D Object DetectionAutonomous Driving | CodeCode Available | 0 |
| Revisiting Realistic Test-Time Training: Sequential Inference and Adaptation by Anchored Clustering Regularized Self-Training | Mar 20, 2023 | BenchmarkingClustering | CodeCode Available | 1 |
| COVID-19 event extraction from Twitter via extractive question answering with continuous prompts | Mar 19, 2023 | BenchmarkingEvent Extraction | CodeCode Available | 1 |
| CCTV-Gun: Benchmarking Handgun Detection in CCTV Images | Mar 19, 2023 | Benchmarkingobject-detection | CodeCode Available | 1 |
| NoisyHate: Mining Online Human-Written Perturbations for Realistic Robustness Benchmarking of Content Moderation Models | Mar 18, 2023 | Adversarial AttackBenchmarking | —Unverified | 0 |
| DeAR: Debiasing Vision-Language Models with Additive Residuals | Mar 18, 2023 | AttributeBenchmarking | —Unverified | 0 |
| Highly Accurate Quantum Chemical Property Prediction with Uni-Mol+ | Mar 16, 2023 | BenchmarkingGraph Regression | CodeCode Available | 3 |