| Scaling Up Resonate-and-Fire Networks for Fast Deep Learning | Apr 1, 2025 | BenchmarkingDeep Learning | CodeCode Available | 0 |
| Benchmarking Federated Machine Unlearning methods for Tabular Data | Apr 1, 2025 | BenchmarkingComputational Efficiency | —Unverified | 0 |
| Can LLMs Grasp Implicit Cultural Values? Benchmarking LLMs' Metacognitive Cultural Intelligence with CQ-Bench | Apr 1, 2025 | Benchmarking | CodeCode Available | 0 |
| Automated Factual Benchmarking for In-Car Conversational Systems using Large Language Models | Apr 1, 2025 | BenchmarkingConversational Question Answering | —Unverified | 0 |
| LOCO-EPI: Leave-one-chromosome-out (LOCO) as a benchmarking paradigm for deep learning based prediction of enhancer-promoter interactions | Apr 1, 2025 | Benchmarking | CodeCode Available | 0 |
| On Benchmarking Code LLMs for Android Malware Analysis | Apr 1, 2025 | BenchmarkingMalware Analysis | —Unverified | 0 |
| SciReplicate-Bench: Benchmarking LLMs in Agent-driven Algorithmic Reproduction from Research Papers | Mar 31, 2025 | Benchmarking | CodeCode Available | 1 |
| Uni-Render: A Unified Accelerator for Real-Time Rendering Across Diverse Neural Renderers | Mar 31, 2025 | BenchmarkingNeural Rendering | —Unverified | 0 |
| Towards Benchmarking and Assessing the Safety and Robustness of Autonomous Driving on Safety-critical Scenarios | Mar 31, 2025 | Adversarial AttackAutonomous Driving | —Unverified | 0 |
| Simple Feedfoward Neural Networks are Almost All You Need for Time Series Forecasting | Mar 30, 2025 | AllBenchmarking | —Unverified | 0 |