| Towards quantitative precision for ECG analysis: Leveraging state space models, self-supervision and patient metadata | Aug 29, 2023 | BenchmarkingDiagnostic | CodeCode Available | 1 |
| MLLM-DataEngine: An Iterative Refinement Approach for MLLM | Aug 25, 2023 | Benchmarking | CodeCode Available | 1 |
| LLMRec: Benchmarking Large Language Models on Recommendation Task | Aug 23, 2023 | BenchmarkingExplanation Generation | CodeCode Available | 1 |
| VI-Net: Boosting Category-level 6D Object Pose Estimation via Learning Decoupled Rotations on the Spherical Representations | Aug 19, 2023 | 6D Pose Estimation using RGBBenchmarking | CodeCode Available | 1 |
| Benchmarking Neural Network Generalization for Grammar Induction | Aug 16, 2023 | Benchmarking | CodeCode Available | 1 |
| Benchmarking Generated Poses: How Rational is Structure-based Drug Design with Generative Models? | Aug 14, 2023 | BenchmarkingDrug Design | CodeCode Available | 1 |
| DIG In: Evaluating Disparities in Image Generations with Indicators for Geographic Diversity | Aug 11, 2023 | BenchmarkingDiversity | CodeCode Available | 1 |
| A Comparative Visual Analytics Framework for Evaluating Evolutionary Processes in Multi-objective Optimization | Aug 10, 2023 | BenchmarkingDecision Making | CodeCode Available | 1 |
| LLMeBench: A Flexible Framework for Accelerating LLMs Benchmarking | Aug 9, 2023 | BenchmarkingFew-Shot Learning | CodeCode Available | 1 |
| Application-Oriented Benchmarking of Quantum Generative Learning Using QUARK | Aug 8, 2023 | BenchmarkingGPU | CodeCode Available | 1 |
| XFlow: Benchmarking Flow Behaviors over Graphs | Aug 7, 2023 | Benchmarking | CodeCode Available | 1 |
| qgym: A Gym for Training and Benchmarking RL-Based Quantum Compilation | Aug 1, 2023 | BenchmarkingOpenAI Gym | CodeCode Available | 1 |
| Benchmarking and Analyzing Robust Point Cloud Recognition: Bag of Tricks for Defending Adversarial Examples | Jul 31, 2023 | Adversarial RobustnessBenchmarking | CodeCode Available | 1 |
| VG-SSL: Benchmarking Self-supervised Representation Learning Approaches for Visual Geo-localization | Jul 31, 2023 | Autonomous NavigationAutonomous Vehicles | CodeCode Available | 1 |
| Rethinking Uncertainly Missing and Ambiguous Visual Modality in Multi-Modal Entity Alignment | Jul 30, 2023 | BenchmarkingEntity Alignment | CodeCode Available | 1 |
| Benchmarking Offline Reinforcement Learning on Real-Robot Hardware | Jul 28, 2023 | Benchmarkingreinforcement-learning | CodeCode Available | 1 |
| PLANTAIN: Diffusion-inspired Pose Score Minimization for Fast and Accurate Molecular Docking | Jul 22, 2023 | BenchmarkingMolecular Docking | CodeCode Available | 1 |
| JoinGym: An Efficient Query Optimization Environment for Reinforcement Learning | Jul 21, 2023 | BenchmarkingCombinatorial Optimization | CodeCode Available | 1 |
| SciBench: Evaluating College-Level Scientific Problem-Solving Abilities of Large Language Models | Jul 20, 2023 | BenchmarkingLanguage Modeling | CodeCode Available | 1 |
| Decoding the Enigma: Benchmarking Humans and AIs on the Many Facets of Working Memory | Jul 20, 2023 | BenchmarkingDecision Making | CodeCode Available | 1 |
| Examining the Effects of Degree Distribution and Homophily in Graph Learning Models | Jul 17, 2023 | BenchmarkingGraph Clustering | CodeCode Available | 1 |
| Efficient Prediction of Peptide Self-assembly through Sequential and Graphical Encoding | Jul 17, 2023 | BenchmarkingDeep Learning | CodeCode Available | 1 |
| Towards Heterogeneous Long-tailed Learning: Benchmarking, Metrics, and Toolbox | Jul 17, 2023 | Benchmarking | CodeCode Available | 1 |
| GastroVision: A Multi-class Endoscopy Image Dataset for Computer Aided Gastrointestinal Disease Detection | Jul 16, 2023 | Benchmarking | CodeCode Available | 1 |
| IntelliGraphs: Datasets for Benchmarking Knowledge Graph Generation | Jul 13, 2023 | BenchmarkingGraph Embedding | CodeCode Available | 1 |