| OCTrack: Benchmarking the Open-Corpus Multi-Object Tracking | Jul 19, 2024 | Benchmarking, Multi-Object Tracking | —Unverified | 0 |
| Vision-Based Power Line Cables and Pylons Detection for Low Flying Aircraft | Jul 19, 2024 | Benchmarking, Transfer Learning | —Unverified | 0 |
| SHS: Scorpion Hunting Strategy Swarm Algorithm | Jul 19, 2024 | Benchmarking | —Unverified | 0 |
| Realistic Evaluation of Test-Time Adaptation Algorithms: Unsupervised Hyperparameter Selection | Jul 19, 2024 | Benchmarking, Model Selection | —Unverified | 0 |
| Benchmarking deep learning models for bearing fault diagnosis using the CWRU dataset: A multi-label approach | Jul 19, 2024 | Benchmarking, Binary Classification | —Unverified | 0 |
| Enhancing Biomedical Knowledge Discovery for Diseases: An Open-Source Framework Applied on Rett Syndrome and Alzheimer's Disease | Jul 18, 2024 | Benchmarking | Code Available | 0 |
| Phi-3 Safety Post-Training: Aligning Language Models with a "Break-Fix" Cycle | Jul 18, 2024 | Benchmarking, Language Modeling | —Unverified | 0 |
| RT-Pose: A 4D Radar Tensor-based 3D Human Pose Estimation and Localization Benchmark | Jul 18, 2024 | 3D Human Pose Estimation, Benchmarking | —Unverified | 0 |
| Language-Driven 6-DoF Grasp Detection Using Negative Prompt Guidance | Jul 18, 2024 | Benchmarking | —Unverified | 0 |
| Benchmarking Robust Self-Supervised Learning Across Diverse Downstream Tasks | Jul 17, 2024 | Adversarial Robustness, Benchmarking | Code Available | 0 |
| FETCH: A Memory-Efficient Replay Approach for Continual Learning in Image Classification | Jul 17, 2024 | Benchmarking, Continual Learning | —Unverified | 0 |
| Is Sarcasm Detection A Step-by-Step Reasoning Process in Large Language Models? | Jul 17, 2024 | Benchmarking, Sarcasm Detection | —Unverified | 0 |
| LMMs-Eval: Reality Check on the Evaluation of Large Multimodal Models | Jul 17, 2024 | Benchmarking, Language Modelling | —Unverified | 0 |
| Abstraction Alignment: Comparing Model-Learned and Human-Encoded Conceptual Relationships | Jul 17, 2024 | Benchmarking | Code Available | 0 |
| HIMO: A New Benchmark for Full-Body Human Interacting with Multiple Objects | Jul 17, 2024 | Benchmarking, Human-Object Interaction Detection | —Unverified | 0 |
| Comprehensive Review and Empirical Evaluation of Causal Discovery Algorithms for Numerical Data | Jul 17, 2024 | Articles, Benchmarking | —Unverified | 0 |
| Temporal receptive field in dynamic graph learning: A comprehensive analysis | Jul 17, 2024 | Benchmarking, Dynamic Link Prediction | Code Available | 0 |
| A Closer Look at Benchmarking Self-Supervised Pre-training with Image Classification | Jul 16, 2024 | Benchmarking, Few-Shot Learning | —Unverified | 0 |
| Feature interpretability in BCIs: exploring the role of network lateralization | Jul 16, 2024 | Benchmarking, EEG | Code Available | 0 |
| Benchmarking the Attribution Quality of Vision Models | Jul 16, 2024 | Benchmarking, Explainable Models | Code Available | 0 |
| REMM: Rotation-Equivariant Framework for End-to-End Multimodal Image Matching | Jul 16, 2024 | Benchmarking | Code Available | 0 |
| AstroMLab 1: Who Wins Astronomy Jeopardy!? | Jul 15, 2024 | Astronomy, Benchmarking | —Unverified | 0 |
| On Machine Learning Approaches for Protein-Ligand Binding Affinity Prediction | Jul 15, 2024 | Active Learning, Benchmarking | —Unverified | 0 |
| ConvBench: A Comprehensive Benchmark for 2D Convolution Primitive Evaluation | Jul 15, 2024 | Benchmarking | —Unverified | 0 |
| Benchmarking Vision Language Models for Cultural Understanding | Jul 15, 2024 | Benchmarking, Question Answering | —Unverified | 0 |