| Coherent Feed Forward Quantum Neural Network | Feb 1, 2024 | BenchmarkingDiagnostic | —Unverified | 0 |
| MRAnnotator: multi-Anatomy and many-Sequence MRI segmentation of 44 structures | Feb 1, 2024 | AnatomyBenchmarking | —Unverified | 0 |
| Good at captioning, bad at counting: Benchmarking GPT-4V on Earth observation data | Jan 31, 2024 | BenchmarkingChange Detection | CodeCode Available | 0 |
| Benchmarking Sensitivity of Continual Graph Learning for Skeleton-Based Action Recognition | Jan 31, 2024 | Action RecognitionBenchmarking | —Unverified | 0 |
| ToPro: Token-Level Prompt Decomposition for Cross-Lingual Sequence Labeling Tasks | Jan 29, 2024 | BenchmarkingCross-Lingual Transfer | CodeCode Available | 0 |
| Muffin or Chihuahua? Challenging Multimodal Large Language Models with Multipanel VQA | Jan 29, 2024 | BenchmarkingImage Comprehension | —Unverified | 0 |
| PPM: Automated Generation of Diverse Programming Problems for Benchmarking Code Generation Models | Jan 28, 2024 | BenchmarkingCode Generation | CodeCode Available | 0 |
| Benchmarking with MIMIC-IV, an irregular, spare clinical time series dataset | Jan 27, 2024 | BenchmarkingTime Series | —Unverified | 0 |
| SAM-based instance segmentation models for the automation of structural damage detection | Jan 27, 2024 | BenchmarkingInstance Segmentation | —Unverified | 0 |
| Biological Valuation Map of Flanders: A Sentinel-2 Imagery Analysis | Jan 26, 2024 | BenchmarkingSemantic Segmentation | —Unverified | 0 |
| Benchmarking Large Language Models in Complex Question Answering Attribution using Knowledge Graphs | Jan 26, 2024 | BenchmarkingKnowledge Graphs | —Unverified | 0 |
| Automated legal reasoning with discretion to act using s(LAW) | Jan 25, 2024 | BenchmarkingLegal Reasoning | —Unverified | 0 |
| TriSAM: Tri-Plane SAM for zero-shot cortical blood vessel segmentation in VEM images | Jan 25, 2024 | BenchmarkingSegmentation | —Unverified | 0 |
| Large Malaysian Language Model Based on Mistral for Enhanced Local Language Understanding | Jan 24, 2024 | BenchmarkingLanguage Modeling | —Unverified | 0 |
| Benchmarking the Fairness of Image Upsampling Methods | Jan 24, 2024 | BenchmarkingDiversity | CodeCode Available | 0 |
| LLpowershap: Logistic Loss-based Automated Shapley Values Feature Selection Method | Jan 23, 2024 | BenchmarkingFairness | CodeCode Available | 0 |
| Deep Neural Network Benchmarks for Selective Classification | Jan 23, 2024 | BenchmarkingClassification | CodeCode Available | 0 |
| What the Weight?! A Unified Framework for Zero-Shot Knowledge Composition | Jan 23, 2024 | Benchmarking | CodeCode Available | 0 |
| Subgroup analysis methods for time-to-event outcomes in heterogeneous randomized controlled trials | Jan 22, 2024 | BenchmarkingSynthetic Data Generation | CodeCode Available | 0 |
| Data-Driven Target Localization: Benchmarking Gradient Descent Using the Cramer-Rao Bound | Jan 20, 2024 | Benchmarking | —Unverified | 0 |
| Data Augmentation for Traffic Classification | Jan 19, 2024 | BenchmarkingClassification | —Unverified | 0 |
| Harnessing Orthogonality to Train Low-Rank Neural Networks | Jan 16, 2024 | Benchmarking | CodeCode Available | 0 |
| NOTSOFAR-1 Challenge: New Datasets, Baseline, and Tasks for Distant Meeting Transcription | Jan 16, 2024 | Automatic Speech RecognitionBenchmarking | —Unverified | 0 |
| OpenDPD: An Open-Source End-to-End Learning & Benchmarking Framework for Wideband Power Amplifier Modeling and Digital Pre-Distortion | Jan 16, 2024 | Benchmarking | —Unverified | 0 |
| Large Language Models are Null-Shot Learners | Jan 16, 2024 | Arithmetic ReasoningBenchmarking | —Unverified | 0 |