| Towards Effective Disambiguation for Machine Translation with Large Language Models | Sep 20, 2023 | BenchmarkingIn-Context Learning | —Unverified | 0 |
| An Evaluation of Machine Learning Approaches for Early Diagnosis of Autism Spectrum Disorder | Sep 20, 2023 | BenchmarkingClustering | CodeCode Available | 0 |
| SHOWMe: Benchmarking Object-agnostic Hand-Object 3D Reconstruction | Sep 19, 2023 | 3D ReconstructionBenchmarking | —Unverified | 0 |
| Training neural mapping schemes for satellite altimetry with simulation data | Sep 19, 2023 | Benchmarking | —Unverified | 0 |
| The Protein Engineering Tournament: An Open Science Benchmark for Protein Modeling and Design | Sep 18, 2023 | Benchmarking | —Unverified | 0 |
| Exploration of TPUs for AI Applications | Sep 16, 2023 | BenchmarkingEdge-computing | —Unverified | 0 |
| Emerging Approaches for THz Array Imaging: A Tutorial Review and Software Tool | Sep 16, 2023 | BenchmarkingImage Super-Resolution | —Unverified | 0 |
| Anchor Points: Benchmarking Models with Much Fewer Examples | Sep 14, 2023 | BenchmarkingLanguage Modeling | CodeCode Available | 0 |
| M3Dsynth: A dataset of medical 3D images with AI-generated local manipulations | Sep 14, 2023 | BenchmarkingComputed Tomography (CT) | CodeCode Available | 0 |
| Benchmarking machine learning models for quantum state classification | Sep 14, 2023 | BenchmarkingClassification | —Unverified | 0 |
| Leveraging Contextual Information for Effective Entity Salience Detection | Sep 14, 2023 | ArticlesBenchmarking | —Unverified | 0 |
| So you think you can track? | Sep 13, 2023 | BenchmarkingObject | —Unverified | 0 |
| Benchmarking Procedural Language Understanding for Low-Resource Languages: A Case Study on Turkish | Sep 13, 2023 | BenchmarkingTranslation | CodeCode Available | 0 |
| Unveiling the potential of large language models in generating semantic and cross-language clones | Sep 12, 2023 | BenchmarkingCode Generation | —Unverified | 0 |
| AmodalSynthDrive: A Synthetic Amodal Perception Dataset for Autonomous Driving | Sep 12, 2023 | Autonomous DrivingBenchmarking | —Unverified | 0 |
| Navigating Out-of-Distribution Electricity Load Forecasting during COVID-19: Benchmarking energy load forecasting models without and with continual learning | Sep 8, 2023 | BenchmarkingContinual Learning | CodeCode Available | 0 |
| DBsurf: A Discrepancy Based Method for Discrete Stochastic Gradient Estimation | Sep 7, 2023 | BenchmarkingNeural Architecture Search | —Unverified | 0 |
| Better Practices for Domain Adaptation | Sep 7, 2023 | BenchmarkingDomain Adaptation | —Unverified | 0 |
| Using representation balancing to learn conditional-average dose responses from clustered data | Sep 7, 2023 | BenchmarkingCausal Inference | CodeCode Available | 0 |
| Are SNNs Truly Energy-efficient? - A Hardware Perspective | Sep 6, 2023 | Benchmarking | —Unverified | 0 |
| Neural Networks for Fast Optimisation in Model Predictive Control: A Review | Sep 6, 2023 | BenchmarkingModel Predictive Control | —Unverified | 0 |
| AGIBench: A Multi-granularity, Multimodal, Human-referenced, Auto-scoring Benchmark for Large Language Models | Sep 5, 2023 | BenchmarkingZero-Shot Learning | —Unverified | 0 |
| A survey on efficient vision transformers: algorithms, techniques, and performance benchmarking | Sep 5, 2023 | BenchmarkingKnowledge Distillation | —Unverified | 0 |
| Hybrid data driven/thermal simulation model for comfort assessment | Sep 4, 2023 | Benchmarking | —Unverified | 0 |
| Transfer Learning between Motor Imagery Datasets using Deep Learning -- Validation of Framework and Comparison of Datasets | Sep 4, 2023 | BenchmarkingMotor Imagery | CodeCode Available | 0 |
| FOR-instance: a UAV laser scanning benchmark dataset for semantic and instance segmentation of individual trees | Sep 3, 2023 | BenchmarkingInstance Segmentation | —Unverified | 0 |
| Holistic Dynamic Frequency Transformer for Image Fusion and Exposure Correction | Sep 3, 2023 | BenchmarkingExposure Correction | —Unverified | 0 |
| FederatedScope-LLM: A Comprehensive Package for Fine-tuning Large Language Models in Federated Learning | Sep 1, 2023 | BenchmarkingFederated Learning | —Unverified | 0 |
| NeMig -- A Bilingual News Collection and Knowledge Graph about Migration | Sep 1, 2023 | ArticlesBenchmarking | CodeCode Available | 0 |
| Can humans help BERT gain "confidence"? | Aug 31, 2023 | BenchmarkingEEG | —Unverified | 0 |
| Benchmarking Robustness and Generalization in Multi-Agent Systems: A Case Study on Neural MMO | Aug 30, 2023 | BenchmarkingReinforcement Learning (RL) | —Unverified | 0 |
| Benchmarking Multilabel Topic Classification in the Kyrgyz Language | Aug 30, 2023 | BenchmarkingClassification | CodeCode Available | 0 |
| Speech Self-Supervised Representations Benchmarking: a Case for Larger Probing Heads | Aug 28, 2023 | BenchmarkingSelf-Supervised Learning | —Unverified | 0 |
| Benchmarking Data Efficiency and Computational Efficiency of Temporal Action Localization Models | Aug 24, 2023 | Action LocalizationBenchmarking | —Unverified | 0 |
| Beyond Document Page Classification: Design, Datasets, and Challenges | Aug 24, 2023 | BenchmarkingClassification | CodeCode Available | 0 |
| Finding the Perfect Fit: Applying Regression Models to ClimateBench v1.0 | Aug 23, 2023 | Benchmarkingregression | CodeCode Available | 0 |
| Benchmarking Causal Study to Interpret Large Language Models for Source Code | Aug 23, 2023 | BenchmarkingCausal Inference | —Unverified | 0 |
| Efficient Benchmarking of Language Models | Aug 22, 2023 | BenchmarkingGPU | —Unverified | 0 |
| Benchmarking Domain Adaptation for Chemical Processes on the Tennessee Eastman Process | Aug 22, 2023 | BenchmarkingDomain Adaptation | CodeCode Available | 0 |
| Beyond MD17: the reactive xxMD dataset | Aug 22, 2023 | BenchmarkingComputational chemistry | CodeCode Available | 0 |
| Expecting The Unexpected: Towards Broad Out-Of-Distribution Detection | Aug 22, 2023 | BenchmarkingOut-of-Distribution Detection | CodeCode Available | 0 |
| UGSL: A Unified Framework for Benchmarking Graph Structure Learning | Aug 21, 2023 | BenchmarkingGraph structure learning | —Unverified | 0 |
| Measuring the Effect of Causal Disentanglement on the Adversarial Robustness of Neural Network Models | Aug 21, 2023 | Adversarial RobustnessBenchmarking | —Unverified | 0 |
| Neurological Prognostication of Post-Cardiac-Arrest Coma Patients Using EEG Data: A Dynamic Survival Analysis Framework with Competing Risks | Aug 17, 2023 | BenchmarkingEEG | CodeCode Available | 0 |
| Benchmarking Adversarial Robustness of Compressed Deep Learning Models | Aug 16, 2023 | Adversarial RobustnessBenchmarking | —Unverified | 0 |
| A Survey on Model Compression for Large Language Models | Aug 15, 2023 | BenchmarkingKnowledge Distillation | —Unverified | 0 |
| IoT Data Trust Evaluation via Machine Learning | Aug 15, 2023 | BenchmarkingTime Series | CodeCode Available | 0 |
| Benchmarking Scalable Epistemic Uncertainty Quantification in Organ Segmentation | Aug 15, 2023 | BenchmarkingMedical Image Analysis | CodeCode Available | 0 |
| Deep Neural Operator Driven Real Time Inference for Nuclear Systems to Enable Digital Twin Solutions | Aug 15, 2023 | BenchmarkingComputational Efficiency | —Unverified | 0 |
| Does AI for science need another ImageNet Or totally different benchmarks? A case study of machine learning force fields | Aug 11, 2023 | Benchmarking | —Unverified | 0 |