SOTAVerified

Benchmarking

Papers

Showing 3801-3850 of 5548 papers

Title | Status | Hype
Real-time Webcam Heart-Rate and Variability Estimation with Clean Ground Truth for Evaluation | | 0
One-Shot Real-to-Sim via End-to-End Differentiable Simulation and Rendering | | 0
Real-World Blur Dataset for Learning and Benchmarking Deblurring Algorithms | | 0
Real-World fNIRS-Based Brain-Computer Interfaces: Benchmarking Deep Learning and Classical Models in Interactive Gaming | | 0
Rearrangement: A Challenge for Embodied AI | | 0
Reasoning as a Resource: Optimizing Fast and Slow Thinking in Code Generation Models | | 0
Re-assessing ImageNet: How aligned is its single-label assumption with its multi-label nature? | | 0
RECipe: Does a Multi-Modal Recipe Knowledge Graph Fit a Multi-Purpose Recommendation System? | | 0
Recommendations for Baselines and Benchmarking Approximate Gaussian Processes | | 0
Reconstructing antibody repertoires from error-prone immunosequencing datasets | | 0
Reduced, Reused and Recycled: The Life of a Dataset in Machine Learning Research | | 0
Refer to Anything with Vision-Language Prompts | | 0
Regularization of ML models for Earth systems by using longer model timesteps | | 0
Reinforcement Learning Based Handwritten Digit Recognition with Two-State Q-Learning | | 0
Reinforcement Learning with Graph Attention for Routing and Wavelength Assignment with Lightpath Reuse | | 0
Reinforcing Competitive Multi-Agents for Playing So Long Sucker | | 0
Relative Afferent Pupillary Defect Screening through Transfer Learning | | 0
Reliable validation of Reinforcement Learning Benchmarks | | 0
REMoH: A Reflective Evolution of Multi-objective Heuristics approach via Large Language Models | | 0
Removal of Ocular Artifacts in EEG Using Deep Learning | | 0
Removing Multiple Hybrid Adverse Weather in Video via a Unified Model | | 0
Rephrasing natural text data with different languages and quality levels for Large Language Model pre-training | | 0
REPLAB: A Reproducible Low-Cost Arm Benchmark Platform for Robotic Learning | | 0
Reproducible evaluation of classification methods in Alzheimer's disease: framework and application to MRI and PET data | | 0
Repurposing Foundation Model for Generalizable Medical Time Series Classification | | 0
Reradiation and Scattering from a Reconfigurable Intelligent Surface: A General Macroscopic Model | | 0
ResBench: Benchmarking LLM-Generated FPGA Designs with Resource Awareness | | 0
ResearchArena: Benchmarking LLMs' Ability to Collect and Organize Information as Research Agents | | 0
ResearchBench: Benchmarking LLMs in Scientific Discovery via Inspiration-Based Task Decomposition | | 0
ResearchCodeAgent: An LLM Multi-Agent System for Automated Codification of Research Methodologies | | 0
ResearchCodeBench: Benchmarking LLMs on Implementing Novel Machine Learning Research Code | | 0
Reservoir Computing with a Single Oscillating Gas Bubble: Emphasizing the Chaotic Regime | | 0
Resistive Neural Hardware Accelerators | | 0
Resource-efficient Medical Image Analysis with Self-adapting Forward-Forward Networks | | 0
RESPONSE: Benchmarking the Ability of Language Models to Undertake Commonsense Reasoning in Crisis Situation | | 0
Restoring Images Captured in Arbitrary Hybrid Adverse Weather Conditions in One Go | | 0
Rethinking Pareto Frontier for Performance Evaluation of Deep Neural Networks | | 0
Retrieval-Augmented Generation for Service Discovery: Chunking Strategies and Benchmarking | | 0
Dissecting Multimodality in VideoQA Transformer Models by Impairing Modality Fusion | | 0
Review and experimental benchmarking of machine learning algorithms for efficient optimization of cold atom experiments | | 0
Reviewing and Benchmarking Parameter Control Methods in Differential Evolution | | 0
Categorization and analysis of 14 computational methods for estimating cell potency from single-cell RNA-seq data | | 0
Revisiting Implicit Models: Sparsity Trade-offs Capability in Weight-tied Model for Vision Tasks | | 0
Revisiting Oxford and Paris: Large-Scale Image Retrieval Benchmarking | | 0
Revisiting Safe Exploration in Safe Reinforcement learning | | 0
On the Evaluation and Refinement of Vision-Language Instruction Tuning Datasets | | 0
Rewarding Episodic Visitation Discrepancy for Exploration in Reinforcement Learning | | 0
RF Fingerprinting Needs Attention: Multi-task Approach for Real-World WiFi and Bluetooth | | 0
Riemannian Geometry for the classification of brain states with intracortical brain-computer interfaces | | 0
Riemannian Self-Attention Mechanism for SPD Networks | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | | Unverified