SOTAVerified

Benchmarking

Papers

Showing 3826–3850 of 5548 papers

Title | Status | Hype
Reradiation and Scattering from a Reconfigurable Intelligent Surface: A General Macroscopic Model | | 0
ResBench: Benchmarking LLM-Generated FPGA Designs with Resource Awareness | | 0
ResearchArena: Benchmarking LLMs' Ability to Collect and Organize Information as Research Agents | | 0
ResearchBench: Benchmarking LLMs in Scientific Discovery via Inspiration-Based Task Decomposition | | 0
ResearchCodeAgent: An LLM Multi-Agent System for Automated Codification of Research Methodologies | | 0
ResearchCodeBench: Benchmarking LLMs on Implementing Novel Machine Learning Research Code | | 0
Reservoir Computing with a Single Oscillating Gas Bubble: Emphasizing the Chaotic Regime | | 0
Resistive Neural Hardware Accelerators | | 0
Resource-efficient Medical Image Analysis with Self-adapting Forward-Forward Networks | | 0
RESPONSE: Benchmarking the Ability of Language Models to Undertake Commonsense Reasoning in Crisis Situation | | 0
Restoring Images Captured in Arbitrary Hybrid Adverse Weather Conditions in One Go | | 0
Rethinking Pareto Frontier for Performance Evaluation of Deep Neural Networks | | 0
Retrieval-Augmented Generation for Service Discovery: Chunking Strategies and Benchmarking | | 0
Dissecting Multimodality in VideoQA Transformer Models by Impairing Modality Fusion | | 0
Review and experimental benchmarking of machine learning algorithms for efficient optimization of cold atom experiments | | 0
Reviewing and Benchmarking Parameter Control Methods in Differential Evolution | | 0
Categorization and analysis of 14 computational methods for estimating cell potency from single-cell RNA-seq data | | 0
Revisiting Implicit Models: Sparsity Trade-offs Capability in Weight-tied Model for Vision Tasks | | 0
Revisiting Oxford and Paris: Large-Scale Image Retrieval Benchmarking | | 0
Revisiting Safe Exploration in Safe Reinforcement learning | | 0
On the Evaluation and Refinement of Vision-Language Instruction Tuning Datasets | | 0
Rewarding Episodic Visitation Discrepancy for Exploration in Reinforcement Learning | | 0
RF Fingerprinting Needs Attention: Multi-task Approach for Real-World WiFi and Bluetooth | | 0
Riemannian Geometry for the classification of brain states with intracortical brain-computer interfaces | | 0
Riemannian Self-Attention Mechanism for SPD Networks | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | | Unverified