Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 3351–3375 of 5548 papers

Title	Date	Tasks	Status	Hype
UDTIRI: An Online Open-Source Intelligent Road Inspection Benchmark Suite	Apr 18, 2023	BenchmarkingInstance Segmentation	—Unverified	0
OOD-CV-v2: An extended Benchmark for Robustness to Out-of-Distribution Shifts of Individual Nuisances in Natural Images	Apr 17, 2023	3D Pose EstimationBenchmarking	—Unverified	0
Towards Computational Performance Engineering for Unsupervised Concept Drift Detection -- Complexities, Benchmarking, Performance Analysis	Apr 17, 2023	BenchmarkingDrift Detection	CodeCode Available	0
Dialogue Games for Benchmarking Language Understanding: Motivation, Taxonomy, Strategy	Apr 14, 2023	Benchmarking	—Unverified	0
Improving Items and Contexts Understanding with Descriptive Graph for Conversational Recommendation	Apr 11, 2023	BenchmarkingConversational Recommendation	—Unverified	0
Benchmarking the Physical-world Adversarial Robustness of Vehicle Detection	Apr 11, 2023	Adversarial AttackAdversarial Robustness	—Unverified	0
OpenAGI: When LLM Meets Domain Experts	Apr 10, 2023	BenchmarkingNatural Language Queries	CodeCode Available	4
NeuroBench: A Framework for Benchmarking Neuromorphic Computing Algorithms and Systems	Apr 10, 2023	Benchmarking	CodeCode Available	1
Certifiable Black-Box Attacks with Randomized Adversarial Examples: Breaking Defenses with Provable Confidence	Apr 10, 2023	Benchmarkingspeech-recognition	CodeCode Available	0
On Evaluation of Bangla Word Analogies	Apr 10, 2023	BenchmarkingWord Embeddings	—Unverified	0
ESPnet-ST-v2: Multipurpose Spoken Language Translation Toolkit	Apr 10, 2023	BenchmarkingSimultaneous Speech-to-Text Translation	—Unverified	0
RoboPianist: Dexterous Piano Playing with Deep Reinforcement Learning	Apr 9, 2023	BenchmarkingDeep Reinforcement Learning	CodeCode Available	2
ForamViT-GAN: Exploring New Paradigms in Deep Learning for Micropaleontological Image Analysis	Apr 9, 2023	BenchmarkingDeep Learning	—Unverified	0
Benchmarking the Robustness of Quantized Models	Apr 8, 2023	BenchmarkingQuantization	—Unverified	0
SimbaML: Connecting Mechanistic Models and Machine Learning with Augmented Data	Apr 8, 2023	BenchmarkingData Augmentation	CodeCode Available	0
Probing Conceptual Understanding of Large Visual-Language Models	Apr 7, 2023	Benchmarking	CodeCode Available	0
Interpretable statistical representations of neural population dynamics and geometry	Apr 6, 2023	BenchmarkingDecision Making	CodeCode Available	1
Benchmarking Robustness to Text-Guided Corruptions	Apr 6, 2023	BenchmarkingData Augmentation	CodeCode Available	0
DRAC: Diabetic Retinopathy Analysis Challenge with Ultra-Wide Optical Coherence Tomography Angiography Images	Apr 5, 2023	BenchmarkingData Augmentation	—Unverified	0
MMVC: Learned Multi-Mode Video Compression with Block-based Prediction Mode Selection and Density-Adaptive Entropy Coding	Apr 5, 2023	BenchmarkingMS-SSIM	CodeCode Available	1
LogoNet: a fine-grained network for instance-level logo sketch retrieval	Apr 5, 2023	2kBenchmarking	CodeCode Available	0
IHCV: Discovery of Hidden Time-Dependent Control Variables in Non-Linear Dynamical Systems	Apr 5, 2023	Benchmarking	CodeCode Available	0
The Saudi Privacy Policy Dataset	Apr 5, 2023	Benchmarking	CodeCode Available	0
OpenContrails: Benchmarking Contrail Detection on GOES-16 ABI	Apr 4, 2023	Benchmarking	—Unverified	0
SLPerf: a Unified Framework for Benchmarking Split Learning	Apr 4, 2023	BenchmarkingDiversity	CodeCode Available	1

Show:10 25 50

← PrevPage 135 of 222Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified