Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 2976–3000 of 5548 papers

Title	Date	Tasks	Status	Hype
Training neural mapping schemes for satellite altimetry with simulation data	Sep 19, 2023	Benchmarking	—Unverified	0
SHOWMe: Benchmarking Object-agnostic Hand-Object 3D Reconstruction	Sep 19, 2023	3D ReconstructionBenchmarking	—Unverified	0
The Protein Engineering Tournament: An Open Science Benchmark for Protein Modeling and Design	Sep 18, 2023	Benchmarking	—Unverified	0
Emerging Approaches for THz Array Imaging: A Tutorial Review and Software Tool	Sep 16, 2023	BenchmarkingImage Super-Resolution	—Unverified	0
Exploration of TPUs for AI Applications	Sep 16, 2023	BenchmarkingEdge-computing	—Unverified	0
Anchor Points: Benchmarking Models with Much Fewer Examples	Sep 14, 2023	BenchmarkingLanguage Modeling	CodeCode Available	0
M3Dsynth: A dataset of medical 3D images with AI-generated local manipulations	Sep 14, 2023	BenchmarkingComputed Tomography (CT)	CodeCode Available	0
Leveraging Contextual Information for Effective Entity Salience Detection	Sep 14, 2023	ArticlesBenchmarking	—Unverified	0
Benchmarking machine learning models for quantum state classification	Sep 14, 2023	BenchmarkingClassification	—Unverified	0
VerilogEval: Evaluating Large Language Models for Verilog Code Generation	Sep 14, 2023	BenchmarkingCode Generation	CodeCode Available	2
So you think you can track?	Sep 13, 2023	BenchmarkingObject	—Unverified	0
Benchmarking Procedural Language Understanding for Low-Resource Languages: A Case Study on Turkish	Sep 13, 2023	BenchmarkingTranslation	CodeCode Available	0
An Image Dataset for Benchmarking Recommender Systems with Raw Pixels	Sep 13, 2023	BenchmarkingRecommendation Systems	CodeCode Available	1
AmodalSynthDrive: A Synthetic Amodal Perception Dataset for Autonomous Driving	Sep 12, 2023	Autonomous DrivingBenchmarking	—Unverified	0
Unveiling the potential of large language models in generating semantic and cross-language clones	Sep 12, 2023	BenchmarkingCode Generation	—Unverified	0
Formalizing Multimedia Recommendation through Multimodal Deep Learning	Sep 11, 2023	BenchmarkingDeep Learning	CodeCode Available	1
FreeMan: Towards Benchmarking 3D Human Pose Estimation under Real-World Conditions	Sep 10, 2023	3D Human Pose Estimation3D Pose Estimation	CodeCode Available	1
RecAD: Towards A Unified Library for Recommender Attack and Defense	Sep 9, 2023	BenchmarkingRecommendation Systems	CodeCode Available	1
Navigating Out-of-Distribution Electricity Load Forecasting during COVID-19: Benchmarking energy load forecasting models without and with continual learning	Sep 8, 2023	BenchmarkingContinual Learning	CodeCode Available	0
DBsurf: A Discrepancy Based Method for Discrete Stochastic Gradient Estimation	Sep 7, 2023	BenchmarkingNeural Architecture Search	—Unverified	0
PyGraft: Configurable Generation of Synthetic Schemas and Knowledge Graphs at Your Fingertips	Sep 7, 2023	BenchmarkingKnowledge Graphs	CodeCode Available	2
Using representation balancing to learn conditional-average dose responses from clustered data	Sep 7, 2023	BenchmarkingCausal Inference	CodeCode Available	0
Better Practices for Domain Adaptation	Sep 7, 2023	BenchmarkingDomain Adaptation	—Unverified	0
Evaluation of large language models for discovery of gene set function	Sep 7, 2023	BenchmarkingLanguage Modelling	CodeCode Available	1
Neural Networks for Fast Optimisation in Model Predictive Control: A Review	Sep 6, 2023	BenchmarkingModel Predictive Control	—Unverified	0

Show:10 25 50

← PrevPage 120 of 222Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified