Benchmarking

Papers

Showing 3651–3675 of 5548 papers

Title	Date	Tasks	Status
Towards Effective Disambiguation for Machine Translation with Large Language Models	Sep 20, 2023	Benchmarking, In-Context Learning	Unverified
An Evaluation of Machine Learning Approaches for Early Diagnosis of Autism Spectrum Disorder	Sep 20, 2023	Benchmarking, Clustering	Code Available
SHOWMe: Benchmarking Object-agnostic Hand-Object 3D Reconstruction	Sep 19, 2023	3D Reconstruction, Benchmarking	Unverified
Training neural mapping schemes for satellite altimetry with simulation data	Sep 19, 2023	Benchmarking	Unverified
The Protein Engineering Tournament: An Open Science Benchmark for Protein Modeling and Design	Sep 18, 2023	Benchmarking	Unverified
Exploration of TPUs for AI Applications	Sep 16, 2023	Benchmarking, Edge-computing	Unverified
Emerging Approaches for THz Array Imaging: A Tutorial Review and Software Tool	Sep 16, 2023	Benchmarking, Image Super-Resolution	Unverified
Anchor Points: Benchmarking Models with Much Fewer Examples	Sep 14, 2023	Benchmarking, Language Modeling	Code Available
M3Dsynth: A dataset of medical 3D images with AI-generated local manipulations	Sep 14, 2023	Benchmarking, Computed Tomography (CT)	Code Available
Benchmarking machine learning models for quantum state classification	Sep 14, 2023	Benchmarking, Classification	Unverified
Leveraging Contextual Information for Effective Entity Salience Detection	Sep 14, 2023	Articles, Benchmarking	Unverified
So you think you can track?	Sep 13, 2023	Benchmarking, Object	Unverified
Benchmarking Procedural Language Understanding for Low-Resource Languages: A Case Study on Turkish	Sep 13, 2023	Benchmarking, Translation	Code Available
Unveiling the potential of large language models in generating semantic and cross-language clones	Sep 12, 2023	Benchmarking, Code Generation	Unverified
AmodalSynthDrive: A Synthetic Amodal Perception Dataset for Autonomous Driving	Sep 12, 2023	Autonomous Driving, Benchmarking	Unverified
Navigating Out-of-Distribution Electricity Load Forecasting during COVID-19: Benchmarking energy load forecasting models without and with continual learning	Sep 8, 2023	Benchmarking, Continual Learning	Code Available
DBsurf: A Discrepancy Based Method for Discrete Stochastic Gradient Estimation	Sep 7, 2023	Benchmarking, Neural Architecture Search	Unverified
Better Practices for Domain Adaptation	Sep 7, 2023	Benchmarking, Domain Adaptation	Unverified
Using representation balancing to learn conditional-average dose responses from clustered data	Sep 7, 2023	Benchmarking, Causal Inference	Code Available
Are SNNs Truly Energy-efficient? - A Hardware Perspective	Sep 6, 2023	Benchmarking	Unverified
Neural Networks for Fast Optimisation in Model Predictive Control: A Review	Sep 6, 2023	Benchmarking, Model Predictive Control	Unverified
AGIBench: A Multi-granularity, Multimodal, Human-referenced, Auto-scoring Benchmark for Large Language Models	Sep 5, 2023	Benchmarking, Zero-Shot Learning	Unverified
A survey on efficient vision transformers: algorithms, techniques, and performance benchmarking	Sep 5, 2023	Benchmarking, Knowledge Distillation	Unverified
Hybrid data driven/thermal simulation model for comfort assessment	Sep 4, 2023	Benchmarking	Unverified
Transfer Learning between Motor Imagery Datasets using Deep Learning -- Validation of Framework and Comparison of Datasets	Sep 4, 2023	Benchmarking, Motor Imagery	Code Available

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified