Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 4051–4100 of 5548 papers

Title	Date	Tasks	Status	Hype
Benchmarking Deep Models for Salient Object Detection	Feb 7, 2022	BenchmarkingObject	CodeCode Available	1
Evaluation Methods and Measures for Causal Learning Algorithms	Feb 7, 2022	BenchmarkingBIG-bench Machine Learning	—Unverified	0
RerrFact: Reduced Evidence Retrieval Representations for Scientific Claim Verification	Feb 5, 2022	BenchmarkingBinary Classification	CodeCode Available	0
Structured Prediction Problem Archive	Feb 4, 2022	BenchmarkingPrediction	CodeCode Available	0
Quality Assessment of Low Light Restored Images: A Subjective Study and an Unsupervised Model	Feb 4, 2022	BenchmarkingContrastive Learning	—Unverified	0
Danish Airs and Grounds: A Dataset for Aerial-to-Street-Level Place Recognition and Localization	Feb 3, 2022	3D ReconstructionBenchmarking	—Unverified	0
A quantitative method for benchmarking fair income distribution	Feb 2, 2022	Benchmarking	—Unverified	0
Black-box Bayesian inference for economic agent-based models	Feb 1, 2022	Bayesian InferenceBenchmarking	—Unverified	0
When Do Flat Minima Optimizers Work?	Feb 1, 2022	BenchmarkingGraph Learning	CodeCode Available	1
AntBO: Towards Real-World Automated Antibody Design with Combinatorial Bayesian Optimisation	Jan 29, 2022	Bayesian OptimisationBenchmarking	—Unverified	0
Benchmarking Resource Usage for Efficient Distributed Deep Learning	Jan 28, 2022	BenchmarkingDeep Learning	—Unverified	0
Benchmarking Conventional Vision Models on Neuromorphic Fall Detection and Action Recognition Dataset	Jan 28, 2022	Action RecognitionBenchmarking	—Unverified	0
Benchmarking Robustness of 3D Point Cloud Recognition Against Common Corruptions	Jan 28, 2022	3D Point Cloud Classification3D Point Cloud Data Augmentation	CodeCode Available	2
Benchmarking learned non-Cartesian k-space trajectories and reconstruction networks	Jan 27, 2022	Benchmarking	—Unverified	0
A Multi-rater Comparative Study of Automatic Target Localization Methods for Epilepsy Deep Brain Stimulation Procedures	Jan 26, 2022	Benchmarking	—Unverified	0
MeltpoolNet: Melt pool Characteristic Prediction in Metal Additive Manufacturing Using Machine Learning	Jan 26, 2022	ArticlesBenchmarking	—Unverified	0
Jointly Learning Knowledge Embedding and Neighborhood Consensus with Relational Knowledge Distillation for Entity Alignment	Jan 25, 2022	BenchmarkingEntity Alignment	—Unverified	0
DrugOOD: Out-of-Distribution (OOD) Dataset Curator and Benchmark for AI-aided Drug Discovery -- A Focus on Affinity Prediction Problems with Noise Annotations	Jan 24, 2022	BenchmarkingDrug Discovery	CodeCode Available	0
Visual Object Tracking on Multi-modal RGB-D Videos: A Review	Jan 23, 2022	BenchmarkingObject	—Unverified	0
Out of Distribution Detection on ImageNet-O	Jan 23, 2022	BenchmarkingOut-of-Distribution Detection	CodeCode Available	0
Towards Private Learning on Decentralized Graphs with Local Differential Privacy	Jan 23, 2022	BenchmarkingGraph Learning	—Unverified	0
AiTLAS: Artificial Intelligence Toolbox for Earth Observation	Jan 21, 2022	BenchmarkingEarth Observation	CodeCode Available	2
Individual Treatment Effect Estimation Through Controlled Neural Network Training in Two Stages	Jan 21, 2022	BenchmarkingRepresentation Learning	—Unverified	0
A Simple Evolutionary Algorithm for Multi-modal Multi-objective Optimization	Jan 18, 2022	Benchmarking	—Unverified	0
High-Level Synthesis Performance Prediction using GNNs: Benchmarking, Modeling, and Advancing	Jan 18, 2022	BenchmarkingFeature Engineering	—Unverified	0
Benchmarking Subset Selection from Large Candidate Solution Sets in Evolutionary Multi-objective Optimization	Jan 18, 2022	Benchmarking	CodeCode Available	0
A Comparative study of Hyper-Parameter Optimization Tools	Jan 17, 2022	Bayesian OptimizationBenchmarking	—Unverified	0
FedNLP: Benchmarking Federated Learning Methods for Natural Language Processing Tasks	Jan 16, 2022	BenchmarkingFederated Learning	—Unverified	0
Feasibility of BERT Embeddings For Domain-Specific Knowledge Mining	Jan 16, 2022	BenchmarkingLanguage Modelling	—Unverified	0
Context-guided Triple Matching for Multiple Choice Question Answering	Jan 16, 2022	BenchmarkingMultiple-choice	—Unverified	0
Beyond Emotion: A Multi-Modal Dataset for Human Desire Understanding	Jan 16, 2022	Benchmarking	—Unverified	0
A Survey on Masked Facial Detection Methods and Datasets for Fighting Against COVID-19	Jan 13, 2022	BenchmarkingLesion Segmentation	—Unverified	0
Benchmarking Deep Reinforcement Learning Algorithms for Vision-based Robotics	Jan 11, 2022	BenchmarkingDeep Reinforcement Learning	—Unverified	0
A Baseline Statistical Method For Robust User-Assisted Multiple Segmentation	Jan 8, 2022	BenchmarkingImage Segmentation	CodeCode Available	0
Aerial Scene Parsing: From Tile-level Scene Classification to Pixel-wise Semantic Labeling	Jan 6, 2022	Aerial Scene ClassificationBenchmarking	—Unverified	0
Standard Vs Uniform Binary Search and Their Variants in Learned Static Indexing: The Case of the Searching on Sorted Data Benchmarking Software Platform	Jan 5, 2022	Benchmarking	CodeCode Available	0
DiLiGenT102: A Photometric Stereo Benchmark Dataset With Controlled Shape and Material Variation	Jan 1, 2022	Benchmarking	—Unverified	0
Are we really making much progress? Revisiting, benchmarking, and refining heterogeneous graph neural networks	Dec 30, 2021	BenchmarkingHeterogeneous Node Classification	CodeCode Available	1
Benchmarking Chinese Text Recognition: Datasets, Baselines, and an Empirical Study	Dec 30, 2021	AttributeBenchmarking	CodeCode Available	1
Leveraging Trust for Joint Multi-Objective and Multi-Fidelity Optimization	Dec 27, 2021	Bayesian OptimizationBenchmarking	CodeCode Available	1
MPCLeague: Robust MPC Platform for Privacy-Preserving Machine Learning	Dec 26, 2021	BenchmarkingBIG-bench Machine Learning	—Unverified	0
Benchmarking Pedestrian Odometry: The Brown Pedestrian Odometry Dataset (BPOD)	Dec 24, 2021	BenchmarkingPosition	—Unverified	0
InstaIndoor and Multi-modal Deep Learning for Indoor Scene Recognition	Dec 23, 2021	BenchmarkingDeep Learning	CodeCode Available	0
TFW2V: An Enhanced Document Similarity Method for the Morphologically Rich Finnish Language	Dec 23, 2021	BenchmarkingClustering	CodeCode Available	0
Evaluating the Robustness of Deep Reinforcement Learning for Autonomous Policies in a Multi-agent Urban Driving Environment	Dec 22, 2021	Autonomous DrivingBenchmarking	CodeCode Available	0
CORE: A Knowledge Graph Entity Type Prediction Method via Complex Space Regression and Embedding	Dec 19, 2021	BenchmarkingPrediction	—Unverified	0
QU-BraTS: MICCAI BraTS 2020 Challenge on Quantifying Uncertainty in Brain Tumor Segmentation - Analysis of Ranking Scores and Benchmarking Results	Dec 19, 2021	BenchmarkingBrain Tumor Segmentation	CodeCode Available	0
Personalized On-Device E-health Analytics with Decentralized Block Coordinate Descent	Dec 17, 2021	BenchmarkingDiagnostic	—Unverified	0
Autonomous Reinforcement Learning: Formalism and Benchmarking	Dec 17, 2021	Benchmarkingreinforcement-learning	CodeCode Available	1
Benchmarking Uncertainty Quantification on Biosignal Classification Tasks under Dataset Shift	Dec 16, 2021	BenchmarkingClassification	—Unverified	0

Show:10 25 50

← PrevPage 82 of 111Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified