Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 4501–4550 of 5548 papers

Title	Date	Tasks	Status
Measuring CLEVRness: Black-box Testing of Visual Reasoning Models	Sep 29, 2021	BenchmarkingDiagnostic	—Unverified
Benchmarking Sample Selection Strategies for Batch Reinforcement Learning	Sep 29, 2021	BenchmarkingImitation Learning	—Unverified
Benchmarking Algorithms from Machine Learning for Low-Budget Black-Box Optimization	Sep 29, 2021	Bayesian OptimizationBenchmarking	—Unverified
Stabilized Self-training with Negative Sampling on Few-labeled Graph Data	Sep 29, 2021	BenchmarkingNode Classification	—Unverified
Learning to Schedule Learning rate with Graph Neural Networks	Sep 29, 2021	Benchmarkingimage-classification	—Unverified
A Systematic Evaluation of Domain Adaptation Algorithms On Time Series Data	Sep 29, 2021	BenchmarkingDomain Adaptation	—Unverified
Imitation Learning from Pixel Observations for Continuous Control	Sep 29, 2021	Benchmarkingcontinuous-control	—Unverified
Extensible Logging and Empirical Attainment Function for IOHexperimenter	Sep 28, 2021	Benchmarking	—Unverified
Context-guided Triple Matching for Multiple Choice Question Answering	Sep 27, 2021	BenchmarkingMultiple-choice	—Unverified
Curb Your Carbon Emissions: Benchmarking Carbon Emissions in Machine Translation	Sep 26, 2021	BenchmarkingMachine Translation	—Unverified
Benchmarking Lane-changing Decision-making for Deep Reinforcement Learning	Sep 22, 2021	Autonomous DrivingBenchmarking	—Unverified
Benchmarking Augmentation Methods for Learning Robust Navigation Agents: the Winning Entry of the 2021 iGibson Challenge	Sep 22, 2021	BenchmarkingData Augmentation	—Unverified
Efficiently solving the thief orienteering problem with a max-min ant colony optimization approach	Sep 21, 2021	Benchmarking	CodeCode Available
A Novel Cluster Detection of COVID-19 Patients and Medical Disease Conditions Using Improved Evolutionary Clustering Algorithm Star	Sep 20, 2021	BenchmarkingClustering	—Unverified
Hybrid Transceiver Design for Tera-Hertz MIMO Systems Relying on Bayesian Learning Aided Sparse Channel Estimation	Sep 20, 2021	Benchmarking	—Unverified
WiSoSuper: Benchmarking Super-Resolution Methods on Wind and Solar Data	Sep 17, 2021	BenchmarkingBIG-bench Machine Learning	—Unverified
Messing Up 3D Virtual Environments: Transferable Adversarial 3D Objects	Sep 17, 2021	BenchmarkingBIG-bench Machine Learning	CodeCode Available
DiS-ReX: A Multilingual Dataset for Distantly Supervised Relation Extraction	Sep 17, 2021	BenchmarkingRelation	—Unverified
Benchmarking Answer Verification Methods for Question Answering-Based Summarization Evaluation Metrics	Sep 17, 2021	AttributeBenchmarking	—Unverified
Benchmarking Feature-based Algorithm Selection Systems for Black-box Numerical Optimization	Sep 17, 2021	Benchmarking	CodeCode Available
A Survey on Temporal Sentence Grounding in Videos	Sep 16, 2021	Action LocalizationBenchmarking	—Unverified
A Continuous Optimisation Benchmark Suite from Neural Network Regression	Sep 12, 2021	BenchmarkingEvolutionary Algorithms	CodeCode Available
Benchmarking Processor Performance by Multi-Threaded Machine Learning Algorithms	Sep 11, 2021	BenchmarkingBIG-bench Machine Learning	—Unverified
Application of DEA in International Market Selection for the export of products from Spain	Sep 10, 2021	BenchmarkingDecision Making	—Unverified
A framework for benchmarking uncertainty in deep regression	Sep 10, 2021	Benchmarkingregression	—Unverified
Characterization of Constrained Continuous Multiobjective Optimization Problems: A Feature Space Perspective	Sep 9, 2021	BenchmarkingMultiobjective Optimization	—Unverified
CrowdDriven: A New Challenging Dataset for Outdoor Visual Localization	Sep 9, 2021	BenchmarkingSelf-Driving Cars	—Unverified
Towards Efficient Synchronous Federated Training: A Survey on System Optimization Strategies	Sep 9, 2021	BenchmarkingFederated Learning	CodeCode Available
Resistive Neural Hardware Accelerators	Sep 8, 2021	Benchmarking	—Unverified
Fine-grained Hand Gesture Recognition in Multi-viewpoint Hand Hygiene	Sep 7, 2021	BenchmarkingFine-Grained Image Recognition	CodeCode Available
Benchmarking the Robustness of Instance Segmentation Models	Sep 2, 2021	BenchmarkingDomain Adaptation	—Unverified
Towards Sentiment Analysis of Tobacco Products’ Usage in Social Media	Sep 1, 2021	BenchmarkingSentiment Analysis	—Unverified
Benchmarking down-scaled (not so large) pre-trained language models	Sep 1, 2021	Benchmarking	CodeCode Available
Cross-Lingual Text Classification of Transliterated Hindi and Malayalam	Aug 31, 2021	BenchmarkingClassification	CodeCode Available
Benchmarking the Accuracy and Robustness of Feedback Alignment Algorithms	Aug 30, 2021	Benchmarking	—Unverified
Europarl-ASR: A Large Corpus of Parliamentary Debates for Streaming ASR Benchmarking and Speech Data Filtering/Verbatimization	Aug 30, 2021	BenchmarkingData Augmentation	—Unverified
BioFors: A Large Biomedical Image Forensics Dataset	Aug 30, 2021	BenchmarkingImage Forensics	CodeCode Available
Technological Approaches to Detecting Online Disinformation and Manipulation	Aug 26, 2021	BenchmarkingFact Checking	—Unverified
Benchmarking high-fidelity pedestrian tracking systems for research, real-time monitoring and crowd control	Aug 26, 2021	BenchmarkingDensity Estimation	—Unverified
A Benchmark for Spray from Nearby Cutting Vehicles	Aug 24, 2021	Autonomous DrivingBenchmarking	—Unverified
DeepEdgeBench: Benchmarking Deep Neural Networks on Edge Devices	Aug 21, 2021	BenchmarkingEdge-computing	—Unverified
Evolving Evolutionary Algorithms using Linear Genetic Programming	Aug 21, 2021	BenchmarkingEvolutionary Algorithms	—Unverified
AutoLay: Benchmarking amodal layout estimation for autonomous driving	Aug 20, 2021	Amodal Layout EstimationAutonomous Driving	—Unverified
Discriminating modelling approaches for Point in Time Economic Scenario Generation	Aug 19, 2021	Benchmarking	—Unverified
Drift in a Popular Metal Oxide Sensor Dataset Reveals Limitations for Gas Classification Benchmarks	Aug 19, 2021	BenchmarkingClassification	—Unverified
SIAM: Chiplet-based Scalable In-Memory Acceleration with Mesh for Deep Neural Networks	Aug 14, 2021	Benchmarking	—Unverified
Distributional Depth-Based Estimation of Object Articulation Models	Aug 12, 2021	BenchmarkingObject	CodeCode Available
BenchENAS: A Benchmarking Platform for Evolutionary Neural Architecture Search	Aug 9, 2021	BenchmarkingGPU	CodeCode Available
A Look at the Evaluation Setup of the M5 Forecasting Competition	Aug 8, 2021	BenchmarkingDecision Making	—Unverified
Secure Neuroimaging Analysis using Federated Learning with Homomorphic Encryption	Aug 7, 2021	BenchmarkingFederated Learning	—Unverified

Show:10 25 50

← PrevPage 91 of 111Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified