Benchmarking

Papers

Showing 3101–3150 of 5548 papers

Title	Date	Tasks	Status	Hype
Benchmarking fixed-length Fingerprint Representations across different Embedding Sizes and Sensor Types	Jul 17, 2023	Benchmarking	Unverified	0
Machine Learning for Ranking f-wave Extraction Methods in Single-Lead ECGs	Jul 17, 2023	Benchmarking	Unverified	0
Efficient and Accurate Optimal Transport with Mirror Descent and Conjugate Gradients	Jul 17, 2023	Benchmarking, GPU	Code Available	0
EasyTPP: Towards Open Benchmarking Temporal Point Processes	Jul 16, 2023	Benchmarking, Point Processes	Code Available	2
Revisiting Implicit Models: Sparsity Trade-offs Capability in Weight-tied Model for Vision Tasks	Jul 16, 2023	Benchmarking	Unverified	0
GastroVision: A Multi-class Endoscopy Image Dataset for Computer Aided Gastrointestinal Disease Detection	Jul 16, 2023	Benchmarking	Code Available	1
Joint Batching and Scheduling for High-Throughput Multiuser Edge AI with Asynchronous Task Arrivals	Jul 15, 2023	Benchmarking, Scheduling	Unverified	0
Benchmarking the Effectiveness of Classification Algorithms and SVM Kernels for Dry Beans	Jul 15, 2023	Benchmarking, Dimensionality Reduction	Unverified	0
Benchmarking Explanatory Models for Inertia Forecasting using Public Data of the Nordic Area	Jul 14, 2023	Benchmarking, Time Series	Unverified	0
Challenge Results Are Not Reproducible	Jul 14, 2023	Benchmarking, Image Segmentation	Unverified	0
A Dynamic Points Removal Benchmark in Point Cloud Maps	Jul 14, 2023	Benchmarking, Dynamic Point Removal	Code Available	2
IntelliGraphs: Datasets for Benchmarking Knowledge Graph Generation	Jul 13, 2023	Benchmarking, Graph Embedding	Code Available	1
Robotic Manipulation Datasets for Offline Compositional Reinforcement Learning	Jul 13, 2023	Benchmarking, Offline RL	Code Available	1
Pathway: a fast and flexible unified stream data processing framework for analytical and Machine Learning applications	Jul 12, 2023	Benchmarking	Unverified	0
A Comprehensive Overview of Large Language Models	Jul 12, 2023	Benchmarking	Code Available	1
Deep Generative Models for Physiological Signals: A Systematic Literature Review	Jul 12, 2023	Benchmarking, EEG	Unverified	0
AnuraSet: A dataset for benchmarking Neotropical anuran calls identification in passive acoustic monitoring	Jul 11, 2023	Benchmarking	Code Available	1
Temporal Graphs Anomaly Emergence Detection: Benchmarking For Social Media Interactions	Jul 11, 2023	Anomaly Detection, Benchmarking	Unverified	0
Benchmarking Algorithms for Federated Domain Generalization	Jul 11, 2023	Benchmarking, Diversity	Code Available	1
Benchmarking Bayesian Causal Discovery Methods for Downstream Treatment Effect Estimation	Jul 11, 2023	Benchmarking, Causal Discovery	Unverified	0
A Call to Reflect on Evaluation Practices for Age Estimation: Comparative Analysis of the State-of-the-Art and a Unified Benchmark	Jul 10, 2023	Age Estimation, Benchmarking	Code Available	1
Assessing the efficacy of large language models in generating accurate teacher responses	Jul 9, 2023	Benchmarking, In-Context Learning	Unverified	0
Fairness-Aware Graph Neural Networks: A Survey	Jul 8, 2023	Benchmarking, Fairness	Unverified	0
Fast Empirical Scenarios	Jul 8, 2023	Benchmarking, Decision Making	Unverified	0
Benchmarking Test-Time Adaptation against Distribution Shifts in Image Classification	Jul 6, 2023	Benchmarking, Domain Adaptation	Code Available	1
Structural Property Prediction	Jul 5, 2023	Benchmarking, Prediction	Unverified	0
Performance Modeling of Data Storage Systems using Generative Models	Jul 5, 2023	Benchmarking	Code Available	0
Unsupervised Spectral Demosaicing with Lightweight Spectral Attention Networks	Jul 5, 2023	Benchmarking, Demosaicking	Unverified	0
ClimateLearn: Benchmarking Machine Learning for Weather and Climate Modeling	Jul 4, 2023	Benchmarking, Weather Forecasting	Code Available	2
OpenSiteRec: An Open Dataset for Site Recommendation	Jul 3, 2023	Benchmarking, Information Retrieval	Unverified	0
A Synthetic Benchmarking Pipeline to Compare Camera Calibration Algorithms	Jul 3, 2023	Benchmarking, Camera Calibration	Unverified	0
Conditionally Invariant Representation Learning for Disentangling Cellular Heterogeneity	Jul 2, 2023	Benchmarking, Data Integration	Unverified	0
SysNoise: Exploring and Benchmarking Training-Deployment System Inconsistency	Jul 1, 2023	Benchmarking, Data Augmentation	Unverified	0
InstructEval: Systematic Evaluation of Instruction Selection Methods	Jul 1, 2023	Benchmarking, In-Context Learning	Unverified	0
Learning Environment Models with Continuous Stochastic Dynamics	Jun 29, 2023	Acrobot, Benchmarking	Unverified	0
Benchmarking Large Language Model Capabilities for Conditional Generation	Jun 29, 2023	Benchmarking, Few-Shot Learning	Unverified	0
Principles and Guidelines for Evaluating Social Robot Navigation Algorithms	Jun 29, 2023	Benchmarking, Robot Navigation	Unverified	0
Generative AI for Programming Education: Benchmarking ChatGPT, GPT-4, and Human Tutors	Jun 29, 2023	Benchmarking	Unverified	0
Uncovering the Limits of Machine Learning for Automatic Vulnerability Detection	Jun 28, 2023	Benchmarking, Data Augmentation	Code Available	1
Benchmarking Zero-Shot Recognition with Vision-Language Models: Challenges on Granularity and Specificity	Jun 28, 2023	Benchmarking, Image Captioning	Unverified	0
Effective Transfer of Pretrained Large Visual Model for Fabric Defect Segmentation via Specifc Knowledge Injection	Jun 28, 2023	Benchmarking, Diversity	Unverified	0
Emotion Analysis of Tweets Banning Education in Afghanistan	Jun 28, 2023	Benchmarking, Emotion Classification	Unverified	0
Paradigm Shift in Sustainability Disclosure Analysis: Empowering Stakeholders with CHATREPORT, a Language Model-Based Tool	Jun 27, 2023	Benchmarking, Language Modeling	Unverified	0
Pulse Shape-Aided Multipath Delay Estimation for Fine-Grained WiFi Sensing	Jun 27, 2023	Benchmarking	Unverified	0
Benchmarking Stroke Forecasting with Stroke-Level Badminton Dataset	Jun 27, 2023	Benchmarking	Unverified	0
Enhancing Navigation Benchmarking and Perception Data Generation for Row-based Crops in Simulation	Jun 27, 2023	Autonomous Navigation, Benchmarking	Unverified	0
SCENEREPLICA: Benchmarking Real-World Robot Manipulation by Creating Replicable Scenes	Jun 27, 2023	Benchmarking, Motion Planning	Code Available	1
InterCode: Standardizing and Benchmarking Interactive Coding with Execution Feedback	Jun 26, 2023	Benchmarking, Code Generation	Code Available	2
Improving Reference-based Distinctive Image Captioning with Contrastive Rewards	Jun 25, 2023	Benchmarking, Contrastive Learning	Unverified	0
Hybrid Precoder and Combiner Designs for Decentralized Parameter Estimation in mmWave MIMO Wireless Sensor Networks	Jun 25, 2023	Benchmarking, parameter estimation	Unverified	0

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified