Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 2651–2675 of 5548 papers

Title	Date	Tasks	Status	Hype
TAO-Amodal: A Benchmark for Tracking Any Object Amodally	Dec 19, 2023	Amodal TrackingAutonomous Driving	CodeCode Available	1
Bio-Image Informatics Index BIII: A unique database of image analysis tools and workflows for and by the bioimaging community	Dec 18, 2023	Benchmarking	—Unverified	0
QDA^2: A principled approach to automatically annotating charge stability diagrams	Dec 18, 2023	Benchmarking	—Unverified	0
MA-BBOB: A Problem Generator for Black-Box Optimization Using Affine Combinations and Shifts	Dec 18, 2023	Benchmarking	—Unverified	0
Code Ownership in Open-Source AI Software Security	Dec 18, 2023	Benchmarking	CodeCode Available	0
FER-C: Benchmarking Out-of-Distribution Soft Calibration for Facial Expression Recognition	Dec 16, 2023	BenchmarkingFacial Expression Recognition	—Unverified	0
How to Train Neural Field Representations: A Comprehensive Study and Benchmark	Dec 16, 2023	Benchmarking	CodeCode Available	1
Enabling Accelerators for Graph Computing	Dec 16, 2023	Benchmarking	—Unverified	0
A Novel Hybrid Ordinal Learning Model with Health Care Application	Dec 15, 2023	BenchmarkingDiagnostic	—Unverified	0
ChemTime: Rapid and Early Classification for Multivariate Time Series Classification of Chemical Sensors	Dec 15, 2023	BenchmarkingClassification	—Unverified	0
Binary Code Summarization: Benchmarking ChatGPT/GPT-4 and Other Large Language Models	Dec 15, 2023	BenchmarkingCode Summarization	CodeCode Available	1
SPEAL: Skeletal Prior Embedded Attention Learning for Cross-Source Point Cloud Registration	Dec 14, 2023	BenchmarkingPoint Cloud Registration	—Unverified	0
Efficiently Quantifying Individual Agent Importance in Cooperative MARL	Dec 13, 2023	BenchmarkingMulti-agent Reinforcement Learning	—Unverified	0
EventAid: Benchmarking Event-aided Image/Video Enhancement Algorithms with Real-captured Hybrid Dataset	Dec 13, 2023	BenchmarkingDeblurring	—Unverified	0
Watchog: A Light-weight Contrastive Learning based Framework for Column Annotation	Dec 12, 2023	BenchmarkingColumns Property Annotation	—Unverified	0
Benchmarking Deep Learning Classifiers for SAR Automatic Target Recognition	Dec 12, 2023	BenchmarkingDeep Learning	—Unverified	0
Meta-survey on outlier and anomaly detection	Dec 12, 2023	Anomaly DetectionBenchmarking	CodeCode Available	0
How Well Does GPT-4V(ision) Adapt to Distribution Shifts? A Preliminary Investigation	Dec 12, 2023	Anomaly DetectionAutonomous Driving	CodeCode Available	1
Benchmarking Pretrained Vision Embeddings for Near- and Duplicate Detection in Medical Images	Dec 12, 2023	BenchmarkingRetrieval	—Unverified	0
EgoPlan-Bench: Benchmarking Multimodal Large Language Models for Human-Level Planning	Dec 11, 2023	BenchmarkingHuman-Object Interaction Detection	CodeCode Available	1
Implementing hosting capacity analysis in distribution networks: Practical considerations, advancements and future directions	Dec 11, 2023	BenchmarkingCapacity Estimation	—Unverified	0
Cataract-1K: Cataract Surgery Dataset for Scene Segmentation, Phase Recognition, and Irregularity Detection	Dec 11, 2023	BenchmarkingDomain Adaptation	—Unverified	0
EQ-Bench: An Emotional Intelligence Benchmark for Large Language Models	Dec 11, 2023	BenchmarkingEmotional Intelligence	CodeCode Available	2
Benchmarking Distribution Shift in Tabular Data with TableShift	Dec 10, 2023	BenchmarkingBinary Classification	CodeCode Available	1
AM-RADIO: Agglomerative Vision Foundation Model -- Reduce All Domains Into One	Dec 10, 2023	AllBenchmarking	CodeCode Available	3

Show:10 25 50

← PrevPage 107 of 222Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified