Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1926–1950 of 5548 papers

Title	Date	Tasks	Status
Challenges in Benchmarking Stream Learning Algorithms with Real-world Data	Apr 30, 2020	Benchmarking	—Unverified
CRS Arena: Crowdsourced Benchmarking of Conversational Recommender Systems	Dec 13, 2024	BenchmarkingRecommendation Systems	—Unverified
Benchmarking Edge AI Platforms for High-Performance ML Inference	Sep 23, 2024	BenchmarkingCPU	—Unverified
Challenges and Pitfalls of Machine Learning Evaluation and Benchmarking	Apr 29, 2019	BenchmarkingBIG-bench Machine Learning	—Unverified
Benchmarking and Learning Multi-Dimensional Quality Evaluator for Text-to-3D Generation	Dec 15, 2024	3D GenerationBenchmarking	—Unverified
CSPO: Cross-Market Synergistic Stock Price Movement Forecasting with Pseudo-volatility Optimization	Mar 26, 2025	Benchmarking	—Unverified
CSR-Bench: Benchmarking LLM Agents in Deployment of Computer Science Research Repositories	Feb 10, 2025	Benchmarking	—Unverified
Challenges and perspectives in computational deconvolution of genomics data	Nov 21, 2022	Benchmarking	—Unverified
Evaluation of simulation methods for tumor subclonal reconstruction	Feb 14, 2024	Benchmarking	—Unverified
Benchmarking and In-depth Performance Study of Large Language Models on Habana Gaudi Processors	Sep 29, 2023	BenchmarkingComputational Efficiency	—Unverified
AN ELIXIR FOR BLOCKCHAIN SCALABILITY WITH CHANNEL BASED CLUSTERED SHARDING	Dec 20, 2023	Benchmarking	—Unverified
Challenges and Advancements in Modeling Shock Fronts with Physics-Informed Neural Networks: A Review and Benchmarking Study	Mar 14, 2025	Benchmarking	—Unverified
CubeSat-Enabled Free-Space Optics: Joint Data Communication and Fine Beam Tracking	Jun 13, 2024	Benchmarking	—Unverified
Benchmarking End-to-end Learning of MIMO Physical-Layer Communication	May 19, 2020	Benchmarking	—Unverified
Challenge Results Are Not Reproducible	Jul 14, 2023	BenchmarkingImage Segmentation	—Unverified
A Dataset Similarity Evaluation Framework for Wireless Communications and Sensing	Dec 7, 2024	BenchmarkingDimensionality Reduction	—Unverified
Benchmarking End-To-End Performance of AI-Based Chip Placement Algorithms	Jul 3, 2024	BenchmarkingCPU	—Unverified
ChakmaNMT: A Low-resource Machine Translation On Chakma Language	Oct 14, 2024	BenchmarkingMachine Translation	—Unverified
CURE: Concept Unlearning via Orthogonal Representation Editing in Diffusion Models	May 19, 2025	BenchmarkingRed Teaming	—Unverified
Benchmarking Energy-Conserving Neural Networks for Learning Dynamics from Data	Dec 3, 2020	BenchmarkingInductive Bias	—Unverified
Chain of LoRA: Efficient Fine-tuning of Language Models via Residual Learning	Jan 8, 2024	BenchmarkingCoLA	—Unverified
Audio Turing Test: Benchmarking the Human-likeness of Large Language Model-based Text-to-Speech Systems in Chinese	May 16, 2025	BenchmarkingLanguage Modeling	—Unverified
Curse of Slicing: Why Sliced Mutual Information is a Deceptive Measure of Statistical Dependence	Jun 4, 2025	Benchmarking	—Unverified
Benchmarking Estimators for Natural Experiments: A Novel Dataset and a Doubly Robust Algorithm	Sep 6, 2024	Benchmarkingregression	—Unverified
C-FedRAG: A Confidential Federated Retrieval-Augmented Generation System	Dec 17, 2024	BenchmarkingRAG	—Unverified

Show:10 25 50

← PrevPage 78 of 222Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified