Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 4076–4100 of 5548 papers

Title	Date	Tasks	Status
Efficient Demand Response Location Targeting for Price Spike Mitigation by Exploiting Price-demand Relationship	Nov 27, 2022	Benchmarking	—Unverified
TARGO: Benchmarking Target-driven Object Grasping under Occlusions	Jul 8, 2024	BenchmarkingObject	—Unverified
Task-oriented Over-the-air Computation for Edge-device Co-inference with Balanced Classification Accuracy	Jul 1, 2024	Benchmarking	—Unverified
TBD: Benchmarking and Analyzing Deep Neural Network Training	Mar 16, 2018	BenchmarkingGeneral Classification	—Unverified
TDDBench: A Benchmark for Training data detection	Nov 5, 2024	BenchmarkingComputational Efficiency	—Unverified
TDVE-Assessor: Benchmarking and Evaluating the Quality of Text-Driven Video Editing with LMMs	May 26, 2025	BenchmarkingLarge Language Model	—Unverified
TeamTrack: A Dataset for Multi-Sport Multi-Object Tracking in Full-pitch Videos	Apr 22, 2024	BenchmarkingMulti-Object Tracking	—Unverified
Teaspoon: A comprehensive python package for topological signal processing	Oct 10, 2020	BenchmarkingTopological Data Analysis	—Unverified
Technical report of a DMD-based Characterization Method for Vision Sensors	Mar 4, 2025	BenchmarkingDataset Generation	—Unverified
Technological Approaches to Detecting Online Disinformation and Manipulation	Aug 26, 2021	BenchmarkingFact Checking	—Unverified
TelcoLM: collecting data, adapting, and benchmarking language models for the telecommunication domain	Dec 20, 2024	Benchmarking	—Unverified
TELeR: A General Taxonomy of LLM Prompts for Benchmarking Complex Tasks	May 19, 2023	Benchmarking	—Unverified
Tell Your Story: Task-Oriented Dialogs for Interactive Content Creation	Nov 8, 2022	BenchmarkingRetrieval	—Unverified
Temporal cross-validation impacts multivariate time series subsequence anomaly detection evaluation	Jun 13, 2025	Anomaly DetectionBenchmarking	—Unverified
Temporal Graphs Anomaly Emergence Detection: Benchmarking For Social Media Interactions	Jul 11, 2023	Anomaly DetectionBenchmarking	—Unverified
Temporal Validity Change Prediction	Jan 1, 2024	BenchmarkingPrediction	—Unverified
TEP-GNN: Accurate Execution Time Prediction of Functional Tests using Graph Neural Networks	Aug 25, 2022	BenchmarkingGraph Neural Network	—Unverified
Terabyte-scale supervised 3D training and benchmarking dataset of the mouse kidney	Aug 4, 2021	BenchmarkingBIG-bench Machine Learning	—Unverified
Term-Class-Max-Support (TCMS): A Simple Text Document Categorization Approach Using Term-Class Relevance Measure	Oct 16, 2016	BenchmarkingText Categorization	—Unverified
Test-driven Software Experimentation with LASSO: an LLM Prompt Benchmarking Example	Oct 11, 2024	BenchmarkingCode Generation	—Unverified
Tetrad: Actively Secure 4PC for Secure Training and Inference	Jun 5, 2021	BenchmarkingFairness	—Unverified
Text2World: Benchmarking Large Language Models for Symbolic World Model Generation	Feb 18, 2025	Benchmarking	—Unverified
Text-To-Speech Synthesis In The Wild	Sep 13, 2024	BenchmarkingSpeaker Recognition	—Unverified
Thalamic nuclei segmentation from T_1-weighted MRI: unifying and benchmarking state-of-the-art methods with young and old cohorts	Sep 26, 2023	BenchmarkingSegmentation	—Unverified
The 6th Affective Behavior Analysis in-the-wild (ABAW) Competition	Feb 29, 2024	Action Unit DetectionArousal Estimation	—Unverified

Show:10 25 50

← PrevPage 164 of 222Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified