SOTAVerified

Benchmarking

Papers

Showing 4201-4225 of 5548 papers

Title | Status | Hype
--- | --- | ---
Towards responsible AI for education: Hybrid human-AI to confront the Elephant in the room | | 0
Towards Robust and Generalizable Gerchberg Saxton based Physics Inspired Neural Networks for Computer Generated Holography: A Sensitivity Analysis Framework | | 0
Towards Robust Evaluation: A Comprehensive Taxonomy of Datasets and Metrics for Open Domain Question Answering in the Era of Large Language Models | | 0
Towards Sentiment Analysis of Tobacco Products’ Usage in Social Media | | 0
Towards Spoken Mathematical Reasoning: Benchmarking Speech-based Models over Multi-faceted Math Problems | | 0
Towards Stable 3D Object Detection | | 0
Towards Toxic Positivity Detection | | 0
Towards Trustworthy Deception Detection: Benchmarking Model Robustness across Domains, Modalities, and Languages | | 0
Towards Universal Learning-based Model for Cardiac Image Reconstruction: Summary of the CMRxRecon2024 Challenge | | 0
Towards Visual Text Grounding of Multimodal Large Language Model | | 0
Towards Zero-Shot Differential Morphing Attack Detection with Multimodal Large Language Models | | 0
Toward Transparent AI: A Survey on Interpreting the Inner Structures of Deep Neural Networks | | 0
TP-RAG: Benchmarking Retrieval-Augmented Large Language Model Agents for Spatiotemporal-Aware Travel Planning | | 0
Tracking Everything in Robotic-Assisted Surgery | | 0
Training Mixed-Domain Translation Models via Federated Learning | | 0
Training neural mapping schemes for satellite altimetry with simulation data | | 0
Training Transformers with Enforced Lipschitz Constants | | 0
Trajectory Normalized Gradients for Distributed Optimization | | 0
TRAM: Benchmarking Temporal Reasoning for Large Language Models | | 0
Transactive Local Energy Markets Enable Community-Level Resource Coordination Using Individual Rewards | | 0
TransBench: Benchmarking Machine Translation for Industrial-Scale Applications | | 0
Transfer of Knowledge through Reverse Annealing: A Preliminary Analysis of the Benefits and What to Share | | 0
Transformed Subspace Clustering | | 0
Transformers in Protein: A Survey | | 0
Transformers Utilization in Chart Understanding: A Review of Recent Advances & Future Trends | | 0
Page 169 of 222

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
--- | --- | --- | --- | --- | ---
1 | GPT-4 Turbo | ACC | 0.56 | | Unverified