SOTAVerified

Benchmarking

Papers

Showing 30013025 of 5548 papers

TitleStatusHype
Empirical Analysis of Privacy-Fairness-Accuracy Trade-offs in Federated Learning: A Step Towards Responsible AI0
Empirical Analysis of the Dynamic Binary Value Problem with IOHprofiler0
Empirical Guidelines for Deploying LLMs onto Resource-constrained Edge Devices0
Enabling Accelerators for Graph Computing0
Automated Machine Learning: A Case Study on Non-Intrusive Appliance Load Monitoring0
Enabling Design Methodologies and Future Trends for Edge AI: Specialization and Co-design0
EndoSparse: Real-Time Sparse View Synthesis of Endoscopic Scenes using Gaussian Splatting0
1-D Convlutional Neural Networks for the Analysis of Pupil Size Variations in Scotopic Conditions0
End-to-End Neural Ranking for eCommerce Product Search: an application of task models and textual embeddings0
Energy-Conscious LLM Decoding: Impact of Text Generation Strategies on GPU Energy Consumption0
Energy & Force Regression on DFT Trajectories is Not Enough for Universal Machine Learning Interatomic Potentials0
Energy Management in Storage-Augmented, Grid-Connected Prosumer Buildings and Neighbourhoods Using a Modified Simulated Annealing Optimization0
Enhanced Multiobjective Evolutionary Algorithm based on Decomposition for Solving the Unit Commitment Problem0
Enhancing Distractor Generation for Multiple-Choice Questions with Retrieval Augmented Pretraining and Knowledge Graph Integration0
Enhancing Explainability and Reliable Decision-Making in Particle Swarm Optimization through Communication Topologies0
Enhancing Hand Palm Motion Gesture Recognition by Eliminating Reference Frame Bias via Frame-Invariant Similarity Measures0
Enhancing Image Matting in Real-World Scenes with Mask-Guided Iterative Refinement0
Enhancing Multi-Label Emotion Analysis and Corresponding Intensities for Ethiopian Languages0
Enhancing Navigation Benchmarking and Perception Data Generation for Row-based Crops in Simulation0
Enhancing Post-Hoc Explanation Benchmark Reliability for Image Classification0
Enhancing Q&A Text Retrieval with Ranking Models: Benchmarking, fine-tuning and deploying Rerankers for RAG0
Enhancing Reverse Engineering: Investigating and Benchmarking Large Language Models for Vulnerability Analysis in Decompiled Binaries0
Enhancing TCR-Peptide Interaction Prediction with Pretrained Language Models and Molecular Representations0
Enhancing Trust in LLMs: Algorithms for Comparing and Interpreting LLMs0
EnronQA: Towards Personalized RAG over Private Documents0
Show:102550
← PrevPage 121 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified