SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 9976-10000 of 474278 papers

Title | Status | Hype
BEFUnet: A Hybrid CNN-Transformer Architecture for Precise Medical Image Segmentation | Code | 2
LLaGA: Large Language and Graph Assistant | Code | 2
Translating Images to Road Network: A Sequence-to-Sequence Perspective | Code | 2
Transductive Active Learning: Theory and Applications | Code | 2
RBF-PINN: Non-Fourier Positional Embedding in Physics-Informed Neural Networks | Code | 2
InstructGraph: Boosting Large Language Models via Graph-centric Instruction Tuning and Preference Alignment | Code | 2
Do Membership Inference Attacks Work on Large Language Models? | Code | 2
Mercury: A Code Efficiency Benchmark for Code Large Language Models | Code | 2
Customizable Perturbation Synthesis for Robust SLAM Benchmarking | Code | 2
CyberMetric: A Benchmark Dataset based on Retrieval-Augmented Generation for Evaluating LLMs in Cybersecurity Knowledge | Code | 2
Fairness Evaluation for Uplift Modeling in the Absence of Ground Truth | Code | 2
Cartesian atomic cluster expansion for machine learning interatomic potentials | Code | 2
Autonomous Data Selection with Zero-shot Generative Classifiers for Mathematical Texts | Code | 2
One Train for Two Tasks: An Encrypted Traffic Classification Framework Using Supervised Contrastive Learning | Code | 2
AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension | Code | 2
ITINERA: Integrating Spatial Optimization with Large Language Models for Open-domain Urban Itinerary Planning | Code | 2
KVQ: Kwai Video Quality Assessment for Short-form Videos | Code | 2
GraphTranslator: Aligning Graph Model to Large Language Model for Open-ended Tasks | Code | 2
Feature Mapping in Physics-Informed Neural Networks (PINNs) | Code | 2
A Change Detection Reality Check | Code | 2
GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators | Code | 2
UrbanKGent: A Unified Large Language Model Agent Framework for Urban Knowledge Graph Construction | Code | 2
Video Annotator: A framework for efficiently building video classifiers using vision-language models and active learning | Code | 2
Neural SPH: Improved Neural Modeling of Lagrangian Fluid Dynamics | Code | 2
Iterated Denoising Energy Matching for Sampling from Boltzmann Densities | Code | 2
Page 400 of 18972