SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 2045120500 of 474278 papers

TitleStatusHype
Is Your LLM Overcharging You? Tokenization, Transparency, and IncentivesCode0
A Structured Unplugged Approach for Foundational AI Literacy in Primary EducationCode0
Learning Individual Behavior in Agent-Based Models with Graph Diffusion NetworksCode0
Visual Cues Enhance Predictive Turn-Taking for Two-Party Human InteractionCode0
Efficient Identity and Position Graph Embedding via Spectral-Based Random Feature AggregationCode0
Fedivertex: a Graph Dataset based on Decentralized Social Networks for Trustworthy Machine LearningCode0
Laparoscopic Image Desmoking Using the U-Net with New Loss Function and Integrated Differentiable Wiener FilterCode0
Taylor expansion-based Kolmogorov-Arnold network for blind image quality assessmentCode1
Paper2Poster: Towards Multimodal Poster Automation from Scientific PapersCode7
Wideband RF Radiance Field Modeling Using Frequency-embedded 3D Gaussian SplattingCode0
Towards Pretraining Robust ASR Foundation Model with Acoustic-Aware Data Augmentation0
Loquacious Set: 25,000 Hours of Transcribed and Diverse English Speech Recognition Data for Research and Commercial Use0
Physics-Informed Neural Network for Cross-Domain Predictive Control of Tapered Amplifier Thermal Stabilization0
MRSD: Multi-Resolution Skill Discovery for HRL Agents0
Debiased Ill-Posed Regression0
Generative Image Compression by Estimating Gradients of the Rate-variable Feature Distribution0
Multi-VQC: A Novel QML Approach for Enhancing Healthcare Classification0
AgriFM: A Multi-source Temporal Remote Sensing Foundation Model for Crop MappingCode1
Research Community Perspectives on "Intelligence" and Large Language Models0
Humble AI in the real-world: the case of algorithmic hiring0
Efficient and Microphone-Fault-Tolerant 3D Sound Source Localization0
Dissecting Physics Reasoning in Small Language Models: A Multi-Dimensional Analysis from an Educational Perspective0
Towards Robust Assessment of Pathological Voices via Combined Low-Level Descriptors and Foundation Model Representations0
Hume: Introducing System-2 Thinking in Visual-Language-Action Model0
REWIND: Speech Time Reversal for Enhancing Speaker Representations in Diffusion-based Voice Conversion0
SOSBENCH: Benchmarking Safety Alignment on Scientific Knowledge0
Memorization to Generalization: Emergence of Diffusion Models from Associative Memory0
Semantic Communication meets System 2 ML: How Abstraction, Compositionality and Emergent Languages Shape Intelligence0
VoiceMark: Zero-Shot Voice Cloning-Resistant Watermarking Approach Leveraging Speaker-Specific Latents0
Structure from Collision0
Network classification through random walks0
VideoMarkBench: Benchmarking Robustness of Video WatermarkingCode0
SV-TrustEval-C: Evaluating Structure and Semantic Reasoning in Large Language Models for Source Code Vulnerability AnalysisCode0
REAL-Prover: Retrieval Augmented Lean Prover for Mathematical ReasoningCode1
Learning Where to Learn: Training Distribution Selection for Provable OOD PerformanceCode0
AdInject: Real-World Black-Box Attacks on Web Agents via Advertising DeliveryCode0
AITEE -- Agentic Tutor for Electrical EngineeringCode0
Hybrid Disagreement-Diversity Active Learning for Bioacoustic Sound Event DetectionCode0
A Physics-Augmented GraphGPS Framework for the Reconstruction of 3D Riemann Problems from Sparse DataCode0
AMSFL: Adaptive Multi-Step Federated Learning via Gradient Difference-Based Error Modeling0
Respond to Change with Constancy: Instruction-tuning with LLM for Non-I.I.D. Network Traffic Classification0
CogAD: Cognitive-Hierarchy Guided End-to-End Autonomous Driving0
OmniIndoor3D: Comprehensive Indoor 3D Reconstruction0
Multitemporal Latent Dynamical Framework for Hyperspectral Images Unmixing0
Intelligent Incident Hypertension Prediction in Obstructive Sleep Apnea0
VoxAging: Continuously Tracking Speaker Aging with a Large-Scale Longitudinal Dataset in English and Mandarin0
Recognition of Physiological Patterns during Activities of Daily Living Using Wearable Biosignal Sensors0
Expert Survey: AI Reliability & Security Research Priorities0
Streamlining Knowledge Graph Creation with PyRML0
Algorithms and SQ Lower Bounds for Robustly Learning Real-valued Multi-index Models0
Show:102550
← PrevPage 410 of 9486Next →