SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1805118100 of 474278 papers

TitleStatusHype
Neurosymbolic Artificial Intelligence for Robust Network Intrusion Detection: From Scratch to Transfer Learning0
OpenThoughts: Data Recipes for Reasoning ModelsCode7
Training-free AI for Earth Observation Change Detection using Physics Aware Neuromorphic Networks0
Multiscale guidance of AlphaFold3 with heterogeneous cryo-EM data0
Leveraging Coordinate Momentum in SignSGD and Muon: Memory-Optimized Zero-OrderCode0
Knockout LLM Assessment: Using Large Language Models for Evaluations through Iterative Pairwise Comparisons0
SF^2Bench: Evaluating Data-Driven Models for Compound Flood Forecasting in South Florida0
Softlog-Softmax Layers and Divergences Contribute to a Computationally Dependable Ensemble Learning0
A Statistical Physics of Language Model Reasoning0
HSSBench: Benchmarking Humanities and Social Sciences Ability for Multimodal Large Language ModelsCode0
Behavioural vs. Representational Systematicity in End-to-End Models: An Opinionated Survey0
An AI-Based Public Health Data Monitoring System0
Even Faster Hyperbolic Random Forests: A Beltrami-Klein Wrapper ApproachCode1
MambaNeXt-YOLO: A Hybrid State Space Model for Real-time Object Detection0
FullDiT2: Efficient In-Context Conditioning for Video Diffusion Transformers0
HUMOF: Human Motion Forecasting in Interactive Social Scenes0
Towards Efficient Speech-Text Jointly Decoding within One Speech Language Model0
Rectified Sparse Attention0
From Theory to Practice: Real-World Use Cases on Trustworthy LLM-Driven Process Modeling, Prediction and Automation0
Pseudo-Simulation for Autonomous DrivingCode4
"Don't Do That!": Guiding Embodied Systems through Large Language Model-based Constraint Generation0
Finding signatures of low-dimensional geometric landscapes in high-dimensional cell fate transitionsCode0
MFLA: Monotonic Finite Look-ahead Attention for Streaming Speech Recognition0
Understanding Mental Models of Generative Conversational Search and The Effect of Interface Transparency0
Uniqueness of phase retrieval from offset linear canonical transform0
Beamforming and Resource Allocation for Delay Optimization in RIS-Assisted OFDM Systems0
Autonomous Collaborative Scheduling of Time-dependent UAVs, Workers and Vehicles for Crowdsensing in Disaster Response0
From Virtual Agents to Robot Teams: A Multi-Robot Framework Evaluation in High-Stakes Healthcare Context0
Sounding that Object: Interactive Object-Aware Image to Audio Generation0
IntLevPy: A Python library to classify and model intermittent and Lévy processes0
Solving engineering eigenvalue problems with neural networks using the Rayleigh quotient0
CETBench: A Novel Dataset constructed via Transformations over Programs for Benchmarking LLMs for Code-Equivalence Checking0
Unsupervised Meta-Testing with Conditional Neural Processes for Hybrid Meta-Reinforcement Learning0
Object-centric 3D Motion Field for Robot Learning from Human Videos0
SLAC: Simulation-Pretrained Latent Action Space for Whole-Body Real-World RL0
Autonomous Vehicle Lateral Control Using Deep Reinforcement Learning with MPC-PID Demonstration0
Enhancing Safety of Foundation Models for Visual Navigation through Collision Avoidance via Repulsive Estimation0
Understanding Physical Properties of Unseen Deformable Objects by Leveraging Large Language Models and Robot Actions0
SemNav: A Model-Based Planner for Zero-Shot Object Goal Navigation Using Vision-Foundation Models0
Towards Better Disentanglement in Non-Autoregressive Zero-Shot Expressive Voice Conversion0
Effects of Speaker Count, Duration, and Accent Diversity on Zero-Shot Accent Robustness in Low-Resource ASR0
A Novel Data Augmentation Approach for Automatic Speaking Assessment on Opinion Expressions0
Efficient Data Selection for Domain Adaptation of ASR Using Pseudo-Labels and Multi-Stage Filtering0
BitTTS: Highly Compact Text-to-Speech Using 1.58-bit Quantization and Weight Indexing0
Generating Automotive Code: Large Language Models for Software Development and Verification in Safety-Critical Systems0
VisCoder: Fine-Tuning LLMs for Executable Python Visualization Code Generation0
An Improved Finite Element Modeling Method for Triply Periodic Minimal Surface Structures Based on Element Size and Minimum Jacobian0
Discrete Element Parameter Calibration of Livestock Salt Based on Particle Scaling0
Topology-Aware Graph Neural Network-based State Estimation for PMU-Unobservable Power Systems0
BridgeNet: A Hybrid, Physics-Informed Machine Learning Framework for Solving High-Dimensional Fokker-Planck Equations0
Show:102550
← PrevPage 362 of 9486Next →