SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 15501-15550 of 474,278 papers

Title | Status | Hype
Qwen vs. Gemma Integration with Whisper: A Comparative Study in Multilingual SpeechLLM Systems | | 0
Logical Expressiveness of Graph Neural Networks with Hierarchical Node Individualization | Code | 0
Delving Into the Psychology of Machines: Exploring the Structure of Self-Regulated Learning via LLM-Generated Survey Responses | | 0
AutoVLA: A Vision-Language-Action Model for End-to-End Autonomous Driving with Adaptive Reasoning and Reinforcement Fine-Tuning | Code | 3
Evaluating Large Language Models for Phishing Detection, Self-Consistency, Faithfulness, and Explainability | Code | 0
A Survey on World Models Grounded in Acoustic Physical Information | Code | 0
Fake it till You Make it: Reward Modeling as Discriminative Prediction | | 0
Towards Pervasive Distributed Agentic Generative AI -- A State of The Art | | 0
OPTIMUS: Observing Persistent Transformations in Multi-temporal Unlabeled Satellite-data | | 0
GeoRecon: Graph-Level Representation Learning for 3D Molecules via Reconstruction-Based Pretraining | | 0
Weakest Link in the Chain: Security Vulnerabilities in Advanced Reasoning Models | | 0
Can you see how I learn? Human observers' inferences about Reinforcement Learning agents' learning processes | | 0
A Survey on Imitation Learning for Contact-Rich Tasks in Robotics | | 0
From Flat to Feeling: A Feasibility and Impact Study on Dynamic Facial Emotions in AI-Generated Avatars | | 0
SPOT: Bridging Natural Language and Geospatial Search for Investigative Journalists | | 0
Taming Polysemanticity in LLMs: Provable Feature Recovery via Sparse Autoencoders | | 0
DoA Estimation using MUSIC with Range/Doppler Multiplexing for MIMO-OFDM Radar | | 0
Stability Analysis of Physics-Informed Neural Networks via Variational Coercivity, Perturbation Bounds, and Concentration Estimates | | 0
Dynamic Preference Multi-Objective Reinforcement Learning for Internet Network Management | | 0
IKDiffuser: A Generative Inverse Kinematics Solver for Multi-arm Robots via Diffusion Model | | 0
ROSA: Harnessing Robot States for Vision-Language and Action Alignment | | 0
Agent Capability Negotiation and Binding Protocol (ACNBP) | Code | 0
TextureSplat: Per-Primitive Texture Mapping for Reflective Gaussian Splatting | Code | 0
Polyra Swarms: A Shape-Based Approach to Machine Learning | | 0
JENGA: Object selection and pose estimation for robotic grasping from a stack | | 0
VideoPDE: Unified Generative PDE Solving via Video Inpainting Diffusion Models | | 0
Block-wise Adaptive Caching for Accelerating Diffusion Policy | | 0
FrontendBench: A Benchmark for Evaluating LLMs on Front-End Development via Automatic Evaluation | | 0
Seewo's Submission to MLC-SLM: Lessons learned from Speech Reasoning Language Models | | 0
PeakWeather: MeteoSwiss Weather Station Measurements for Spatiotemporal Deep Learning | Code | 1
Stream-Omni: Simultaneous Multimodal Interactions with Large Language-Vision-Speech Model | Code | 5
Membership Inference Attacks as Privacy Tools: Reliability, Disparity and Ensemble | Code | 1
ZipVoice: Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching | Code | 4
SuperPoint-SLAM3: Augmenting ORB-SLAM3 with Deep Features, Adaptive NMS, and Learning-Based Loop Closure | Code | 2
Global Convergence of Adjoint-Optimized Neural PDEs | Code | 0
EAQuant: Enhancing Post-Training Quantization for MoE Models via Expert-Aware Optimization | Code | 0
Probing Deep into Temporal Profile Makes the Infrared Small Target Detector Much Better | Code | 1
SMPL Normal Map Is All You Need for Single-view Textured Human Reconstruction | | 0
ComplexBench-Edit: Benchmarking Complex Instruction-Driven Image Editing via Compositional Dependencies | Code | 1
Evaluating Cell Type Inference in Vision Language Models Under Varying Visual Context | Code | 0
Dynamic Scheduling for Enhanced Performance in RIS-assisted Cooperative Network with Interference | | 0
Effect Decomposition of Functional-Output Computer Experiments via Orthogonal Additive Gaussian Processes | | 0
PDCNet: a benchmark and general deep learning framework for activity prediction of peptide-drug conjugates | | 0
MORIC: CSI Delay-Doppler Decomposition for Robust Wi-Fi-based Human Activity Recognition | | 0
Improving spliced alignment by modeling splice sites with deep learning | Code | 2
Uncovering Social Network Activity Using Joint User and Topic Interaction | | 0
KCLNet: Physics-Informed Power Flow Prediction via Constraints Projections | | 0
Nonlinear Model Order Reduction of Dynamical Systems in Process Engineering: Review and Comparison | | 0
GM-LDM: Latent Diffusion Model for Brain Biomarker Identification through Functional Data-Driven Gray Matter Synthesis | | 0
Predicting Genetic Mutations from Single-Cell Bone Marrow Images in Acute Myeloid Leukemia Using Noise-Robust Deep Learning Models | | 0
Page 311 of 9486