SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 16,901 to 16,950 of 474,278 papers

Title | Status | Hype
Adaptive Per-Tree Canopy Volume Estimation Using Mobile LiDAR in Structured and Unstructured Orchards | - | 0
Standard LS Parameter Estimators Ensure Finite Convergence Time for Linear Regression Equations Under an Interval Excitation Assumption | - | 0
IntenTest: Stress Testing for Intent Integrity in API-Calling LLM Agents | - | 0
Fast Geometric Embedding for Node Influence Maximization | Code | 0
Profiling Electric Vehicles via Early Charging Voltage Patterns | Code | 0
JavelinGuard: Low-Cost Transformer Architectures for LLM Security | - | 0
Boosting Vulnerability Detection of LLMs via Curriculum Preference Optimization with Synthetic Reasoning Data | Code | 0
Transcript-Prompted Whisper with Dictionary-Enhanced Decoding for Japanese Speech Annotation | - | 0
Premise Selection for a Lean Hammer | Code | 1
LLM Unlearning Should Be Form-Independent | - | 0
Lightweight Joint Audio-Visual Deepfake Detection via Single-Stream Multi-Modal Learning Framework | - | 0
Decentralized Optimization on Compact Submanifolds by Quantized Riemannian Gradient Tracking | - | 0
Unified Semi-Supervised Pipeline for Automatic Speech Recognition | - | 0
Does Residuals-on-Residuals Regression Produce Representative Estimates of Causal Effects? | Code | 0
SongBloom: Coherent Song Generation via Interleaved Autoregressive Sketching and Diffusion Refinement | Code | 4
PolitiSky24: U.S. Political Bluesky Dataset with User Stance Labels | - | 0
G-Memory: Tracing Hierarchical Memory for Multi-Agent Systems | Code | 3
Fairness Overfitting in Machine Learning: An Information-Theoretic Perspective | - | 0
MedChat: A Multi-Agent Framework for Multimodal Diagnosis with Large Language Models | Code | 1
AI-Assisted Rapid Crystal Structure Generation Towards a Target Local Environment | Code | 0
LLMs Caught in the Crossfire: Malware Requests and Jailbreak Challenges | Code | 0
LeVo: High-Quality Song Generation with Multi-Preference Alignment | Code | 5
Novel software for continuous wavelet analysis enable EEG real-time analysis on portable computers | - | 0
Cost-Optimal Active AI Model Evaluation | - | 0
Egocentric Event-Based Vision for Ping Pong Ball Trajectory Prediction | Code | 1
τ^2-Bench: Evaluating Conversational Agents in a Dual-Control Environment | Code | 5
PairEdit: Learning Semantic Variations for Exemplar-based Image Editing | Code | 1
A Two-Phase Deep Learning Framework for Adaptive Time-Stepping in High-Speed Flow Modeling | - | 0
MADFormer: Mixed Autoregressive and Diffusion Transformers for Continuous Image Generation | - | 0
M2Restore: Mixture-of-Experts-based Mamba-CNN Fusion Framework for All-in-One Image Restoration | - | 0
Neural Tangent Kernel Analysis to Probe Convergence in Physics-informed Neural Solvers: PIKANs vs. PINNs | - | 0
Swiss Parliaments Corpus Re-Imagined (SPC_R): Enhanced Transcription with RAG-based Correction and Predicted BLEU | - | 0
Reinforcement Learning via Implicit Imitation Guidance | - | 0
A weighted quantum ensemble of homogeneous quantum classifiers | - | 0
Device-Free Localization with Multiple Antenna Receivers: Simulations and Results | - | 0
GaussianVAE: Adaptive Learning Dynamics of 3D Gaussians for High-Fidelity Super-Resolution | - | 0
Incorporating Uncertainty-Guided and Top-k Codebook Matching for Real-World Blind Image Super-Resolution | - | 0
Adaptive Blind Super-Resolution Network for Spatial-Specific and Spatial-Agnostic Degradations | - | 0
Explicit Preference Optimization: No Need for an Implicit Reward Model | Code | 0
LiteVLM: A Low-Latency Vision-Language Model Inference Pipeline for Resource-Constrained Environments | - | 0
OpenDance: Multimodal Controllable 3D Dance Generation Using Large-scale Internet Data | - | 0
No Stupid Questions: An Analysis of Question Query Generation for Citation Recommendation | - | 0
OpenSplat3D: Open-Vocabulary 3D Instance Segmentation using Gaussian Splatting | - | 0
Adapter Naturally Serves as Decoupler for Cross-Domain Few-Shot Semantic Segmentation | - | 0
Design and Evaluation of Deep Learning-Based Dual-Spectrum Image Fusion Methods | - | 0
Explore the vulnerability of black-box models via diffusion models | - | 0
CrosswalkNet: An Optimized Deep Learning Framework for Pedestrian Crosswalk Detection in Aerial Images with High-Performance Computing | - | 0
PolyVivid: Vivid Multi-Subject Video Generation with Cross-Modal Interaction and Enhancement | - | 0
F2Net: A Frequency-Fused Network for Ultra-High Resolution Remote Sensing Segmentation | - | 0
REMoH: A Reflective Evolution of Multi-objective Heuristics approach via Large Language Models | - | 0
Page 339 of 9,486