SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1265112700 of 474278 papers

TitleStatusHype
BrainLesion Suite: A Flexible and User-Friendly Framework for Modular Brain Lesion Image AnalysisCode1
M2-Reasoning: Empowering MLLMs with Unified General and Spatial Reasoning0
Exploiting Leaderboards for Large-Scale Distribution of Malicious Models0
Towards Imperceptible JPEG Image Hiding: Multi-range Representations-driven Adversarial Stego Generation0
Lightweight Safety Guardrails via Synthetic Data and RL-guided Adversarial Training0
Admissibility of Stein Shrinkage for Batch Normalization in the Presence of Adversarial Attacks0
Towards Collaborative Fairness in Federated Learning Under Imbalanced Covariate Shift0
SFedKD: Sequential Federated Learning with Discrepancy-Aware Multi-Teacher Knowledge Distillation0
Lizard: An Efficient Linearization Framework for Large Language Models0
From Physics to Foundation Models: A Review of AI-Driven Quantitative Remote Sensing Inversion0
FreeAudio: Training-Free Timing Planning for Controllable Long-Form Text-to-Audio Generation0
MIDI-VALLE: Improving Expressive Piano Performance Synthesis Through Neural Codec Language Modelling0
Compress Any Segment Anything Model (SAM)Code1
Disentangling Instance and Scene Contexts for 3D Semantic Scene CompletionCode1
Dual Dimensions Geometric Representation Learning Based Document DewarpingCode1
Dynamic Parameter Memory: Temporary LoRA-Enhanced LLM for Long-Sequence Emotion Recognition in ConversationCode0
Unsupervised Methods for Video Quality Improvement: A Survey of Restoration and Enhancement Techniques0
Lumos-1: On Autoregressive Video Generation from a Unified Model PerspectiveCode0
Review of Feed-forward 3D Reconstruction: From DUSt3R to VGGT0
Geo-ORBIT: A Federated Digital Twin Framework for Scene-Adaptive Lane Geometry DetectionCode0
Repairing Language Model Pipelines by Meta Self-Refining Competing Constraints at RuntimeCode0
An Efficient Approach for Muscle Segmentation and 3D Reconstruction Using Keypoint Tracking in MRI Scan0
RadiomicsRetrieval: A Customizable Framework for Medical Image Retrieval Using Radiomics FeaturesCode1
VIP: Visual Information Protection through Adversarial Attacks on Vision-Language ModelsCode0
Vision Foundation Models as Effective Visual Tokenizers for Autoregressive Image Generation0
The Bayesian Approach to Continual Learning: An Overview0
AgentsNet: Coordination and Collaborative Reasoning in Multi-Agent LLMs0
Fairness Is Not Enough: Auditing Competence and Intersectional Bias in AI-powered Resume ScreeningCode0
Model Parallelism With Subnetwork Data Parallelism0
Scaling Attention to Very Long Sequences in Linear Time with Wavelet-Enhanced Random Spectral Attention (WERSA)Code0
When and Where do Data Poisons Attack Textual Inversion?Code0
An Offline Mobile Conversational Agent for Mental Health Support: Learning from Emotional Dialogues and Psychological Texts with Student-Centered Evaluation0
ByDeWay: Boost Your multimodal LLM with DEpth prompting in a Training-Free Way0
Entangled Threats: A Unified Kill Chain Model for Quantum Machine Learning Security0
An Adaptive Volatility-based Learning Rate Scheduler0
Comparative Analysis of Vision Transformers and Traditional Deep Learning Approaches for Automated Pneumonia Detection in Chest X-Rays0
From Classical Machine Learning to Emerging Foundation Models: Review on Multimodal Data Integration for Cancer ResearchCode0
Exploring Design of Multi-Agent LLM Dialogues for Research IdeationCode1
Car Object Counting and Position Estimation via Extension of the CLIP-EBC FrameworkCode0
RePaintGS: Reference-Guided Gaussian Splatting for Realistic and View-Consistent 3D Scene Inpainting0
A Practical Two-Stage Recipe for Mathematical LLMs: Maximizing Accuracy with SFT and Efficiency with Reinforcement LearningCode1
KAT-V1: Kwai-AutoThink Technical Report0
Prospective Learning in RetrospectCode0
Corvid: Improving Multimodal Large Language Models Towards Chain-of-Thought Reasoning0
Machine Bullshit: Characterizing the Emergent Disregard for Truth in Large Language Models0
Multi-Granular Spatio-Temporal Token Merging for Training-Free Acceleration of Video LLMs0
Shifting from Ranking to Set Selection for Retrieval Augmented GenerationCode0
Multi-modal Representations for Fine-grained Multi-label Critical View of Safety RecognitionCode0
Dual Semantic-Aware Network for Noise Suppressed Ultrasound Video SegmentationCode0
RLEP: Reinforcement Learning with Experience Replay for LLM ReasoningCode0
Show:102550
← PrevPage 254 of 9486Next →