SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 1175111800 of 661570 papers

TitleStatusHype
Q-BERT4Rec: Quantized Semantic-ID Representation Learning for Multimodal Recommendation0
CHAMMI-75: Pre-training multi-channel models with heterogeneous microscopy images0
Automated Data Enrichment using Confidence-Aware Fine-Grained Debate among Open-Source LLMs for Mental Health and Online Safety0
Learning to Evolve for Optimization via Stability-Inducing Neural Unrolling0
Nightjar: Dynamic Adaptive Speculative Decoding for Large Language Models Serving0
Multi-Scenario Highway Lane-Change Intention Prediction: A Temporal Physics-Informed Multi-Modal Framework0
Quantized SO(3)-Equivariant Graph Neural Networks for Efficient Molecular Property Prediction0
UniDrive-WM: Unified Understanding, Planning and Generation World Model For Autonomous Driving0
Entropy Sentinel: Continuous LLM Accuracy Monitoring from Decoding Entropy Traces in STEM0
Discrete Solution Operator Learning for Geometry-Dependent PDEs0
Hot-Start from Pixels: Low-Resolution Visual Tokens for Chinese Language Modeling0
SwiftRepertoire: Few-Shot Immune-Signature Synthesis via Dynamic Kernel Codes0
Zero-Permission Manipulation: Can We Trust Large Multimodal Model Powered GUI Agents?0
Quantization-Aware Distillation for NVFP4 Inference Accuracy Recovery0
Multimodal Multi-Agent Ransomware Analysis Using AutoGen0
Sustainable Materials Discovery in the Era of Artificial Intelligence0
On the Relationship Between Representation Geometry and Generalization in Deep Neural Networks0
Semantic-level Backdoor Attack against Text-to-Image Diffusion Models0
Near-Constant Strong Violation and Last-Iterate Convergence for Online CMDPs via Decaying Safety Margins0
MoToRec: Sparse-Regularized Multimodal Tokenization for Cold-Start Recommendation0
From Pairs to Sequences: Track-Aware Policy Gradients for Keypoint Detection0
Classroom Final Exam: An Instructor-Tested Reasoning BenchmarkCode0
A Researcher's Guide to Empirical Risk Minimization0
RuCL: Stratified Rubric-Based Curriculum Learning for Multimodal Large Language Model Reasoning0
Learning-Augmented Moment Estimation on Time-Decay Models0
PSQE: A Theoretical-Practical Approach to Pseudo Seed Quality Enhancement for Unsupervised Multimodal Entity Alignment0
Uni-Animator: Towards Unified Visual Colorization0
Tell Me What To Learn: Generalizing Neural Memory to be Controllable in Natural Language0
FlexGuard: Continuous Risk Scoring for Strictness-Adaptive LLM Content Moderation0
APPO: Attention-guided Perception Policy Optimization for Video Reasoning0
DeepXiv-SDK: An Agentic Data Interface for Scientific Literature0
GLIDE-Reg: Global-to-Local Deformable Registration Using Co-Optimized Foundation and Handcrafted Features0
CoPeP: Benchmarking Continual Pretraining for Protein Language Models0
NeuroHex: Highly-Efficient Hex Coordinate System for Creating World Models to Enable Adaptive AI0
PreciseCache: Precise Feature Caching for Efficient and High-fidelity Video Generation0
HeroGS: Hierarchical Guidance for Robust 3D Gaussian Splatting under Sparse Views0
FACE: A Face-based Autoregressive Representation for High-Fidelity and Efficient Mesh Generation0
InterCoG: Towards Spatially Precise Image Editing with Interleaved Chain-of-Grounding Reasoning0
RubricBench: Aligning Model-Generated Rubrics with Human Standards1
PromptStereo: Zero-Shot Stereo Matching via Structure and Motion Prompts0
QIME: Constructing Interpretable Medical Text Embeddings via Ontology-Grounded Questions0
Solving Inverse PDE Problems using Minimization Methods and AI0
What Capable Agents Must Know: Selection Theorems for Robust Decision-Making under Uncertainty0
Rethinking Policy Diversity in Ensemble Policy Gradient in Large-Scale Reinforcement Learning0
Recursive Think-Answer Process for LLMs and VLMs0
Nano-EmoX: Unifying Multimodal Emotional Intelligence from Perception to Empathy0
From Shallow to Deep: Pinning Semantic Intent via Causal GRPO0
OnlineX: Unified Online 3D Reconstruction and Understanding with Active-to-Stable State Evolution0
Geometric structures and deviations on James' symmetric positive-definite matrix bicone domain0
HiFi-Inpaint: Towards High-Fidelity Reference-Based Inpainting for Generating Detail-Preserving Human-Product Images2
Show:102550
← PrevPage 236 of 13232Next →