SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 2065120700 of 474278 papers

TitleStatusHype
Diffusion Auto-regressive Transformer for Effective Self-supervised Time Series ForecastingCode1
Batched Bayesian optimization by maximizing the probability of including the optimumCode1
Evaluating Performance and Bias of Negative Sampling in Large-Scale Sequential Recommendation ModelsCode1
Physics-Informed Regularization for Domain-Agnostic Dynamical System ModelingCode1
Equi-GSPR: Equivariant SE(3) Graph Network Model for Sparse Point Cloud RegistrationCode1
DiffusionGuard: A Robust Defense Against Malicious Diffusion-based Image EditingCode1
Efficient Few-shot Learning for Multi-label Classification of Scientific Documents with Many ClassesCode1
GlucoBench: Curated List of Continuous Glucose Monitoring Datasets with Prediction BenchmarksCode1
FACMIC: Federated Adaptative CLIP Model for Medical Image ClassificationCode1
Amortized Control of Continuous State Space Feynman-Kac Model for Irregular Time SeriesCode1
Feature Selection Gates with Gradient Routing for Endoscopic Image ComputingCode1
SELECT: A Large-Scale Benchmark of Data Curation Strategies for Image ClassificationCode1
Image Watermarks are Removable Using Controllable Regeneration from Clean NoiseCode1
SePPO: Semi-Policy Preference Optimization for Diffusion AlignmentCode1
Model-GLUE: Democratized LLM Scaling for A Large Model Zoo in the WildCode1
From Sparse Dependence to Sparse Attention: Unveiling How Chain-of-Thought Enhances Transformer Sample EfficiencyCode1
ZEBRA: Zero-Shot Example-Based Retrieval Augmentation for Commonsense Question AnsweringCode1
ImProver: Agent-Based Automated Proof OptimizationCode1
Collaboration! Towards Robust Neural Methods for Routing ProblemsCode1
Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic CompositionalityCode1
Toward General Object-level Mapping from Sparse Views with 3D Diffusion PriorsCode1
Can LLMs Understand Time Series Anomalies?Code1
Neural Fourier Modelling: A Highly Compact Approach to Time-Series AnalysisCode1
Fast Training of Sinusoidal Neural Fields via Scaling InitializationCode1
Continuous Ensemble Weather Forecasting with Diffusion modelsCode1
GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language ModelsCode1
Fine-Tuning CLIP's Last Visual Projector: A Few-Shot CornucopiaCode1
RoWeeder: Unsupervised Weed Mapping through Crop-Row DetectionCode1
What makes your model a low-empathy or warmth person: Exploring the Origins of Personality in LLMsCode1
PRFusion: Toward Effective and Robust Multi-Modal Place Recognition with Image and Point Cloud FusionCode1
DAPE V2: Process Attention Score as Feature Map for Length ExtrapolationCode1
MARs: Multi-view Attention Regularizations for Patch-based Feature Recognition of Space TerrainCode1
DiffuseReg: Denoising Diffusion Model for Obtaining Deformation Fields in Unsupervised Deformable Image RegistrationCode1
ActiView: Evaluating Active Perception Ability for Multimodal Large Language ModelsCode1
Beyond FVD: Enhanced Evaluation Metrics for Video Generation QualityCode1
Refining Counterfactual Explanations With Joint-Distribution-Informed Shapley Towards Actionable MinimalityCode1
MOFFlow: Flow Matching for Structure Prediction of Metal-Organic FrameworksCode1
Forgetting Curve: A Reliable Method for Evaluating Memorization Capability for Long-context ModelsCode1
A Recipe For Building a Compliant Real Estate ChatbotCode1
Enhanced Super-Resolution Training via Mimicked Alignment for Real-World ScenesCode1
NeuroBOLT: Resting-state EEG-to-fMRI Synthesis with Multi-dimensional Feature MappingCode1
R-Bench: Are your Large Multimodal Model Robust to Real-world Corruptions?Code1
PostEdit: Posterior Sampling for Efficient Zero-Shot Image EditingCode1
D-PoSE: Depth as an Intermediate Representation for 3D Human Pose and Shape EstimationCode1
Hyper-Representations: Learning from Populations of Neural NetworksCode1
Unsupervised Representation Learning from Sparse Transformation AnalysisCode1
Spatio-Temporal 3D Point Clouds from WiFi-CSI Data via Transformer NetworksCode1
CogDevelop2K: Reversed Cognitive Development in Multimodal Large Language ModelsCode1
Towards Secure Tuning: Mitigating Security Risks Arising from Benign Instruction Fine-TuningCode1
Algorithmic Capabilities of Random TransformersCode1
Show:102550
← PrevPage 414 of 9486Next →