SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 2080120850 of 474278 papers

TitleStatusHype
EvoMesh: Adaptive Physical Simulation with Hierarchical Graph EvolutionsCode1
RDEIC: Accelerating Diffusion-Based Extreme Image Compression with Relay Residual DiffusionCode1
Encryption-Friendly LLM ArchitectureCode1
Vinoground: Scrutinizing LMMs over Dense Temporal Reasoning with Short VideosCode1
Pseudo-Stereo Inputs: A Solution to the Occlusion Challenge in Self-Supervised Stereo MatchingCode1
Erasing Conceptual Knowledge from Language ModelsCode1
Response Estimation and System Identification of Dynamical Systems via Physics-Informed Neural NetworksCode1
TorchSISSO: A PyTorch-Based Implementation of the Sure Independence Screening and Sparsifying Operator for Efficient and Interpretable Model DiscoveryCode1
CSIM: A Copula-based similarity index sensitive to local changes for Image quality assessmentCode1
Revisiting Hierarchical Text Classification: Inference and MetricsCode1
OmniSR: Shadow Removal under Direct and Indirect LightingCode1
KnobGen: Controlling the Sophistication of Artwork in Sketch-Based Diffusion ModelsCode1
PASS:Test-Time Prompting to Adapt Styles and Semantic Shapes in Medical Image SegmentationCode1
Saliency-Guided DETR for Moment Retrieval and Highlight DetectionCode1
Locret: Enhancing Eviction in Long-Context LLM Inference with Trained Retaining Heads on Consumer-Grade DevicesCode1
AgriCLIP: Adapting CLIP for Agriculture and Livestock via Domain-Specialized Cross-Model AlignmentCode1
Imaging foundation model for universal enhancement of non-ideal measurement CTCode1
LASeR: Learning to Adaptively Select Reward Models with Multi-Armed BanditsCode1
Explainable Earth Surface Forecasting under Extreme EventsCode1
Knowledge-Driven Feature Selection and Engineering for Genotype Data with Large Language ModelsCode1
HarmoniCa: Harmonizing Training and Inference for Better Feature Caching in Diffusion Transformer AccelerationCode1
Closed-Loop Long-Horizon Robotic Planning via Equilibrium Sequence ModelingCode1
Question-guided Knowledge Graph Re-scoring and Injection for Knowledge Graph Question AnsweringCode1
DeepProtein: Deep Learning Library and Benchmark for Protein Sequence LearningCode1
Were RNNs All We Needed?Code1
Knowledge Entropy Decay during Language Model Pretraining Hinders New Knowledge AcquisitionCode1
Text2PDE: Latent Diffusion Models for Accessible Physics SimulationCode1
VitaGlyph: Vitalizing Artistic Typography with Flexible Dual-branch Diffusion ModelsCode1
Positional Attention: Expressivity and Learnability of Algorithmic ComputationCode1
A versatile machine learning workflow for high-throughput analysis of supported metal catalyst particlesCode1
ANTIPASTI: interpretable prediction of antibody binding affinity exploiting Normal Modes and Deep LearningCode1
UW-GS: Distractor-Aware 3D Gaussian Splatting for Enhanced Underwater Scene ReconstructionCode1
TPP-LLM: Modeling Temporal Point Processes by Efficiently Fine-Tuning Large Language ModelsCode1
Multi-Scale Fusion for Object RepresentationCode1
Edge-preserving noise for diffusion modelsCode1
ExACT: Teaching AI Agents to Explore with Reflective-MCTS and Exploratory LearningCode1
FlexLMM: a Nextflow linear mixed model framework for GWASCode1
FactAlign: Long-form Factuality Alignment of Large Language ModelsCode1
Are Large Language Models Good Classifiers? A Study on Edit Intent Classification in Scientific Document RevisionsCode1
Integrative Decoding: Improve Factuality via Implicit Self-consistencyCode1
MONICA: Benchmarking on Long-tailed Medical Image ClassificationCode1
HarmAug: Effective Data Augmentation for Knowledge Distillation of Safety Guard ModelsCode1
High-quality Animatable Eyelid Shapes from Lightweight CapturesCode1
EMMA: Efficient Visual Alignment in Multi-Modal LLMsCode1
Integrating Visual and Textual Inputs for Searching Large-Scale Map Collections with CLIPCode1
MM-LDM: Multi-Modal Latent Diffusion Model for Sounding Video GenerationCode1
Basis Sharing: Cross-Layer Parameter Sharing for Large Language Model CompressionCode1
MedQA-CS: Benchmarking Large Language Models Clinical Skills Using an AI-SCE FrameworkCode1
Open3DTrack: Towards Open-Vocabulary 3D Multi-Object TrackingCode1
Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic DataCode1
Show:102550
← PrevPage 417 of 9486Next →