SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 76517675 of 474278 papers

TitleStatusHype
CustomCrafter: Customized Video Generation with Preserving Motion and Concept Composition AbilitiesCode2
Image Segmentation in Foundation Model Era: A SurveyCode2
Towards Evaluating and Building Versatile Large Language Models for MedicineCode2
Scalable Autoregressive Image Generation with MambaCode2
MuMA-ToM: Multi-modal Multi-Agent Theory of MindCode2
ConflictBank: A Benchmark for Evaluating the Influence of Knowledge Conflicts in LLMCode2
UMERegRobust -- Universal Manifold Embedding Compatible Features for Robust Point Cloud RegistrationCode2
UNetMamba: An Efficient UNet-Like Mamba for Semantic Segmentation of High-Resolution Remote Sensing ImagesCode2
Critique-out-Loud Reward ModelsCode2
RaNDT SLAM: Radar SLAM Based on Intensity-Augmented Normal Distributions TransformCode2
HMT-UNet: A hybird Mamba-Transformer Vision UNet for Medical Image SegmentationCode2
biorecap: an R package for summarizing bioRxiv preprints with a local LLMCode2
BearLLM: A Prior Knowledge-Enhanced Bearing Health Management Framework with Unified Vibration Signal RepresentationCode2
Pano2Room: Novel View Synthesis from a Single Indoor PanoramaCode2
KAN4TSF: Are KAN and KAN-based models Effective for Time Series Forecasting?Code2
VE-Bench: Subjective-Aligned Benchmark Suite for Text-Driven Video Editing Quality AssessmentCode2
Personality Alignment of Large Language ModelsCode2
PRformer: Pyramidal Recurrent Transformer for Multivariate Time Series ForecastingCode2
BEYOND DIALOGUE: A Profile-Dialogue Alignment Framework Towards General Role-Playing Language ModelCode2
MegaFusion: Extend Diffusion Models towards Higher-resolution Image Generation without Further TuningCode2
MagicDec: Breaking the Latency-Throughput Tradeoff for Long Context Generation with Speculative DecodingCode2
deepmriprep: Voxel-based Morphometry (VBM) Preprocessing via Deep Neural NetworksCode2
PerturBench: Benchmarking Machine Learning Models for Cellular Perturbation AnalysisCode2
FLAME: Learning to Navigate with Multimodal LLM in Urban EnvironmentsCode2
ConFIG: Towards Conflict-free Training of Physics Informed Neural NetworksCode2
Show:102550
← PrevPage 307 of 18972Next →