SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1960119650 of 474278 papers

TitleStatusHype
LEDRO: LLM-Enhanced Design Space Reduction and Optimization for Analog CircuitsCode1
PR-ENDO: Physically Based Relightable Gaussian Splatting for EndoscopyCode1
Diffusion-Inspired Cold Start with Sufficient Prior in Computerized Adaptive TestingCode1
Translating Electrocardiograms to Cardiac Magnetic Resonance Imaging Useful for Cardiac Assessment and Disease Screening: A Multi-Center Study AI for ECG to CMR Translation StudyCode1
DLBacktrace: A Model Agnostic Explainability for any Deep Learning ModelsCode1
PyAWD: A Library for Generating Large Synthetic Datasets of Acoustic Wave Propagation with DevitoCode1
Evaluating the Prompt Steerability of Large Language ModelsCode1
Stylecodes: Encoding Stylistic Information For Image GenerationCode1
SG-LRA: Self-Generating Automatic Scoliosis Cobb Angle Measurement with Low-Rank ApproximationCode1
Gradient-Weighted Feature Back-Projection: A Fast Alternative to Feature Distillation in 3D Gaussian SplattingCode1
A Survey of Medical Vision-and-Language Applications and Their TechniquesCode1
Signformer is all you need: Towards Edge AI for Sign LanguageCode1
UrbanDiT: A Foundation Model for Open-World Urban Spatio-Temporal LearningCode1
libcll: an Extendable Python Toolkit for Complementary-Label LearningCode1
ProSec: Fortifying Code LLMs with Proactive Security AlignmentCode1
Harnessing Scale and Physics: A Multi-Graph Neural Operator Framework for PDEs on Arbitrary GeometriesCode1
CCExpert: Advancing MLLM Capability in Remote Sensing Change Captioning with Difference-Aware Integration and a Foundational DatasetCode1
The Sound of Water: Inferring Physical Properties from Pouring LiquidsCode1
Introducing Milabench: Benchmarking Accelerators for AICode1
Towards Open-Vocabulary Audio-Visual Event LocalizationCode1
Stacking Brick by Brick: Aligned Feature Isolation for Incremental Face Forgery DetectionCode1
TSPRank: Bridging Pairwise and Listwise Methods with a Bilinear Travelling Salesman ModelCode1
TimeFormer: Capturing Temporal Relationships of Deformable 3D Gaussians for Robust ReconstructionCode1
CROW: Eliminating Backdoors from Large Language Models via Internal Consistency RegularizationCode1
Edge-Enhanced Dilated Residual Attention Network for Multimodal Medical Image FusionCode1
PerfCodeGen: Improving Performance of LLM Generated Code with Execution FeedbackCode1
Continuous Speculative Decoding for Autoregressive Image GenerationCode1
Equivariant spatio-hemispherical networks for diffusion MRI deconvolutionCode1
Improved GUI Grounding via Iterative NarrowingCode1
Generalizable Person Re-identification via Balancing Alignment and UniformityCode1
Unveiling Redundancy in Diffusion Transformers (DiTs): A Systematic StudyCode1
Aligning Few-Step Diffusion Models with Dense Reward Difference LearningCode1
Graph Neural Networks for Quantifying Compatibility Mechanisms in Traditional Chinese MedicineCode1
HistoEncoder: a digital pathology foundation model for prostate cancerCode1
TSINR: Capturing Temporal Continuity via Implicit Neural Representations for Time Series Anomaly DetectionCode1
FLAME: Frozen Large Language Models Enable Data-Efficient Language-Image Pre-trainingCode1
Temporal and Spatial Reservoir Ensembling Techniques for Liquid State MachinesCode1
Leveraging MLLM Embeddings and Attribute Smoothing for Compositional Zero-Shot LearningCode1
ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural EnhancementsCode1
Exploiting VLM Localizability and Semantics for Open Vocabulary Action DetectionCode1
Constrained Diffusion with Trust SamplingCode1
VidComposition: Can MLLMs Analyze Compositions in Compiled Videos?Code1
TS-LLaVA: Constructing Visual Tokens through Thumbnail-and-Sampling for Training-Free Video Large Language ModelsCode1
SRA-MCTS: Self-driven Reasoning Augmentation with Monte Carlo Tree Search for Code GenerationCode1
Multilingual Large Language Models: A Systematic SurveyCode1
PickScan: Object discovery and reconstruction from handheld interactionsCode1
SymDPO: Boosting In-Context Learning of Large Multimodal Models with Symbol Demonstration Direct Preference OptimizationCode1
BackdoorMBTI: A Backdoor Learning Multimodal Benchmark Tool Kit for Backdoor Defense EvaluationCode1
AIGS: Generating Science from AI-Powered Automated FalsificationCode1
Multi-Stage Vision Token Dropping: Towards Efficient Multimodal Large Language ModelCode1
Show:102550
← PrevPage 393 of 9486Next →