SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1790117950 of 474278 papers

TitleStatusHype
FOCUS - Multi-View Foot Reconstruction From Synthetically Trained Dense CorrespondencesCode1
HODDI: A Dataset of High-Order Drug-Drug Interactions for Computational PharmacovigilanceCode1
evclust: Python library for evidential clusteringCode1
Calibrating LLMs with Information-Theoretic Evidential Deep LearningCode1
Leveraging Allophony in Self-Supervised Speech Models for Atypical Pronunciation AssessmentCode1
LawGPT: Knowledge-Guided Data Generation and Its Application to Legal LLMCode1
Self-Correcting Decoding with Generative Feedback for Mitigating Hallucinations in Large Vision-Language ModelsCode1
The Case for Cleaner Biosignals: High-fidelity Neural Compressor Enables Transfer from Cleaner iEEG to Noisier EEGCode1
RALLRec: Improving Retrieval Augmented Large Language Model Recommendation with Representation LearningCode1
Krutrim LLM: Multilingual Foundational Model for over a Billion PeopleCode1
From Pixels to Components: Eigenvector Masking for Visual Representation LearningCode1
Benchmarking Vision-Language Models on Optical Character Recognition in Dynamic Video EnvironmentsCode1
Prompt-SID: Learning Structural Representation Prompt via Latent Diffusion for Single-Image DenoisingCode1
When Data Manipulation Meets Attack Goals: An In-depth Survey of Attacks for VLMsCode1
UniZyme: A Unified Protein Cleavage Site Predictor Enhanced with Enzyme Active-Site KnowledgeCode1
Retrieving Filter Spectra in CNN for Explainable Sleep Stage ClassificationCode1
SAVE: Self-Attention on Visual Embedding for Zero-Shot Generic Object CountingCode1
A Data-Efficient Pan-Tumor Foundation Model for Oncology CT InterpretationCode1
CHIRLA: Comprehensive High-resolution Identification and Re-identification for Large-scale AnalysisCode1
WyckoffDiff -- A Generative Diffusion Model for Crystal SymmetryCode1
ProjectTest: A Project-level LLM Unit Test Generation Benchmark and Impact of Error Fixing MechanismsCode1
Lightweight Dataset Pruning without Full Training via Example Difficulty and Prediction UncertaintyCode1
Conditional diffusion model with spatial attention and latent embedding for medical image segmentationCode1
A Simple yet Effective DDG Predictor is An Unsupervised Antibody Optimizer and ExplainerCode1
Implicit Language Models are RNNs: Balancing Parallelization and ExpressivityCode1
Foundation Model of Electronic Medical Records for Adaptive Risk EstimationCode1
DGenNO: A Novel Physics-aware Neural Operator for Solving Forward and Inverse PDE Problems based on Deep, Generative Probabilistic ModelingCode1
RelGNN: Composite Message Passing for Relational Deep LearningCode1
LANTERN++: Enhancing Relaxed Speculative Decoding with Static Tree Drafting for Visual Auto-regressive ModelsCode1
Habitizing Diffusion Planning for Efficient and Effective Decision MakingCode1
Combining Large Language Models with Static Analyzers for Code Review GenerationCode1
Geometry-aware RL for Manipulation of Varying Shapes and Deformable ObjectsCode1
AIMS.au: A Dataset for the Analysis of Modern Slavery Countermeasures in Corporate StatementsCode1
Large Language Models Meet Symbolic Provers for Logical Reasoning EvaluationCode1
Jakiro: Boosting Speculative Decoding with Decoupled Multi-Head via MoECode1
Learning Clustering-based Prototypes for Compositional Zero-shot LearningCode1
MMGDreamer: Mixed-Modality Graph for Geometry-Controllable 3D Indoor Scene GenerationCode1
DexVLA: Vision-Language Model with Plug-In Diffusion Expert for General Robot ControlCode1
MERGE^3: Efficient Evolutionary Merging on Consumer-grade GPUsCode1
LM2: Large Memory ModelsCode1
UniDB: A Unified Diffusion Bridge Framework via Stochastic Optimal ControlCode1
Reinforced Lifelong Editing for Language ModelsCode1
Preventing Rogue Agents Improves Multi-Agent CollaborationCode1
Training Language Models for Social Deduction with Multi-Agent Reinforcement LearningCode1
Audio-Visual Representation Learning via Knowledge Distillation from Speech Foundation ModelsCode1
Semantic Role Labeling: A Systematical SurveyCode1
Beyond Fine-Tuning: A Systematic Study of Sampling Techniques in Personalized Image GenerationCode1
Known Unknowns: Out-of-Distribution Property Prediction in Materials and MoleculesCode1
DiTASK: Multi-Task Fine-Tuning with Diffeomorphic TransformationsCode1
Injecting Universal Jailbreak Backdoors into LLMs in MinutesCode1
Show:102550
← PrevPage 359 of 9486Next →