SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 9676-9700 of 474278 papers

Title | Status | Hype
MeaCap: Memory-Augmented Zero-shot Image Captioning | Code | 2
What do we learn from inverting CLIP models? | Code | 2
FinReport: Explainable Stock Earnings Forecasting via News Factor Analyzing Model | Code | 2
Semantic Human Mesh Reconstruction with Textures | Code | 2
InjecAgent: Benchmarking Indirect Prompt Injections in Tool-Integrated Large Language Model Agents | Code | 2
Learning without Exact Guidance: Updating Large-scale High-resolution Land Cover Maps from Low-resolution Historical Labels | Code | 2
Interactive Continual Learning: Fast and Slow Thinking | Code | 2
Android in the Zoo: Chain-of-Action-Thought for GUI Agents | Code | 2
PPFlow: Target-aware Peptide Design with Torsional Flow Matching | Code | 2
TESTAM: A Time-Enhanced Spatio-Temporal Attention Model with Mixture of Experts | Code | 2
MADTP: Multimodal Alignment-Guided Dynamic Token Pruning for Accelerating Vision-Language Transformer | Code | 2
ESM All-Atom: Multi-scale Protein Language Model for Unified Molecular Modeling | Code | 2
Towards Measuring and Modeling "Culture" in LLMs: A Survey | Code | 2
Multi-perspective Improvement of Knowledge Graph Completion with Large Language Models | Code | 2
Exposing the Deception: Uncovering More Forgery Clues for Deepfake Detection | Code | 2
Learning to Solve Job Shop Scheduling under Uncertainty | Code | 2
Large language models surpass human experts in predicting neuroscience results | Code | 2
MiM-ISTD: Mamba-in-Mamba for Efficient Infrared Small Target Detection | Code | 2
xT: Nested Tokenization for Larger Context in Large Images | Code | 2
VTG-GPT: Tuning-Free Zero-Shot Video Temporal Grounding with GPT | Code | 2
A Simple Baseline for Efficient Hand Mesh Reconstruction | Code | 2
Applied Causal Inference Powered by ML and AI | Code | 2
REAL-Colon: A dataset for developing real-world AI applications in colonoscopy | Code | 2
One Prompt Word is Enough to Boost Adversarial Robustness for Pre-trained Vision-Language Models | Code | 2
Multi-Spectral Remote Sensing Image Retrieval Using Geospatial Foundation Models | Code | 2