SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 8601-8625 of 177340 papers

Title | Status | Hype
DARWIN Series: Domain Specific Large Language Models for Natural Science | Code | 2
Selective Prompt Anchoring for Code Generation | Code | 2
Learning Pattern-Specific Experts for Time Series Forecasting Under Patch-level Distribution Shift | Code | 2
Pushing Mixture of Experts to the Limit: Extremely Parameter Efficient MoE for Instruction Tuning | Code | 2
Temporal Action Localization with Enhanced Instant Discriminability | Code | 2
PyMOLfold: Interactive Protein and Ligand Structure Prediction in PyMOL | Code | 2
MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Grammatical Error Correction | Code | 2
Commands as AI Conversations | Code | 2
HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform | Code | 2
Grasp-Anything: Large-scale Grasp Dataset from Foundation Models | Code | 2
Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity | Code | 2
RMT: Retentive Networks Meet Vision Transformers | Code | 2
Detect Everything with Few Examples | Code | 2
Detecting and Grounding Multi-Modal Media Manipulation and Beyond | Code | 2
Improving CLIP Fine-tuning Performance | Code | 2
PandaGuard: Systematic Evaluation of LLM Safety against Jailbreaking Attacks | Code | 2
GenSim: Generating Robotic Simulation Tasks via Large Language Models | Code | 2
Interpreting CLIP's Image Representation via Text-Based Decomposition | Code | 2
Colmap-PCD: An Open-source Tool for Fine Image-to-point cloud Registration | Code | 2
TopoMLP: A Simple yet Strong Pipeline for Driving Topology Reasoning | Code | 2
A Semantic Invariant Robust Watermark for Large Language Models | Code | 2
Survey on Factuality in Large Language Models: Knowledge, Retrieval and Domain-Specificity | Code | 2
Im4D: High-Fidelity and Real-Time Novel View Synthesis for Dynamic Scenes | Code | 2
OmniControl: Control Any Joint at Any Time for Human Motion Generation | Code | 2
Generative Adversarial Training for Text-to-Speech Synthesis Based on Raw Phonetic Input and Explicit Prosody Modelling | Code | 2