SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers

Showing 3051-3075 of 177340 papers

Title | Status | Hype
rLLM: Relational Table Learning with LLMs | Code | 3
WildGaussians: 3D Gaussian Splatting in the Wild | Code | 3
VISA: Reasoning Video Object Segmentation via Large Language Models | Code | 3
Scaling Retrieval-Based Language Models with a Trillion-Token Datastore | Code | 3
Compact Language Models via Pruning and Knowledge Distillation | Code | 3
PyABSA: A Modularized Framework for Reproducible Aspect-based Sentiment Analysis | Code | 3
Integer-Valued Training and Spike-Driven Inference Spiking Neural Network for High-performance and Energy-efficient Object Detection | Code | 3
MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine | Code | 3
1.5-Pints Technical Report: Pretraining in Days, Not Months -- Your Language Model Thrives on Quality Data | Code | 3
NeuFlow v2: High-Efficiency Optical Flow Estimation on Edge Devices | Code | 3
ABQ-LLM: Arbitrary-Bit Quantized Inference Acceleration for Large Language Models | Code | 3
LoopSplat: Loop Closure by Registering 3D Gaussian Splats | Code | 3
Anim-Director: A Large Multimodal Model Powered Agent for Controllable Animation Video Generation | Code | 3
AnyGraph: Graph Foundation Model in the Wild | Code | 3
LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs | Code | 3
3DGS-LM: Faster Gaussian-Splatting Optimization with Levenberg-Marquardt | Code | 3
PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions | Code | 3
TCSinger: Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control | Code | 3
Text2CAD: Generating Sequential CAD Models from Beginner-to-Expert Level Text Prompts | Code | 3
ReMEmbR: Building and Reasoning Over Long-Horizon Spatio-Temporal Memory for Robot Navigation | Code | 3
Results of the Big ANN: NeurIPS'23 competition | Code | 3
Diffusion Models are Evolutionary Algorithms | Code | 3
ControlAR: Controllable Image Generation with Autoregressive Models | Code | 3
CLoSD: Closing the Loop between Simulation and Diffusion for multi-task character control | Code | 3
DAWN: Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for Talking Head Video Generation | Code | 3