SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 33513375 of 661570 papers

TitleStatusHype
What We Talk About When We Talk About LMs: Implicit Paradigm Shifts and the Ship of Language ModelsCode3
Retrieval-augmented generation in multilingual settingsCode3
BERGEN: A Benchmarking Library for Retrieval-Augmented GenerationCode3
StyleShot: A Snapshot on Any StyleCode3
xLSTM-UNet can be an Effective 2D & 3D Medical Image Segmentation Backbone with Vision-LSTM (ViL) better than its Mamba CounterpartCode3
Evaluation of Text-to-Video Generation Models: A Dynamics PerspectiveCode3
Searching for Best Practices in Retrieval-Augmented GenerationCode3
Tree Search for Language Model AgentsCode3
CRAB: Cross-environment Agent Benchmark for Multimodal Language Model AgentsCode3
Instruct-IPT: All-in-One Image Processing Transformer via Weight ModulationCode3
Deep Frequency Derivative Learning for Non-stationary Time Series ForecastingCode3
SpotlessSplats: Ignoring Distractors in 3D Gaussian SplattingCode3
EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything ModelCode3
Segment Anything without SupervisionCode3
LLaRA: Supercharging Robot Learning Data for Vision-Language PolicyCode3
HuatuoGPT-Vision, Towards Injecting Medical Visual Knowledge into Multimodal LLMs at ScaleCode3
Diffusion Model-Based Video Editing: A SurveyCode3
A Survey on Mixture of ExpertsCode3
Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMsCode3
AlphaForge: A Framework to Mine and Dynamically Combine Formulaic Alpha FactorsCode3
A Review of Large Language Models and Autonomous Agents in ChemistryCode3
Point-SAM: Promptable 3D Segmentation Model for Point CloudsCode3
Director3D: Real-world Camera Trajectory and 3D Scene Generation from TextCode3
Adam-mini: Use Fewer Learning Rates To Gain MoreCode3
Lossless data compression by large modelsCode3
Show:102550
← PrevPage 135 of 26463Next →