SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 96519675 of 474278 papers

TitleStatusHype
An Item is Worth a Prompt: Versatile Image Editing with Disentangled ControlCode2
JAX-SPH: A Differentiable Smoothed Particle Hydrodynamics FrameworkCode2
LLMs in the Imaginarium: Tool Learning through Simulated Trial and ErrorCode2
Active Generalized Category DiscoveryCode2
Large Language Models are In-Context Molecule LearnersCode2
Automatic and Universal Prompt Injection Attacks against Large Language ModelsCode2
AUFormer: Vision Transformers are Parameter-Efficient Facial Action Unit DetectorsCode2
BAGS: Blur Agnostic Gaussian Splatting through Multi-Scale Kernel ModelingCode2
Online Adaptation of Language Models with a Memory of Amortized ContextsCode2
CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual ScenariosCode2
NoiseCollage: A Layout-Aware Text-to-Image Diffusion Model Based on Noise Cropping and MergingCode2
Are Language Models Puzzle Prodigies? Algorithmic Puzzles Unveil Serious Challenges in Multimodal ReasoningCode2
Extend Your Own Correspondences: Unsupervised Distant Point Cloud Registration by Progressive Distance ExtensionCode2
An L-BFGS-B approach for linear and nonlinear system identification under _1 and group-Lasso regularizationCode2
MolNexTR: A Generalized Deep Learning Model for Molecular Image RecognitionCode2
VastTrack: Vast Category Visual Object TrackingCode2
Apollo: A Lightweight Multilingual Medical LLM towards Democratizing Medical AI to 6B PeopleCode2
Learning to Decode Collaboratively with Multiple Language ModelsCode2
Mamba4Rec: Towards Efficient Sequential Recommendation with Selective State Space ModelsCode2
DPOT: Auto-Regressive Denoising Operator Transformer for Large-Scale PDE Pre-TrainingCode2
ShortGPT: Layers in Large Language Models are More Redundant Than You ExpectCode2
GPTopic: Dynamic and Interactive Topic RepresentationsCode2
Task Attribute Distance for Few-Shot Learning: Theoretical Analysis and ApplicationsCode2
Backtracing: Retrieving the Cause of the QueryCode2
Diffusion-based Generative Prior for Low-Complexity MIMO Channel EstimationCode2
Show:102550
← PrevPage 387 of 18972Next →