SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 70517075 of 474278 papers

TitleStatusHype
PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy ReductionCode2
Foundation Models for Remote Sensing and Earth Observation: A SurveyCode2
PAPILLON: Privacy Preservation from Internet-based and Local Language Model EnsemblesCode2
DI-MaskDINO: A Joint Object Detection and Instance Segmentation ModelCode2
xLSTM-Mixer: Multivariate Time Series Forecasting by Mixing via Scalar MemoriesCode2
MiniPLM: Knowledge Distillation for Pre-Training Language ModelsCode2
Beyond Browsing: API-Based Web AgentsCode2
Multi-IF: Benchmarking LLMs on Multi-Turn and Multilingual Instructions FollowingCode2
TIPS: Text-Image Pretraining with Spatial AwarenessCode2
Mitigating Object Hallucination via Concentric Causal AttentionCode2
Analysing the Residual Stream of Language Models Under Knowledge ConflictsCode2
CompassJudger-1: All-in-one Judge Model Helps Model Evaluation and EvolutionCode2
Compute-Constrained Data SelectionCode2
LLaVA-KD: A Framework of Distilling Multimodal Large Language ModelsCode2
RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and StyleCode2
Diffusion Transformer PolicyCode2
Reducing Hallucinations in Vision-Language Models via Latent Space SteeringCode2
CamI2V: Camera-Controlled Image-to-Video Diffusion ModelCode2
RANSAC Back to SOTA: A Two-stage Consensus Filtering for Real-time 3D RegistrationCode2
Integrating Reinforcement Learning with Foundation Models for Autonomous Robotics: Methods and PerspectivesCode2
Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation EngineeringCode2
Pantograph: A Machine-to-Machine Interaction Interface for Advanced Theorem Proving, High Level Reasoning, and Data Extraction in Lean 4Code2
A New Approach to Solving SMAC Task: Generating Decision Tree Code from Large Language ModelsCode2
3D-GANTex: 3D Face Reconstruction with StyleGAN3-based Multi-View Images and 3DDFA based Mesh GenerationCode2
Improve Vision Language Model Chain-of-thought ReasoningCode2
Show:102550
← PrevPage 283 of 18972Next →