SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 98269850 of 474278 papers

TitleStatusHype
Foundation Policies with Hilbert RepresentationsCode2
ChunkAttention: Efficient Self-Attention with Prefix-Aware KV Cache and Two-Phase PartitionCode2
Gen4Gen: Generative Data Pipeline for Generative Multi-Concept CompositionCode2
GraphEdit: Large Language Models for Graph Structure LearningCode2
Grasp, See, and Place: Efficient Unknown Object Rearrangement with Policy Structure PriorCode2
MACRec: a Multi-Agent Collaboration Framework for RecommendationCode2
RoboEXP: Action-Conditioned Scene Graph via Interactive Exploration for Robotic ManipulationCode2
Morphological Symmetries in RoboticsCode2
Machine Unlearning of Pre-trained Large Language ModelsCode2
EMIFF: Enhanced Multi-scale Image Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object DetectionCode2
An Empirical Study of Data Ability Boundary in LLMs' Math ReasoningCode2
Fast Adversarial Attacks on Language Models In One GPU MinuteCode2
The Good and The Bad: Exploring Privacy Issues in Retrieval-Augmented Generation (RAG)Code2
EasyRL4Rec: An Easy-to-use Library for Reinforcement Learning Based Recommender SystemsCode2
Measuring Multimodal Mathematical Reasoning with MATH-Vision DatasetCode2
HyperFast: Instant Classification for Tabular DataCode2
MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn DialoguesCode2
WeakSAM: Segment Anything Meets Weakly-supervised Instance-level RecognitionCode2
HINT: High-quality INPainting Transformer with Mask-Aware Encoding and Enhanced AttentionCode2
Less is More: Mitigating Multimodal Hallucination from an EOS Decision PerspectiveCode2
Subobject-level Image TokenizationCode2
GeneOH Diffusion: Towards Generalizable Hand-Object Interaction Denoising via Denoising DiffusionCode2
PALO: A Polyglot Large Multimodal Model for 5B PeopleCode2
Data Science with LLMs and Interpretable ModelsCode2
Stable Neural Stochastic Differential Equations in Analyzing Irregular Time Series DataCode2
Show:102550
← PrevPage 394 of 18972Next →