SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 78517875 of 474278 papers

TitleStatusHype
Language Arithmetics: Towards Systematic Language Neuron Identification and ManipulationCode0
SE-Agent: Self-Evolution Trajectory Optimization in Multi-Step Reasoning with LLM-Based AgentsCode0
Rethinking LLM Human Simulation: When a Graph is What You NeedCode0
UniLION: Towards Unified Autonomous Driving Model with Linear Group RNNsCode0
Black-Box Membership Inference Attack for LVLMs via Prior Knowledge-Calibrated Memory ProbingCode0
SEPS: Semantic-enhanced Patch Slimming Framework for fine-grained cross-modal alignmentCode0
NSYNC: Negative Synthetic Image Generation for Contrastive Training to Improve Stylized Text-To-Image TranslationCode0
Driving scenario generation and evaluation using a structured layer representation and foundational modelsCode0
HADSF: Aspect Aware Semantic Control for Explainable RecommendationCode0
Perturb a Model, Not an Image: Towards Robust Privacy Protection via Anti-Personalized Diffusion ModelsCode0
ChartAB: A Benchmark for Chart Grounding & Dense Alignment0
ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation0
Vision Foundation Models Can Be Good Tokenizers for Latent Diffusion Models0
Enhancing Time Awareness in Generative RecommendationCode0
MicroAUNet: Boundary-Enhanced Multi-scale Fusion with Knowledge Distillation for Colonoscopy Polyp Image SegmentationCode0
Web-Scale Collection of Video Data for 4D Animal ReconstructionCode0
MVSMamba: Multi-View Stereo with State Space ModelCode0
DepthVanish: Optimizing Adversarial Interval Structures for Stereo-Depth-Invisible PatchesCode0
FlexQ: Efficient Post-training INT6 Quantization for LLM Serving via Algorithm-System Co-DesignCode0
MicroRemed: Benchmarking LLMs in Microservices RemediationCode0
FEval-TTC: Fair Evaluation Protocol for Test-Time ComputeCode0
Detecting Generated Images by Fitting Natural Image DistributionsCode0
Pragmatic Heterogeneous Collaborative Perception via Generative Communication MechanismCode0
Reflectance Prediction-based Knowledge Distillation for Robust 3D Object Detection in Compressed Point CloudsCode0
Video Models Start to Solve Chess, Maze, Sudoku, Mental Rotation, and Raven' MatricesCode0
Show:102550
← PrevPage 315 of 18972Next →