SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 8476-8500 of 474278 papers

Title | Status | Hype
From Redundancy to Relevance: Information Flow in LVLMs Across Reasoning Tasks | Code | 2
Stable-Pose: Leveraging Transformers for Pose-Guided Text-to-Image Generation | Code | 2
XRec: Large Language Models for Explainable Recommendation | Code | 2
Generative Active Learning for Long-tailed Instance Segmentation | Code | 2
Block Transformer: Global-to-Local Language Modeling for Fast Inference | Code | 2
SpecExec: Massively Parallel Speculative Decoding for Interactive LLM Inference on Consumer Devices | Code | 2
Multi-Stage Speech Bandwidth Extension with Flexible Sampling Rate Control | Code | 2
ReLU-KAN: New Kolmogorov-Arnold Networks that Only Need Matrix Addition, Dot Multiplication, and ReLU | Code | 2
Extended Mind Transformers | Code | 2
A Temporal Kolmogorov-Arnold Transformer for Time Series Forecasting | Code | 2
GrootVL: Tree Topology is All You Need in State Space Model | Code | 2
ProGEO: Generating Prompts through Image-Text Contrastive Learning for Visual Geo-localization | Code | 2
Demystifying the Compression of Mixture-of-Experts Through a Unified Framework | Code | 2
Improved Few-Shot Jailbreaking Can Circumvent Aligned Language Models and Their Defenses | Code | 2
CodeR: Issue Resolving with Multi-Agent and Task Graphs | Code | 2
FreeTumor: Advance Tumor Segmentation via Large-Scale Tumor Synthesis | Code | 2
Poisoning Attacks and Defenses in Recommender Systems: A Survey | Code | 2
Two Tales of Persona in LLMs: A Survey of Role-Playing and Personalization | Code | 2
TabPedia: Towards Comprehensive Visual Table Understanding with Concept Synergy | Code | 2
EduNLP: Towards a Unified and Modularized Library for Educational Resources | Code | 2
Dragonfly: Multi-Resolution Zoom-In Encoding Enhances Vision-Language Models | Code | 2
Long and Short Guidance in Score identity Distillation for One-Step Text-to-Image Generation | Code | 2
Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow | Code | 2
Generative Pre-trained Speech Language Model with Efficient Hierarchical Transformer | Code | 2
TCMBench: A Comprehensive Benchmark for Evaluating Large Language Models in Traditional Chinese Medicine | Code | 2
Page 340 of 18972