SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 24512500 of 177339 papers

TitleStatusHype
NeuMan: Neural Human Radiance Field from a Single VideoCode3
Agent KB: Leveraging Cross-Domain Experience for Agentic Problem SolvingCode3
One Transformer Fits All Distributions in Multi-Modal Diffusion at ScaleCode3
Spikformer V2: Join the High Accuracy Club on ImageNet with an SNN TicketCode3
Exploring Progress in Multivariate Time Series Forecasting: Comprehensive Benchmarking and Heterogeneity AnalysisCode3
Bridging Language and Items for Retrieval and RecommendationCode3
Graph Retrieval-Augmented Generation: A SurveyCode3
GS2Mesh: Surface Reconstruction from Gaussian Splatting via Novel Stereo ViewsCode3
ElasTST: Towards Robust Varied-Horizon Forecasting with Elastic Time-Series TransformerCode3
FruitNeRF: A Unified Neural Radiance Field based Fruit Counting FrameworkCode3
HuatuoGPT, towards Taming Language Model to Be a DoctorCode3
Improving Transformers with Dynamically Composable Multi-Head AttentionCode3
NeRF: Representing Scenes as Neural Radiance Fields for View SynthesisCode3
GARField: Group Anything with Radiance FieldsCode3
Do We Need Anisotropic Graph Neural Networks?Code3
Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language ModelsCode3
MixLoRA: Enhancing Large Language Models Fine-Tuning with LoRA-based Mixture of ExpertsCode3
WiLoR: End-to-end 3D Hand Localization and Reconstruction in-the-wildCode3
T^3Bench: Benchmarking Current Progress in Text-to-3D GenerationCode3
Arctic Long Sequence Training: Scalable And Efficient Training For Multi-Million Token SequencesCode3
Human-like Episodic Memory for Infinite Context LLMsCode3
OctFusion: Octree-based Diffusion Models for 3D Shape GenerationCode3
OneBit: Towards Extremely Low-bit Large Language ModelsCode3
MoMask: Generative Masked Modeling of 3D Human MotionsCode3
Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language ModelsCode3
SlimPajama-DC: Understanding Data Combinations for LLM TrainingCode3
Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMsCode3
Splatter Image: Ultra-Fast Single-View 3D ReconstructionCode3
MatterGen: a generative model for inorganic materials designCode3
MobileVLM : A Fast, Strong and Open Vision Language Assistant for Mobile DevicesCode3
Universal Instance Perception as Object Discovery and RetrievalCode3
EfficientNet: Rethinking Model Scaling for Convolutional Neural NetworksCode3
Beat this! Accurate beat tracking without DBN postprocessingCode3
BEVDet4D: Exploit Temporal Cues in Multi-camera 3D Object DetectionCode3
Parametric Retrieval Augmented GenerationCode3
LLMs Get Lost In Multi-Turn ConversationCode3
ORLM: A Customizable Framework in Training Large Models for Automated Optimization ModelingCode3
MARIO: MAth Reasoning with code Interpreter Output -- A Reproducible PipelineCode3
ViNT: A Foundation Model for Visual NavigationCode3
The Prusti project: Formal verification for RustCode3
UniMatch V2: Pushing the Limit of Semi-Supervised Semantic SegmentationCode3
RAKG:Document-level Retrieval Augmented Knowledge Graph ConstructionCode3
ROCKET: Exceptionally fast and accurate time series classification using random convolutional kernelsCode3
AnyTool: Self-Reflective, Hierarchical Agents for Large-Scale API CallsCode3
RecurrentGPT: Interactive Generation of (Arbitrarily) Long TextCode3
Punica: Multi-Tenant LoRA ServingCode3
ImageReward: Learning and Evaluating Human Preferences for Text-to-Image GenerationCode3
RepViT-SAM: Towards Real-Time Segmenting AnythingCode3
PokeLLMon: A Human-Parity Agent for Pokemon Battles with Large Language ModelsCode3
Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient InferenceCode3
Show:102550
← PrevPage 50 of 3547Next →