SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 22512300 of 177339 papers

TitleStatusHype
LLM Inference Unveiled: Survey and Roofline Model InsightsCode4
Multimodal Whole Slide Foundation Model for PathologyCode4
TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorchCode4
Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with NothingCode4
MonSter: Marry Monodepth to Stereo Unleashes PowerCode4
Large Models for Time Series and Spatio-Temporal Data: A Survey and OutlookCode4
Cost-Effective Hyperparameter Optimization for Large Language Model Generation InferenceCode4
Efficient Post-training Quantization with FP8 FormatsCode4
Enabling more efficient and cost-effective AI/ML systems with Collective Mind, virtualized MLOps, MLPerf, Collective Knowledge Playground and reproducible optimization tournamentsCode4
Transformers in Time Series: A SurveyCode4
RaTEScore: A Metric for Radiology Report GenerationCode4
ZipVoice-Dialog: Non-Autoregressive Spoken Dialogue Generation with Flow MatchingCode4
Atom of Thoughts for Markov LLM Test-Time ScalingCode4
Mixtral of ExpertsCode4
ConRFT: A Reinforced Fine-tuning Method for VLA Models via Consistency PolicyCode3
KwaiAgents: Generalized Information-seeking Agent System with Large Language ModelsCode3
FlexRAG: A Flexible and Comprehensive Framework for Retrieval-Augmented GenerationCode3
How Far Are We From AGI: Are LLMs All We Need?Code3
Make-Your-Anchor: A Diffusion-based 2D Avatar Generation FrameworkCode3
Controllable Text-to-3D Generation via Surface-Aligned Gaussian SplattingCode3
TKAN: Temporal Kolmogorov-Arnold NetworksCode3
How Abilities in Large Language Models are Affected by Supervised Fine-tuning Data CompositionCode3
What Matters When Repurposing Diffusion Models for General Dense Perception Tasks?Code3
HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and GenerationCode3
AutoAugment: Learning Augmentation Policies from DataCode3
Attention Heads of Large Language Models: A SurveyCode3
Toward Generalist Anomaly Detection via In-context Residual Learning with Few-shot Sample PromptsCode3
Time-series Transformer Generative Adversarial NetworksCode3
Denoising Vision TransformersCode3
High-Resolution Image Reconstruction With Latent Diffusion Models From Human Brain ActivityCode3
CatV2TON: Taming Diffusion Transformers for Vision-Based Virtual Try-On with Temporal ConcatenationCode3
SOAP: Improving and Stabilizing Shampoo using AdamCode3
Spike-driven Transformer V2: Meta Spiking Neural Network Architecture Inspiring the Design of Next-generation Neuromorphic ChipsCode3
M+: Extending MemoryLLM with Scalable Long-Term MemoryCode3
MDocAgent: A Multi-Modal Multi-Agent Framework for Document UnderstandingCode3
MiniViT: Compressing Vision Transformers with Weight MultiplexingCode3
SPMamba: State-space model is all you need in speech separationCode3
Lighthouse: A User-Friendly Library for Reproducible Video Moment Retrieval and Highlight DetectionCode3
Vision as LoRACode3
Deep Limit Order Book ForecastingCode3
Video Mamba Suite: State Space Model as a Versatile Alternative for Video UnderstandingCode3
ResShift: Efficient Diffusion Model for Image Super-resolution by Residual ShiftingCode3
EfficientFormer: Vision Transformers at MobileNet SpeedCode3
Demystify Mamba in Vision: A Linear Attention PerspectiveCode3
Visual Large Language Models for Generalized and Specialized ApplicationsCode3
Order Matters: Sequence to sequence for setsCode3
MotionBERT: A Unified Perspective on Learning Human Motion RepresentationsCode3
SegMAN: Omni-scale Context Modeling with State Space Models and Local Attention for Semantic SegmentationCode3
Large Language Models as Tool MakersCode3
Meta-Chunking: Learning Text Segmentation and Semantic Completion via Logical PerceptionCode3
Show:102550
← PrevPage 46 of 3547Next →