SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 31013150 of 659983 papers

TitleStatusHype
CAX: Cellular Automata Accelerated in JAXCode3
Diffusion Models are Evolutionary AlgorithmsCode3
How to Train Long-Context Language Models (Effectively)Code3
Agent Security Bench (ASB): Formalizing and Benchmarking Attacks and Defenses in LLM-based AgentsCode3
FAN: Fourier Analysis NetworksCode3
Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi DecodingCode3
OmniGenBench: Automating Large-scale in-silico Benchmarking for Genomic Foundation ModelsCode3
ImageFolder: Autoregressive Image Generation with Folded TokensCode3
Deep Learning Alternatives of the Kolmogorov Superposition TheoremCode3
MMFNet: Multi-Scale Frequency Masking Neural Network for Multivariate Time Series ForecastingCode3
SonicSim: A customizable simulation platform for speech processing in moving sound source scenariosCode3
SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing ImagesCode3
MVGS: Multi-view-regulated Gaussian Splatting for Novel View SynthesisCode3
MixLinear: Extreme Low Resource Multivariate Time Series Forecasting with 0.1K ParametersCode3
LayerKV: Optimizing Large Language Model Serving with Layer-wise KV Cache ManagementCode3
Simple and Fast Distillation of Diffusion ModelsCode3
PhysGen: Rigid-Body Physics-Grounded Image-to-Video GenerationCode3
Emu3: Next-Token Prediction is All You NeedCode3
DANA: Domain-Aware Neurosymbolic Agents for Consistency and AccuracyCode3
CycleNet: Enhancing Time Series Forecasting through Modeling Periodic PatternsCode3
Does End-to-End Autonomous Driving Really Need Perception Tasks?Code3
The Elephant in the Room: Towards A Reliable Time-Series Anomaly Detection BenchmarkCode3
Harmful Fine-tuning Attacks and Defenses for Large Language Models: A SurveyCode3
Generative Modeling of Molecular Dynamics TrajectoriesCode3
Cascade Prompt Learning for Vision-Language Model AdaptationCode3
Programming Every Example: Lifting Pre-training Data Quality like Experts at ScaleCode3
Text2CAD: Generating Sequential CAD Models from Beginner-to-Expert Level Text PromptsCode3
Degradation-Guided One-Step Image Super-Resolution with Diffusion PriorsCode3
Results of the Big ANN: NeurIPS'23 competitionCode3
TCSinger: Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style ControlCode3
Language-based Audio Moment RetrievalCode3
WeSep: A Scalable and Flexible Toolkit Towards Generalizable Target Speaker ExtractionCode3
ESPnet-Codec: Comprehensive Training and Evaluation of Neural Codecs for Audio, Music, and SpeechCode3
MCTrack: A Unified 3D Multi-Object Tracking Framework for Autonomous DrivingCode3
Addressing Emotion Bias in Music Emotion Recognition and Generation with Frechet Audio DistanceCode3
PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language InstructionsCode3
ReMEmbR: Building and Reasoning Over Long-Horizon Spatio-Temporal Memory for Robot NavigationCode3
Data Augmentation for Sequential Recommendation: A SurveyCode3
Colorful Diffuse Intrinsic Image Decomposition in the WildCode3
GTSinger: A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing TasksCode3
Fact, Fetch, and Reason: A Unified Evaluation of Retrieval-Augmented GenerationCode3
DrivingForward: Feed-forward 3D Gaussian Splatting for Driving Scene Reconstruction from Flexible Surround-view InputCode3
Oryx MLLM: On-Demand Spatial-Temporal Understanding at Arbitrary ResolutionCode3
3DGS-LM: Faster Gaussian-Splatting Optimization with Levenberg-MarquardtCode3
WiLoR: End-to-end 3D Hand Localization and Reconstruction in-the-wildCode3
SOAP: Improving and Stabilizing Shampoo using AdamCode3
CORE-Bench: Fostering the Credibility of Published Research Through a Computational Reproducibility Agent BenchmarkCode3
Deep Graph Anomaly Detection: A Survey and New PerspectivesCode3
Playground v3: Improving Text-to-Image Alignment with Deep-Fusion Large Language ModelsCode3
Towards Kinetic Manipulation of the Latent SpaceCode3
Show:102550
← PrevPage 63 of 13200Next →