SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 1005110100 of 177340 papers

TitleStatusHype
Isaac Gym: High Performance GPU-Based Physics Simulation For Robot LearningCode2
LibriSpeech-PC: Benchmark for Evaluation of Punctuation and Capitalization Capabilities of end-to-end ASR ModelsCode2
SiriuS: Self-improving Multi-agent Systems via Bootstrapped ReasoningCode2
Harder Tasks Need More Experts: Dynamic Routing in MoE ModelsCode2
SAM4MLLM: Enhance Multi-Modal Large Language Model for Referring Expression SegmentationCode2
Towards Language Models That Can See: Computer Vision Through the LENS of Natural LanguageCode2
PDF-WuKong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse SamplingCode2
Adaptive Keyframe Sampling for Long Video UnderstandingCode2
A Survey of Deep Learning for Mathematical ReasoningCode2
SEINE: Short-to-Long Video Diffusion Model for Generative Transition and PredictionCode2
Foundation Models for Spatio-Temporal Data Science: A Tutorial and SurveyCode2
ByteTransformer: A High-Performance Transformer Boosted for Variable-Length InputsCode2
DDPM-CD: Denoising Diffusion Probabilistic Models as Feature Extractors for Change DetectionCode2
Data Science with LLMs and Interpretable ModelsCode2
Chat-3D: Data-efficiently Tuning Large Language Model for Universal Dialogue of 3D ScenesCode2
Speech Foundation Model Ensembles for the Controlled Singing Voice Deepfake Detection (CtrSVDD) Challenge 2024Code2
Preventing Local Pitfalls in Vector Quantization via Optimal TransportCode2
PersFormer: 3D Lane Detection via Perspective Transformer and the OpenLane BenchmarkCode2
A Survey of Financial AI: Architectures, Advances and Open ChallengesCode2
IMKGA-SM: Interpretable Multimodal Knowledge Graph Answer Prediction via Sequence ModelingCode2
Habitat: A Platform for Embodied AI ResearchCode2
Masked Siamese Networks for Label-Efficient LearningCode2
Scaling Language Models: Methods, Analysis & Insights from Training GopherCode2
Mamba-ST: State Space Model for Efficient Style TransferCode2
recommenderlab: An R Framework for Developing and Testing Recommendation AlgorithmsCode2
GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning AbilitiesCode2
LayerTracer: Cognitive-Aligned Layered SVG Synthesis via Diffusion TransformerCode2
RET-CLIP: A Retinal Image Foundation Model Pre-trained with Clinical Diagnostic ReportsCode2
MaGGIe: Masked Guided Gradual Human Instance MattingCode2
GAOKAO-MM: A Chinese Human-Level Benchmark for Multimodal Models EvaluationCode2
Phi-4 Technical ReportCode2
3D UX-Net: A Large Kernel Volumetric ConvNet Modernizing Hierarchical Transformer for Medical Image SegmentationCode2
PubTables-1M: Towards comprehensive table extraction from unstructured documentsCode2
CoqPilot, a plugin for LLM-based generation of proofsCode2
Formalizing and Benchmarking Prompt Injection Attacks and DefensesCode2
AI-Newton: A Concept-Driven Physical Law Discovery System without Prior Physical KnowledgeCode2
Wind Noise Reduction with a Diffusion-based Stochastic Regeneration ModelCode2
DeeperHistReg: Robust Whole Slide Images Registration FrameworkCode2
Not All Tokens Are Equal: Human-centric Visual Analysis via Token Clustering TransformerCode2
Common Diffusion Noise Schedules and Sample Steps are FlawedCode2
Multi-Target XGBoostLSS RegressionCode2
Recent advances in the Self-Referencing Embedding Strings (SELFIES) libraryCode2
RETVec: Resilient and Efficient Text VectorizerCode2
Document Expansion by Query PredictionCode2
Benchmarking Synthetic Tabular Data: A Multi-Dimensional Evaluation FrameworkCode2
EdgeGaussians -- 3D Edge Mapping via Gaussian SplattingCode2
OR-LLM-Agent: Automating Modeling and Solving of Operations Research Optimization Problem with Reasoning Large Language ModelCode2
RobustNeRF: Ignoring Distractors with Robust LossesCode2
Building Normalizing Flows with Stochastic InterpolantsCode2
Efficient World Models with Context-Aware TokenizationCode2
Show:102550
← PrevPage 202 of 3547Next →