SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 15511600 of 659983 papers

TitleStatusHype
RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented GenerationCode4
CraftsMan3D: High-fidelity Mesh Generation with 3D Native Generation and Interactive Geometry RefinerCode4
UniTok: A Unified Tokenizer for Visual Generation and UnderstandingCode4
LangCell: Language-Cell Pre-training for Cell Identity UnderstandingCode4
RAPIDFlow: Recurrent Adaptable Pyramids with Iterative Decoding for Efficient Optical Flow EstimationCode4
Kwai Keye-VL Technical ReportCode4
Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMsCode4
Towards One-shot Federated Learning: Advances, Challenges, and Future DirectionsCode4
s3: You Don't Need That Much Data to Train a Search Agent via RLCode4
lmgame-Bench: How Good are LLMs at Playing Games?Code4
OpenS2V-Nexus: A Detailed Benchmark and Million-Scale Dataset for Subject-to-Video GenerationCode4
DemoFusion: Democratising High-Resolution Image Generation With No $Code4
Look Once to Hear: Target Speech Hearing with Noisy ExamplesCode4
The All-Seeing Project V2: Towards General Relation Comprehension of the Open WorldCode4
APIGen-MT: Agentic Pipeline for Multi-Turn Data Generation via Simulated Agent-Human InterplayCode4
Eureka: Human-Level Reward Design via Coding Large Language ModelsCode4
High Fidelity Neural Audio CompressionCode4
MIGC++: Advanced Multi-Instance Generation Controller for Image SynthesisCode4
Qiskit Machine Learning: an open-source library for quantum machine learning tasks at scale on quantum hardware and classical simulatorsCode4
StudioGAN: A Taxonomy and Benchmark of GANs for Image SynthesisCode4
CoTracker: It is Better to Track TogetherCode4
PromptSource: An Integrated Development Environment and Repository for Natural Language PromptsCode4
Leveraging Speculative Sampling and KV-Cache Optimizations Together for Generative AI using OpenVINOCode4
Context-Aware Drift DetectionCode4
A Lightweight Instrument-Agnostic Model for Polyphonic Note Transcription and Multipitch EstimationCode4
Detectron2 Object Detection & Manipulating Images using CartoonizationCode4
GCoNet+: A Stronger Group Collaborative Co-Salient Object DetectorCode4
RLlib: Abstractions for Distributed Reinforcement LearningCode4
Vision + Language Applications: A SurveyCode4
A Survey on Large Language Model-Based Game AgentsCode4
Z-Code++: A Pre-trained Language Model Optimized for Abstractive SummarizationCode4
R^3LIVE++: A Robust, Real-time, Radiance reconstruction package with a tightly-coupled LiDAR-Inertial-Visual state EstimatorCode4
Recent Advances in RecBole: Extensions with more Practical ConsiderationsCode4
Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video GenerationCode4
Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Vision-Language ModelsCode4
AudioLDM: Text-to-Audio Generation with Latent Diffusion ModelsCode4
LORE: Lagrangian-Optimized Robust Embeddings for Visual EncodersCode4
Transcoders Beat Sparse Autoencoders for InterpretabilityCode4
Generalized Neighborhood Attention: Multi-dimensional Sparse Attention at the Speed of LightCode4
Memory-aided Contrastive Consensus Learning for Co-salient Object DetectionCode4
ChatDoctor: A Medical Chat Model Fine-Tuned on a Large Language Model Meta-AI (LLaMA) Using Medical Domain KnowledgeCode4
A Survey on Large Language Models for RecommendationCode4
Segment Anything in Medical ImagesCode4
mPLUG-Owl: Modularization Empowers Large Language Models with MultimodalityCode4
The Ideal Continual Learner: An Agent That Never ForgetsCode4
OK-Robot: What Really Matters in Integrating Open-Knowledge Models for RoboticsCode4
The Segment Anything Model (SAM) for Remote Sensing Applications: From Zero to One ShotCode4
Turning Whisper into Real-Time Transcription SystemCode4
EasyJailbreak: A Unified Framework for Jailbreaking Large Language ModelsCode4
Neural general circulation models optimized to predict satellite-based precipitation observationsCode4
Show:102550
← PrevPage 32 of 13200Next →