SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 71517200 of 661570 papers

TitleStatusHype
ADATIME: A Benchmarking Suite for Domain Adaptation on Time Series DataCode2
ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept ExtractionCode2
InteractRank: Personalized Web-Scale Search Pre-Ranking with Cross Interaction FeaturesCode2
Specializing Smaller Language Models towards Multi-Step ReasoningCode2
Stitchable Neural NetworksCode2
Respecting causality is all you need for training physics-informed neural networksCode2
Towards Interpretable Mental Health Analysis with Large Language ModelsCode2
Cross-Modality Safety AlignmentCode2
FiLo: Zero-Shot Anomaly Detection by Fine-Grained Description and High-Quality LocalizationCode2
HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE InferenceCode2
Target conversation extraction: Source separation using turn-taking dynamicsCode2
Switch Diffusion Transformer: Synergizing Denoising Tasks with Sparse Mixture-of-ExpertsCode2
GPT-InvestAR: Enhancing Stock Investment Strategies through Annual Report Analysis with Large Language ModelsCode2
BLIVA: A Simple Multimodal LLM for Better Handling of Text-Rich Visual QuestionsCode2
A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and FutureCode2
normflows: A PyTorch Package for Normalizing FlowsCode2
WidthFormer: Toward Efficient Transformer-based BEV View TransformationCode2
Evidential Detection and Tracking Collaboration: New Problem, Benchmark and Algorithm for Robust Anti-UAV SystemCode2
Deep Incubation: Training Large Models by Divide-and-ConqueringCode2
Med-R1: Reinforcement Learning for Generalizable Medical Reasoning in Vision-Language ModelsCode2
Seal-Tools: Self-Instruct Tool Learning Dataset for Agent Tuning and Detailed BenchmarkCode2
MARLIN: Masked Autoencoder for facial video Representation LearnINgCode2
GeoCLIP: Clip-Inspired Alignment between Locations and Images for Effective Worldwide Geo-localizationCode2
Large Language Models for Anomaly and Out-of-Distribution Detection: A SurveyCode2
StreamSplat: Towards Online Dynamic 3D Reconstruction from Uncalibrated Video StreamsCode2
eVAE: Evolutionary Variational AutoencoderCode2
Long-term Traffic Simulation with Interleaved Autoregressive Motion and Scenario GenerationCode2
EmerNeRF: Emergent Spatial-Temporal Scene Decomposition via Self-SupervisionCode2
Omni-Video: Democratizing Unified Video Understanding and GenerationCode2
From Perfect to Noisy World Simulation: Customizable Embodied Multi-modal Perturbations for SLAM Robustness BenchmarkingCode2
Unwrapping The Black Box of Deep ReLU Networks: Interpretability, Diagnostics, and SimplificationCode2
VRL3: A Data-Driven Framework for Visual Deep Reinforcement LearningCode2
A Data-scalable Transformer for Medical Image Segmentation: Architecture, Model Efficiency, and BenchmarkCode2
Neural interval-censored survival regression with feature selectionCode2
DynamicRAG: Leveraging Outputs of Large Language Model as Feedback for Dynamic Reranking in Retrieval-Augmented GenerationCode2
DiffusionBERT: Improving Generative Masked Language Models with Diffusion ModelsCode2
Executing your Commands via Motion Diffusion in Latent SpaceCode2
NMS Strikes BackCode2
DiffFace: Diffusion-based Face Swapping with Facial GuidanceCode2
Leveraging Reasoning Model Answers to Enhance Non-Reasoning Model CapabilityCode2
Efficient Speech Enhancement via Embeddings from Pre-trained Generative AudioencodersCode2
Watermarking Autoregressive Image GenerationCode2
Investigating Affective Use and Emotional Well-being on ChatGPTCode2
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model ParallelismCode2
Autonomous Improvement of Instruction Following Skills via Foundation ModelsCode2
MemoryBank: Enhancing Large Language Models with Long-Term MemoryCode2
Sailing AI by the Stars: A Survey of Learning from Rewards in Post-Training and Test-Time Scaling of Large Language ModelsCode2
SuperEdit: Rectifying and Facilitating Supervision for Instruction-Based Image EditingCode2
GauSS-MI: Gaussian Splatting Shannon Mutual Information for Active 3D ReconstructionCode2
Unified Continuous Generative ModelsCode2
Show:102550
← PrevPage 144 of 13232Next →