SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers

Showing 6001-6050 of 661,570 papers

Title | Status | Hype
EmoFace: Audio-driven Emotional 3D Face Animation | Code | 2
OmniBench: Towards The Future of Universal Omni-Language Models | Code | 2
ADATIME: A Benchmarking Suite for Domain Adaptation on Time Series Data | Code | 2
ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction | Code | 2
InteractRank: Personalized Web-Scale Search Pre-Ranking with Cross Interaction Features | Code | 2
Specializing Smaller Language Models towards Multi-Step Reasoning | Code | 2
Stitchable Neural Networks | Code | 2
Respecting causality is all you need for training physics-informed neural networks | Code | 2
Towards Interpretable Mental Health Analysis with Large Language Models | Code | 2
Cross-Modality Safety Alignment | Code | 2
FiLo: Zero-Shot Anomaly Detection by Fine-Grained Description and High-Quality Localization | Code | 2
HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference | Code | 2
Target conversation extraction: Source separation using turn-taking dynamics | Code | 2
Switch Diffusion Transformer: Synergizing Denoising Tasks with Sparse Mixture-of-Experts | Code | 2
GPT-InvestAR: Enhancing Stock Investment Strategies through Annual Report Analysis with Large Language Models | Code | 2
BLIVA: A Simple Multimodal LLM for Better Handling of Text-Rich Visual Questions | Code | 2
H3T: Efficient Integration of Memory Optimization and Parallelism for Large-scale Transformer Training | Code | 2
Beyond Next Token Prediction: Patch-Level Training for Large Language Models | Code | 2
A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future | Code | 2
normflows: A PyTorch Package for Normalizing Flows | Code | 2
WidthFormer: Toward Efficient Transformer-based BEV View Transformation | Code | 2
Evidential Detection and Tracking Collaboration: New Problem, Benchmark and Algorithm for Robust Anti-UAV System | Code | 2
Deep Incubation: Training Large Models by Divide-and-Conquering | Code | 2
Fortuna: A Library for Uncertainty Quantification in Deep Learning | Code | 2
BinsFormer: Revisiting Adaptive Bins for Monocular Depth Estimation | Code | 2
TURNA: A Turkish Encoder-Decoder Language Model for Enhanced Understanding and Generation | Code | 2
Med-R1: Reinforcement Learning for Generalizable Medical Reasoning in Vision-Language Models | Code | 2
Etalon: Holistic Performance Evaluation Framework for LLM Inference Systems | Code | 2
Seal-Tools: Self-Instruct Tool Learning Dataset for Agent Tuning and Detailed Benchmark | Code | 2
A Diffusion-Based Generative Equalizer for Music Restoration | Code | 2
Omnizart: A General Toolbox for Automatic Music Transcription | Code | 2
MARLIN: Masked Autoencoder for facial video Representation LearnINg | Code | 2
Thermal half-lives of azobenzene derivatives: virtual screening based on intersystem crossing using a machine learning potential | Code | 2
GeoCLIP: Clip-Inspired Alignment between Locations and Images for Effective Worldwide Geo-localization | Code | 2
Large Language Models for Anomaly and Out-of-Distribution Detection: A Survey | Code | 2
Towards Scalable Automated Alignment of LLMs: A Survey | Code | 2
ViTime: A Visual Intelligence-Based Foundation Model for Time Series Forecasting | Code | 2
StreamSplat: Towards Online Dynamic 3D Reconstruction from Uncalibrated Video Streams | Code | 2
Understanding Performance of Long-Document Ranking Models through Comprehensive Evaluation and Leaderboarding | Code | 2
eVAE: Evolutionary Variational Autoencoder | Code | 2
Let Images Give You More: Point Cloud Cross-Modal Training for Shape Analysis | Code | 2
Long-term Traffic Simulation with Interleaved Autoregressive Motion and Scenario Generation | Code | 2
EmerNeRF: Emergent Spatial-Temporal Scene Decomposition via Self-Supervision | Code | 2
L-AutoDA: Leveraging Large Language Models for Automated Decision-based Adversarial Attacks | Code | 2
Omni-Video: Democratizing Unified Video Understanding and Generation | Code | 2
From Perfect to Noisy World Simulation: Customizable Embodied Multi-modal Perturbations for SLAM Robustness Benchmarking | Code | 2
ExpeL: LLM Agents Are Experiential Learners | Code | 2
MuMA-ToM: Multi-modal Multi-Agent Theory of Mind | Code | 2
Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient | Code | 2
Brax -- A Differentiable Physics Engine for Large Scale Rigid Body Simulation | Code | 2