SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 75517600 of 661570 papers

TitleStatusHype
OpenPifPaf: Composite Fields for Semantic Keypoint Detection and Spatio-Temporal AssociationCode2
MedViT: A Robust Vision Transformer for Generalized Medical Image ClassificationCode2
FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor CoresCode2
H3T: Efficient Integration of Memory Optimization and Parallelism for Large-scale Transformer TrainingCode2
Training-free CryoET Tomogram SegmentationCode2
Beyond Next Token Prediction: Patch-Level Training for Large Language ModelsCode2
Articulated Object Interaction in Unknown Scenes with Whole-Body Mobile ManipulationCode2
BAD-NeRF: Bundle Adjusted Deblur Neural Radiance FieldsCode2
BLASER: A Text-Free Speech-to-Speech Translation Evaluation MetricCode2
FacTool: Factuality Detection in Generative AI -- A Tool Augmented Framework for Multi-Task and Multi-Domain ScenariosCode2
ReDel: A Toolkit for LLM-Powered Recursive Multi-Agent SystemsCode2
A Review of Graph Neural Networks in Epidemic ModelingCode2
LongEmbed: Extending Embedding Models for Long Context RetrievalCode2
Towards Interpreting Visual Information Processing in Vision-Language ModelsCode2
Seg-metrics: a Python package to compute segmentation metricsCode2
SDSTrack: Self-Distillation Symmetric Adapter Learning for Multi-Modal Visual Object TrackingCode2
GAN Prior Embedded Network for Blind Face Restoration in the WildCode2
Fortuna: A Library for Uncertainty Quantification in Deep LearningCode2
Parallel Bayesian Optimization of Multiple Noisy Objectives with Expected Hypervolume ImprovementCode2
BinsFormer: Revisiting Adaptive Bins for Monocular Depth EstimationCode2
CogView: Mastering Text-to-Image Generation via TransformersCode2
TURNA: A Turkish Encoder-Decoder Language Model for Enhanced Understanding and GenerationCode2
RGL: A Graph-Centric, Modular Framework for Efficient Retrieval-Augmented Generation on GraphsCode2
A Contrastive Framework for Neural Text GenerationCode2
Understanding Multi-Granularity for Open-Vocabulary Part SegmentationCode2
Etalon: Holistic Performance Evaluation Framework for LLM Inference SystemsCode2
FairMedFM: Fairness Benchmarking for Medical Imaging Foundation ModelsCode2
Cap4Video: What Can Auxiliary Captions Do for Text-Video Retrieval?Code2
PsycoLLM: Enhancing LLM for Psychological Understanding and EvaluationCode2
A Diffusion-Based Generative Equalizer for Music RestorationCode2
Efficient and Modular Implicit DifferentiationCode2
Omnizart: A General Toolbox for Automatic Music TranscriptionCode2
PrimeQA: The Prime Repository for State-of-the-Art Multilingual Question Answering Research and DevelopmentCode2
PolyRoom: Room-aware Transformer for Floorplan ReconstructionCode2
deepmriprep: Voxel-based Morphometry (VBM) Preprocessing via Deep Neural NetworksCode2
Thermal half-lives of azobenzene derivatives: virtual screening based on intersystem crossing using a machine learning potentialCode2
Revisiting Contrastive Methods for Unsupervised Learning of Visual RepresentationsCode2
LiDAR-PTQ: Post-Training Quantization for Point Cloud 3D Object DetectionCode2
Realistic and Efficient Face Swapping: A Unified Approach with Diffusion ModelsCode2
Interpretable Machine Learning for Science with PySR and SymbolicRegression.jlCode2
ReVersion: Diffusion-Based Relation Inversion from ImagesCode2
Towards Scalable Automated Alignment of LLMs: A SurveyCode2
AxBench: Steering LLMs? Even Simple Baselines Outperform Sparse AutoencodersCode2
ViTime: A Visual Intelligence-Based Foundation Model for Time Series ForecastingCode2
Less is More: Fewer Interpretable Region via Submodular Subset SelectionCode2
LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context InferenceCode2
NeuS: Learning Neural Implicit Surfaces by Volume Rendering for Multi-view ReconstructionCode2
Understanding Performance of Long-Document Ranking Models through Comprehensive Evaluation and LeaderboardingCode2
VSA: Learning Varied-Size Window Attention in Vision TransformersCode2
Pano2Room: Novel View Synthesis from a Single Indoor PanoramaCode2
Show:102550
← PrevPage 152 of 13232Next →