SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1410114150 of 474278 papers

TitleStatusHype
FrugalGPT: How to Use Large Language Models While Reducing Cost and Improving PerformanceCode2
ChemDFM: A Large Language Foundation Model for ChemistryCode2
Mamba-SEUNet: Mamba UNet for Monaural Speech EnhancementCode2
ETTA: Elucidating the Design Space of Text-to-Audio ModelsCode2
FastSpeech: Fast,Robustand Controllable Text-to-SpeechCode2
Pruning Filters for Efficient ConvNetsCode2
DiT-3D: Exploring Plain Diffusion Transformers for 3D Shape GenerationCode2
MathOdyssey: Benchmarking Mathematical Problem-Solving Skills in Large Language Models Using Odyssey Math DataCode2
PathGen-1.6M: 1.6 Million Pathology Image-text Pairs Generation through Multi-agent CollaborationCode2
Instant3D: Fast Text-to-3D with Sparse-View Generation and Large Reconstruction ModelCode2
MaskTerial: A Foundation Model for Automated 2D Material Flake DetectionCode2
Sensitive Data Detection with High-Throughput Neural Network Models for Financial InstitutionsCode2
LARP: Tokenizing Videos with a Learned Autoregressive Generative PriorCode2
Streaming Keyword Spotting Boosted by Cross-layer Discrimination ConsistencyCode2
AlignXIE: Improving Multilingual Information Extraction by Cross-Lingual AlignmentCode2
Counterfactual Phenotyping with Censored Time-to-EventsCode2
FedRLHF: A Convergence-Guaranteed Federated Framework for Privacy-Preserving and Personalized RLHFCode2
Semantic Editing Increment Benefits Zero-Shot Composed Image RetrievalCode2
Concat-ID: Towards Universal Identity-Preserving Video SynthesisCode2
MemSeg: A semi-supervised method for image surface defect detection using differences and commonalitiesCode2
Event-Based Motion MagnificationCode2
EfficientAD: Accurate Visual Anomaly Detection at Millisecond-Level LatenciesCode2
Motion Inversion for Video CustomizationCode2
Patchwork++: Fast and Robust Ground Segmentation Solving Partial Under-Segmentation Using 3D Point CloudCode2
Multi-Document Grounded Multi-Turn Synthetic Dialog GenerationCode2
Panacea: Panoramic and Controllable Video Generation for Autonomous DrivingCode2
YAKE! Keyword extraction from single documents using multiple local featuresCode2
Derivative-Free Diffusion Manifold-Constrained Gradient for Unified XAICode2
MetaFed: Federated Learning among Federations with Cyclic Knowledge Distillation for Personalized HealthcareCode2
Prompt-CAM: A Simpler Interpretable Transformer for Fine-Grained AnalysisCode2
cuSLINK: Single-linkage Agglomerative Clustering on the GPUCode2
Consistency Diffusion Bridge ModelsCode2
A Multimodal Knowledge-enhanced Whole-slide Pathology Foundation ModelCode2
Pillar R-CNN for Point Cloud 3D Object DetectionCode2
Compiler Optimization via LLM Reasoning for Efficient Model ServingCode2
EDEN: Enhanced Diffusion for High-quality Large-motion Video Frame InterpolationCode2
Garment3DGen: 3D Garment Stylization and Texture GenerationCode2
TrackDiffusion: Tracklet-Conditioned Video Generation via Diffusion ModelsCode2
iSTFTNet: Fast and Lightweight Mel-Spectrogram Vocoder Incorporating Inverse Short-Time Fourier TransformCode2
Online Video Understanding: OVBench and VideoChat-OnlineCode2
Evaluating Large-Vocabulary Object Detectors: The Devil is in the DetailsCode2
Task-wise Sampling Convolutions for Arbitrary-Oriented Object Detection in Aerial ImagesCode2
On the Continuity of Rotation Representations in Neural NetworksCode2
PAniC-3D: Stylized Single-view 3D Reconstruction from Portraits of Anime CharactersCode2
Evaluation of Bio-Inspired Models under Different Learning Settings For Energy Efficiency in Network Traffic PredictionCode2
Retrieving Semantics from the Deep: an RAG Solution for Gesture SynthesisCode2
Chimp: Efficient Lossless Floating Point Compression for Time Series DatabasesCode2
Laughing Hyena Distillery: Extracting Compact Recurrences From ConvolutionsCode2
VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture DesignCode2
BrainMVP: Multi-modal Vision Pre-training for Brain Image Analysis using Multi-parametric MRICode2
Show:102550
← PrevPage 283 of 9486Next →