SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 53265350 of 177340 papers

TitleStatusHype
Perception Test: A Diagnostic Benchmark for Multimodal ModelsCode2
Log-based Anomaly Detection with Deep Learning: How Far Are We?Code2
InvPT: Inverted Pyramid Multi-task Transformer for Dense Scene UnderstandingCode2
BigVSAN: Enhancing GAN-based Neural Vocoders with Slicing Adversarial NetworkCode2
Generating Diverse and Natural 3D Human Motions From TextCode2
DiGress: Discrete Denoising diffusion for graph generationCode2
InstructGraph: Boosting Large Language Models via Graph-centric Instruction Tuning and Preference AlignmentCode2
SceneTracker: Long-term Scene Flow Estimation NetworkCode2
MoVA: Adapting Mixture of Vision Experts to Multimodal ContextCode2
The ArtBench Dataset: Benchmarking Generative Models with ArtworksCode2
Identifying and Combating Bias in Segmentation Networks by leveraging multiple resolutionsCode2
Space-time 2D Gaussian Splatting for Accurate Surface Reconstruction under Complex Dynamic ScenesCode2
Evolutionary Computation in the Era of Large Language Model: Survey and RoadmapCode2
A Generalizable Anomaly Detection Method in Dynamic GraphsCode2
Is ChatGPT A Good Translator? Yes With GPT-4 As The EngineCode2
What Matters In The Structured Pruning of Generative Language Models?Code2
Efficient 3D Semantic Segmentation with Superpoint TransformerCode2
A differentiable brain simulator bridging brain simulation and brain-inspired computingCode2
StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and DereverberationCode2
Plenoxels: Radiance Fields without Neural NetworksCode2
Lookahead: An Inference Acceleration Framework for Large Language Model with Lossless Generation AccuracyCode2
CogGPT: Unleashing the Power of Cognitive Dynamics on Large Language ModelsCode2
Birbal: An efficient 7B instruct-model fine-tuned with curated datasetsCode2
Towards a clinically accessible radiology foundation model: open-access and lightweight, with automated evaluationCode2
InstructScene: Instruction-Driven 3D Indoor Scene Synthesis with Semantic Graph PriorCode2
Show:102550
← PrevPage 214 of 7094Next →