SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1200112050 of 177340 papers

TitleStatusHype
TextBox: A Unified, Modularized, and Extensible Framework for Text GenerationCode2
Generative Image as Action ModelsCode2
DiffusionFake: Enhancing Generalization in Deepfake Detection via Guided Stable DiffusionCode2
Fine-grained Late-interaction Multi-modal Retrieval for Retrieval Augmented Visual Question AnsweringCode2
DataComp: In search of the next generation of multimodal datasetsCode2
Do We Need Domain-Specific Embedding Models? An Empirical InvestigationCode2
MACE: An Efficient Model-Agnostic Framework for Counterfactual ExplanationCode2
EVA2.0: Investigating Open-Domain Chinese Dialogue Systems with Large-Scale Pre-TrainingCode2
EasyHOI: Unleashing the Power of Large Models for Reconstructing Hand-Object Interactions in the WildCode2
P-Tuning v2: Prompt Tuning Can Be Comparable to Fine-tuning Universally Across Scales and TasksCode2
MAIRA-2: Grounded Radiology Report GenerationCode2
ADAPT: Action-aware Driving Caption TransformerCode2
PointCLIP V2: Prompting CLIP and GPT for Powerful 3D Open-world LearningCode2
Stochastic Taylor Derivative Estimator: Efficient amortization for arbitrary differential operatorsCode2
Fishing for Magikarp: Automatically Detecting Under-trained Tokens in Large Language ModelsCode2
Contrastive language and vision learning of general fashion conceptsCode2
Macro Graph Neural Networks for Online Billion-Scale Recommender SystemsCode2
Scaling the Codebook Size of VQGAN to 100,000 with a Utilization Rate of 99%Code2
RoMe: Towards Large Scale Road Surface Reconstruction via Mesh RepresentationCode2
Programming Refusal with Conditional Activation SteeringCode2
MI-GAN: A Simple Baseline for Image Inpainting on Mobile DevicesCode2
A Survey on Inference Engines for Large Language Models: Perspectives on Optimization and EfficiencyCode2
Granite GuardianCode2
BEDLAM: A Synthetic Dataset of Bodies Exhibiting Detailed Lifelike Animated MotionCode2
High-Fidelity Relightable Monocular Portrait Animation with Lighting-Controllable Video Diffusion ModelCode2
Fast Chain-of-Thought: A Glance of Future from Parallel Decoding Leads to Answers FasterCode2
Diff-eRank: A Novel Rank-Based Metric for Evaluating Large Language ModelsCode2
Multi-Task Dense Prediction via Mixture of Low-Rank ExpertsCode2
Three scenarios for continual learningCode2
Distributional Gradient Boosting MachinesCode2
Practical tradeoffs between memory, compute, and performance in learned optimizersCode2
AnalogCoder: Analog Circuit Design via Training-Free Code GenerationCode2
RouteFinder: Towards Foundation Models for Vehicle Routing ProblemsCode2
MQAG: Multiple-choice Question Answering and Generation for Assessing Information Consistency in SummarizationCode2
Higher Layers Need More LoRA ExpertsCode2
FLAMO: An Open-Source Library for Frequency-Domain Differentiable Audio ProcessingCode2
PINNacle: A Comprehensive Benchmark of Physics-Informed Neural Networks for Solving PDEsCode2
Dataset QuantizationCode2
Frouros: A Python library for drift detection in machine learning systemsCode2
Towards Foundational Models for Molecular Learning on Large-Scale Multi-Task DatasetsCode2
Training Language Models to Reason EfficientlyCode2
Nerfbusters: Removing Ghostly Artifacts from Casually Captured NeRFsCode2
DOT: A Distillation-Oriented TrainerCode2
Image Super-Resolution Using Very Deep Residual Channel Attention NetworksCode2
WaferLLM: Large Language Model Inference at Wafer ScaleCode2
FABLES: Evaluating faithfulness and content selection in book-length summarizationCode2
CMax-SLAM: Event-based Rotational-Motion Bundle Adjustment and SLAM System using Contrast MaximizationCode2
BEVSpread: Spread Voxel Pooling for Bird's-Eye-View Representation in Vision-based Roadside 3D Object DetectionCode2
ViewFormer: NeRF-free Neural Rendering from Few Images Using TransformersCode2
OpenFE: Automated Feature Generation with Expert-level PerformanceCode2
Show:102550
← PrevPage 241 of 3547Next →