SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 1080110850 of 661570 papers

TitleStatusHype
AnomalyCLIP: Object-agnostic Prompt Learning for Zero-shot Anomaly DetectionCode2
Foundational Models in Medical Imaging: A Comprehensive Survey and Future VisionCode2
Laughing Hyena Distillery: Extracting Compact Recurrences From ConvolutionsCode2
FP8-LM: Training FP8 Large Language ModelsCode2
ZeroNVS: Zero-Shot 360-Degree View Synthesis from a Single ImageCode2
Deja Vu: Contextual Sparsity for Efficient LLMs at Inference TimeCode2
MimicGen: A Data Generation System for Scalable Robot Learning using Human DemonstrationsCode2
JudgeLM: Fine-tuned Large Language Models are Scalable JudgesCode2
MotionAGFormer: Enhancing 3D Human Pose Estimation with a Transformer-GCNFormer NetworkCode2
QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter ModelsCode2
Hierarchical Text Spotter for Joint Text Spotting and Layout AnalysisCode2
The Data Provenance Initiative: A Large Scale Audit of Dataset Licensing & Attribution in AICode2
LLM-FP4: 4-Bit Floating-Point Quantized TransformersCode2
Detecting Pretraining Data from Large Language ModelsCode2
CommonCanvas: An Open Diffusion Model Trained with Creative-Commons ImagesCode2
PromptAgent: Strategic Planning with Language Models Enables Expert-level Prompt OptimizationCode2
TD-MPC2: Scalable, Robust World Models for Continuous ControlCode2
Discrete Diffusion Modeling by Estimating the Ratios of the Data DistributionCode2
Neural Potential Field for Obstacle-Aware Local Motion PlanningCode2
PERF: Panoramic Neural Radiance Field from a Single PanoramaCode2
Pre-training Music Classification Models via Music Source SeparationCode2
Dynamic Convolutional Neural Networks as Efficient Pre-trained Audio ModelsCode2
Mixture of Tokens: Continuous MoE through Cross-Example AggregationCode2
A Survey on Detection of LLMs-Generated ContentCode2
BianQue: Balancing the Questioning and Suggestion Ability of Health LLMs with Multi-turn Health Conversations Polished by ChatGPTCode2
Woodpecker: Hallucination Correction for Multimodal Large Language ModelsCode2
Representation Learning with Large Language Models for RecommendationCode2
Brainchop: Next Generation Web-Based Neuroimaging ApplicationCode2
Breaking of brightness consistency in optical flow with a lightweight CNN networkCode2
RoboDepth: Robust Out-of-Distribution Depth Estimation under CorruptionsCode2
A Survey on LLM-Generated Text Detection: Necessity, Methods, and Future DirectionsCode2
RD-VIO: Robust Visual-Inertial Odometry for Mobile Augmented Reality in Dynamic EnvironmentsCode2
Matryoshka Diffusion ModelsCode2
SAM-Med3D: Towards General-purpose Segmentation Models for Volumetric Medical ImagesCode2
DISC-FinLLM: A Chinese Financial Large Language Model based on Multiple Experts Fine-tuningCode2
HallusionBench: An Advanced Diagnostic Suite for Entangled Language Hallucination and Visual Illusion in Large Vision-Language ModelsCode2
BatteryML:An Open-source platform for Machine Learning on Battery DegradationCode2
Is Weakly-supervised Action Segmentation Ready For Human-Robot Interaction? No, Let's Improve It With Action-union LearningCode2
Vision Language Models in Autonomous Driving: A Survey and OutlookCode2
A Pytorch Reproduction of Masked Generative Image TransformerCode2
PromptCBLUE: A Chinese Prompt Tuning Benchmark for the Medical DomainCode2
RL-X: A Deep Reinforcement Learning Library (not only) for RoboCupCode2
Improving Molecular Properties Prediction Through Latent Space FusionCode2
Formalizing and Benchmarking Prompt Injection Attacks and DefensesCode2
GraphGPT: Graph Instruction Tuning for Large Language ModelsCode2
Frozen Transformers in Language Models Are Effective Visual Encoder LayersCode2
HumanTOMATO: Text-aligned Whole-body Motion GenerationCode2
SRAI: Towards Standardization of Geospatial AICode2
Position Interpolation Improves ALiBi ExtrapolationCode2
Monarch Mixer: A Simple Sub-Quadratic GEMM-Based ArchitectureCode2
Show:102550
← PrevPage 217 of 13232Next →