SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 61016150 of 661570 papers

TitleStatusHype
Deep Learning Based Automatic Modulation Recognition: Models, Datasets, and ChallengesCode2
Robust Human Matting via Semantic GuidanceCode2
InsightBench: Evaluating Business Analytics Agents Through Multi-Step Insight GenerationCode2
Salesforce CausalAI Library: A Fast and Scalable Framework for Causal Analysis of Time Series and Tabular DataCode2
PosterLLaVa: Constructing a Unified Multi-modal Layout Generator with LLMCode2
GraphGPT: Graph Instruction Tuning for Large Language ModelsCode2
Making LLaMA SEE and Draw with SEED TokenizerCode2
Unlocking Feature Visualization for Deeper Networks with MAgnitude Constrained OptimizationCode2
Hokoff: Real Game Dataset from Honor of Kings and its Offline Reinforcement Learning BenchmarksCode2
RGBAvatar: Reduced Gaussian Blendshapes for Online Modeling of Head AvatarsCode2
ReliableSwap: Boosting General Face Swapping Via Reliable SupervisionCode2
Structure-Aware Transformer for Graph Representation LearningCode2
High-Order Control Barrier Functions: Insights and a Truncated Taylor-Based FormulationCode2
Contrastive Learning of Asset Embeddings from Financial Time SeriesCode2
Pix2NeRF: Unsupervised Conditional p-GAN for Single Image to Neural Radiance Fields TranslationCode2
CHiSafetyBench: A Chinese Hierarchical Safety Benchmark for Large Language ModelsCode2
Graph Neural Network-based surrogate model for granular flowsCode2
Chasing Low-Carbon Electricity for Practical and Sustainable DNN TrainingCode2
One-for-All: Generalized LoRA for Parameter-Efficient Fine-tuningCode2
GSplatLoc: Ultra-Precise Camera Localization via 3D Gaussian SplattingCode2
Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive DiffusionCode2
SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language ModelsCode2
Video Polyp Segmentation: A Deep Learning PerspectiveCode2
P2P: Part-to-Part Motion Cues Guide a Strong Tracking Framework for LiDAR Point CloudsCode2
PolygloToxicityPrompts: Multilingual Evaluation of Neural Toxic Degeneration in Large Language ModelsCode2
mdCATH: A Large-Scale MD Dataset for Data-Driven Computational BiophysicsCode2
BioT5+: Towards Generalized Biological Understanding with IUPAC Integration and Multi-task TuningCode2
Melting Pot 2.0Code2
A Novel Approach to Industrial Defect Generation through Blended Latent Diffusion Model with Online AdaptationCode2
SARChat-Bench-2M: A Multi-Task Vision-Language Benchmark for SAR Image InterpretationCode2
SkyScript-100M: 1,000,000,000 Pairs of Scripts and Shooting Scripts for Short DramaCode2
GSplatLoc: Grounding Keypoint Descriptors into 3D Gaussian Splatting for Improved Visual LocalizationCode2
TraDiffusion: Trajectory-Based Training-Free Image GenerationCode2
Mephisto: A Framework for Portable, Reproducible, and Iterative CrowdsourcingCode2
Attention-based Deep Multiple Instance LearningCode2
Interacting Attention Graph for Single Image Two-Hand ReconstructionCode2
Frequency-domain MLPs are More Effective Learners in Time Series ForecastingCode2
REALY: Rethinking the Evaluation of 3D Face ReconstructionCode2
SPIRAL: Self-supervised Perturbation-Invariant Representation Learning for Speech Pre-TrainingCode2
Does Image Anonymization Impact Computer Vision Training?Code2
NuScenes-QA: A Multi-modal Visual Question Answering Benchmark for Autonomous Driving ScenarioCode2
In-Context Imitation Learning via Next-Token PredictionCode2
A Hybrid Transformer-Mamba Network for Single Image DerainingCode2
Right Question is Already Half the Answer: Fully Unsupervised LLM Reasoning IncentivizationCode2
LViT: Language meets Vision Transformer in Medical Image SegmentationCode2
gRNAde: Geometric Deep Learning for 3D RNA inverse designCode2
Uni3D: Exploring Unified 3D Representation at ScaleCode2
OstQuant: Refining Large Language Model Quantization with Orthogonal and Scaling Transformations for Better Distribution FittingCode2
VCP-CLIP: A visual context prompting model for zero-shot anomaly segmentationCode2
Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for ReasoningCode2
Show:102550
← PrevPage 123 of 13232Next →