SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 53015350 of 661570 papers

TitleStatusHype
REEF: Representation Encoding Fingerprints for Large Language ModelsCode2
Modeling the Label Distributions for Weakly-Supervised Semantic SegmentationCode2
MotionDiffuse: Text-Driven Human Motion Generation with Diffusion ModelCode2
Large language models surpass human experts in predicting neuroscience resultsCode2
Owl-1: Omni World Model for Consistent Long Video GenerationCode2
Diving Deeper Into Pedestrian Behavior Understanding: Intention Estimation, Action Prediction, and Event Risk AssessmentCode2
K2: A Foundation Language Model for Geoscience Knowledge Understanding and UtilizationCode2
GenSim: A General Social Simulation Platform with Large Language Model based AgentsCode2
Metric Flow Matching for Smooth Interpolations on the Data ManifoldCode2
Harmonizer: Learning to Perform White-Box Image and Video HarmonizationCode2
Android in the Zoo: Chain-of-Action-Thought for GUI AgentsCode2
Knowledge Circuits in Pretrained TransformersCode2
PyMIC: A deep learning toolkit for annotation-efficient medical image segmentationCode2
PHemoNet: A Multimodal Network for Physiological SignalsCode2
From Sparse to Soft Mixtures of ExpertsCode2
ColorizeDiffusion: Adjustable Sketch Colorization with Reference Image and TextCode2
Diving into Underwater: Segment Anything Model Guided Underwater Salient Instance Segmentation and A Large-scale DatasetCode2
DifIISR: A Diffusion Model with Gradient Guidance for Infrared Image Super-ResolutionCode2
nuScenes: A multimodal dataset for autonomous drivingCode2
An LLM-Assisted Easy-to-Trigger Backdoor Attack on Code Completion Models: Injecting Disguised Vulnerabilities against Strong DetectionCode2
Shape, Light, and Material Decomposition from Images using Monte Carlo Rendering and DenoisingCode2
Video Prediction Transformers without Recurrence or ConvolutionCode2
TinyLLaVA-Video-R1: Towards Smaller LMMs for Video ReasoningCode2
DeepFilterNet: A Low Complexity Speech Enhancement Framework for Full-Band Audio based on Deep FilteringCode2
PoseScript: Linking 3D Human Poses and Natural LanguageCode2
SDEdit: Guided Image Synthesis and Editing with Stochastic Differential EquationsCode2
Satellite Image Time Series Semantic Change Detection: Novel Architecture and Analysis of Domain ShiftCode2
LLaMEA: A Large Language Model Evolutionary Algorithm for Automatically Generating MetaheuristicsCode2
Unsupervised Universal Image SegmentationCode2
VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation ModelsCode2
Collaborative Gym: A Framework for Enabling and Evaluating Human-Agent CollaborationCode2
HENet: Hybrid Encoding for End-to-end Multi-task 3D Perception from Multi-view CamerasCode2
MedM-VL: What Makes a Good Medical LVLM?Code2
Self-Explore: Enhancing Mathematical Reasoning in Language Models with Fine-grained RewardsCode2
MathBench: Evaluating the Theory and Application Proficiency of LLMs with a Hierarchical Mathematics BenchmarkCode2
ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion ModelsCode2
All for One and One for All: Improving Music Separation by Bridging NetworksCode2
Swin2SR: SwinV2 Transformer for Compressed Image Super-Resolution and RestorationCode2
MAS-GPT: Training LLMs to Build LLM-based Multi-Agent SystemsCode2
Mixture of LoRA ExpertsCode2
Neighboring Autoregressive Modeling for Efficient Visual GenerationCode2
The Calysto Scheme ProjectCode2
ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language ModelsCode2
Exploring Plain Vision Transformer Backbones for Object DetectionCode2
Twin-Merging: Dynamic Integration of Modular Expertise in Model MergingCode2
Hidden Biases of End-to-End Driving ModelsCode2
LaserMix for Semi-Supervised LiDAR Semantic SegmentationCode2
IPDnet: A Universal Direct-Path IPD Estimation Network for Sound Source LocalizationCode2
GestureLSM: Latent Shortcut based Co-Speech Gesture Generation with Spatial-Temporal ModelingCode2
Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation EngineeringCode2
Show:102550
← PrevPage 107 of 13232Next →