SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 90019050 of 661570 papers

TitleStatusHype
Codev-Bench: How Do LLMs Understand Developer-Centric Code Completion?Code2
PointAD: Comprehending 3D Anomalies from Points and Pixels for Zero-shot 3D Anomaly DetectionCode2
End-to-end Piano Performance-MIDI to Score Conversion with TransformersCode2
Mamba in Vision: A Comprehensive Survey of Techniques and ApplicationsCode2
Cheating Automatic LLM Benchmarks: Null Models Achieve High Win RatesCode2
Prompting DirectSAM for Semantic Contour Extraction in Remote Sensing ImagesCode2
Reversible Decoupling Network for Single Image Reflection RemovalCode2
Merging in a Bottle: Differentiable Adaptive Merging (DAM) and the Path from Averaging to AutomationCode2
TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance ControlCode2
LLM-Based Multi-Agent Systems are Scalable Graph Generative ModelsCode2
GS^3: Efficient Relighting with Triple Gaussian SplattingCode2
WeatherDG: LLM-assisted Diffusion Model for Procedural Weather Generation in Domain-Generalized Semantic SegmentationCode2
Batch and match: black-box variational inference with a score-based divergenceCode2
Vision Search Assistant: Empower Vision-Language Models as Multimodal Search EnginesCode2
Retrieval-Enhanced Mutation Mastery: Augmenting Zero-Shot Prediction of Protein Language ModelCode2
Accelerating Direct Preference Optimization with Prefix SharingCode2
Rare-to-Frequent: Unlocking Compositional Generation Power of Diffusion Models on Rare Concepts with LLM GuidanceCode2
MassSpecGym: A benchmark for the discovery and identification of moleculesCode2
PC-Gym: Benchmark Environments For Process Control ProblemsCode2
CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMsCode2
Learning General-Purpose Biomedical Volume Representations using Randomized SynthesisCode2
A Modular and Robust Physics-Based Approach for Lensless Image ReconstructionCode2
PoseX: AI Defeats Physics Approaches on Protein-Ligand Cross DockingCode2
GTA: Global Tracklet Association for Multi-Object Tracking in SportsCode2
Golden Noise for Diffusion Models: A Learning FrameworkCode2
SymphonyQG: Towards Symphonious Integration of Quantization and Graph for Approximate Nearest Neighbor SearchCode2
Edit Away and My Face Will not Stay: Personal Biometric Defense against Malicious Generative EditingCode2
GaussianSpeech: Audio-Driven Gaussian AvatarsCode2
PhyT2V: LLM-Guided Iterative Self-Refinement for Physics-Grounded Text-to-Video GenerationCode2
RIDCP: Revitalizing Real Image Dehazing via High-Quality Codebook PriorsCode2
Volumetrically Consistent 3D Gaussian RasterizationCode2
Splatter-360: Generalizable 360^ Gaussian Splatting for Wide-baseline Panoramic ImagesCode2
FlashRNN: Optimizing Traditional RNNs on Modern HardwareCode2
Forest-of-Thought: Scaling Test-Time Compute for Enhancing LLM ReasoningCode2
Gramian Multimodal Representation Learning and AlignmentCode2
Zigzag Diffusion Sampling: Diffusion Models Can Self-Improve via Self-ReflectionCode2
EvalGIM: A Library for Evaluating Generative Image ModelsCode2
Tracr: Compiled Transformers as a Laboratory for InterpretabilityCode2
Mesoscopic Insights: Orchestrating Multi-scale & Hybrid Architecture for Image Manipulation LocalizationCode2
Joint Perception and Prediction for Autonomous Driving: A SurveyCode2
FlowAR: Scale-wise Autoregressive Image Generation Meets Flow MatchingCode2
SoftPatch+: Fully Unsupervised Anomaly Classification and SegmentationCode2
Superposition in Transformers: A Novel Way of Building Mixture of ExpertsCode2
TCPFormer: Learning Temporal Correlation with Implicit Pose Proxy for 3D Human Pose EstimationCode2
M-SENA: An Integrated Platform for Multimodal Sentiment AnalysisCode2
RL Tango: Reinforcing Generator and Verifier Together for Language ReasoningCode2
UAV-VLA: Vision-Language-Action System for Large Scale Aerial Mission GenerationCode2
TinyLLaVA-Video: A Simple Framework of Small-scale Large Multimodal Models for Video UnderstandingCode2
Leveraging ASIC AI Chips for Homomorphic EncryptionCode2
A Simple Aerial Detection Baseline of Multimodal Language ModelsCode2
Show:102550
← PrevPage 181 of 13232Next →