SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 97519800 of 661570 papers

TitleStatusHype
Mamba3D: Enhancing Local Features for 3D Point Cloud Analysis via State Space ModelCode2
A Non-autoregressive Generation Framework for End-to-End Simultaneous Speech-to-Speech TranslationCode2
Multimodal Needle in a Haystack: Benchmarking Long-Context Capability of Multimodal Large Language ModelsCode2
VoxAct-B: Voxel-Based Acting and Stabilizing Policy for Bimanual ManipulationCode2
HumanRefiner: Benchmarking Abnormal Human Generation and Refining with Coarse-to-fine Pose-Reversible GuidanceCode2
Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language ModelsCode2
RAW-Adapter: Adapting Pre-trained Visual Model to Camera RAW ImagesCode2
MAPF-GPT: Imitation Learning for Multi-Agent Pathfinding at ScaleCode2
VAE Explainer: Supplement Learning Variational Autoencoders with Interactive VisualizationCode2
Compositional Video Generation as Flow EqualizationCode2
Stage-Wise Reward Shaping for Acrobatic Robots: A Constrained Multi-Objective Reinforcement Learning ApproachCode2
Balancing LoRA Performance and Efficiency with Simple Shard SharingCode2
PGN: The RNN's New Successor is Effective for Long-Range Time Series ForecastingCode2
Underwater Organism Color Enhancement via Color Code Decomposition, Adaptation and InterpolationCode2
GraphRouter: A Graph-based Router for LLM SelectionsCode2
Learning to Optimize for Mixed-Integer Non-linear Programming with Feasibility GuaranteesCode2
Spiking GS: Towards High-Accuracy and Low-Cost Surface Reconstruction via Spiking Neuron-based Gaussian SplattingCode2
PAPILLON: Privacy Preservation from Internet-based and Local Language Model EnsemblesCode2
Extended Mind TransformersCode2
Combining Induction and Transduction for Abstract ReasoningCode2
Adaptive Length Image Tokenization via Recurrent AllocationCode2
MC-LLaVA: Multi-Concept Personalized Vision-Language ModelCode2
Are large language models superhuman chemists?Code2
OpenAD: Open-World Autonomous Driving Benchmark for 3D Object DetectionCode2
Monet: Mixture of Monosemantic Experts for TransformersCode2
MMedPO: Aligning Medical Vision-Language Models with Clinical-Aware Multimodal Preference OptimizationCode2
Holmes-VAU: Towards Long-term Video Anomaly Understanding at Any GranularityCode2
FSFM: A Generalizable Face Security Foundation Model via Self-Supervised Facial Representation LearningCode2
OneKE: A Dockerized Schema-Guided LLM Agent-based Knowledge Extraction SystemCode2
R-SCoRe: Revisiting Scene Coordinate Regression for Robust Large-Scale Visual LocalizationCode2
DiffGraph: Heterogeneous Graph Diffusion ModelCode2
FlexCloud: Direct, Modular Georeferencing and Drift-Correction of Point Cloud MapsCode2
The Hidden Life of Tokens: Reducing Hallucination of Large Vision-Language Models via Visual Information SteeringCode2
SQL-o1: A Self-Reward Heuristic Dynamic Search Method for Text-to-SQLCode2
VaViM and VaVAM: Autonomous Driving through Video Generative ModelingCode2
Generalized Semantic Contrastive Learning via Embedding Side Information for Few-Shot Object DetectionCode2
A Survey on Industrial Anomalies SynthesisCode2
Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You ThinkCode2
InsTaG: Learning Personalized 3D Talking Head from Few-Second VideoCode2
AutoLUT: LUT-Based Image Super-Resolution with Automatic Sampling and Adaptive Residual LearningCode2
WritingBench: A Comprehensive Benchmark for Generative WritingCode2
SALAD: Skeleton-aware Latent Diffusion for Text-driven Motion Generation and EditingCode2
Advancing Language Model Reasoning through Reinforcement Learning and Inference ScalingCode2
MegaMath: Pushing the Limits of Open Math CorporaCode2
POMATO: Marrying Pointmap Matching with Temporal Motion for Dynamic 3D ReconstructionCode2
Dynam3D: Dynamic Layered 3D Tokens Empower VLM for Vision-and-Language NavigationCode2
GuardReasoner-VL: Safeguarding VLMs via Reinforced ReasoningCode2
μPC: Scaling Predictive Coding to 100+ Layer NetworksCode2
VisualQuality-R1: Reasoning-Induced Image Quality Assessment via Reinforcement Learning to RankCode2
CSTrack: Enhancing RGB-X Tracking via Compact Spatiotemporal FeaturesCode2
Show:102550
← PrevPage 196 of 13232Next →