SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 64516500 of 661570 papers

TitleStatusHype
VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing ControlCode2
SoftPatch+: Fully Unsupervised Anomaly Classification and SegmentationCode2
Edicho: Consistent Image Editing in the WildCode2
Efficient Parallel Genetic Algorithm for Perturbed Substructure Optimization in Complex NetworkCode2
MaskGaussian: Adaptive 3D Gaussian Representation from Probabilistic MasksCode2
Natural Language Fine-TuningCode2
DEGSTalk: Decomposed Per-Embedding Gaussian Fields for Hair-Preserving Talking Face SynthesisCode2
OneKE: A Dockerized Schema-Guided LLM Agent-based Knowledge Extraction SystemCode2
MaIR: A Locality- and Continuity-Preserving Mamba for Image RestorationCode2
Learning an Adaptive and View-Invariant Vision Transformer for Real-Time UAV TrackingCode2
From Generalist to Specialist: A Survey of Large Language Models for ChemistryCode2
GSplatLoc: Ultra-Precise Camera Localization via 3D Gaussian SplattingCode2
MBQ: Modality-Balanced Quantization for Large Vision-Language ModelsCode2
Towards Open-Vocabulary Remote Sensing Image Semantic SegmentationCode2
ETTA: Elucidating the Design Space of Text-to-Audio ModelsCode2
SUTrack: Towards Simple and Unified Single Object TrackingCode2
RecLM: Recommendation Instruction TuningCode2
Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task AlignmentCode2
WeatherGS: 3D Scene Reconstruction in Adverse Weather Conditions via Gaussian SplattingCode2
CGCOD: Class-Guided Camouflaged Object DetectionCode2
Simultaneously Recovering Multi-Person Meshes and Multi-View Cameras with Human SemanticsCode2
EvalMuse-40K: A Reliable and Fine-Grained Benchmark with Comprehensive Human Annotations for Text-to-Image Generation Model EvaluationCode2
ZenSVI: An Open-Source Software for the Integrated Acquisition, Processing and Analysis of Street View Imagery Towards Scalable Urban ScienceCode2
Long-Form Speech Generation with Spoken Language ModelsCode2
Video-Panda: Parameter-efficient Alignment for Encoder-free Video-Language ModelsCode2
3DGraphLLM: Combining Semantic Graphs and Large Language Models for 3D Scene UnderstandingCode2
Token-Budget-Aware LLM ReasoningCode2
Dual Conditioned Motion Diffusion for Pose-Based Video Anomaly DetectionCode2
Reasoning to Attend: Try to Understand How <SEG> Token WorksCode2
Large Language Model Safety: A Holistic SurveyCode2
B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught ReasonersCode2
ActiveGS: Active Scene Reconstruction Using Gaussian SplattingCode2
Scenario-Wise Rec: A Multi-Scenario Recommendation BenchmarkCode2
Cross-View Referring Multi-Object TrackingCode2
Token Statistics Transformer: Linear-Time Attention via Variational Rate ReductionCode2
Evaluation of Bio-Inspired Models under Different Learning Settings For Energy Efficiency in Network Traffic PredictionCode2
DreamFit: Garment-Centric Human Generation via a Lightweight Anything-Dressing EncoderCode2
Reconstructing People, Places, and CamerasCode2
xPatch: Dual-Stream Time Series Forecasting with Exponential Seasonal-Trend DecompositionCode2
Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length GeneralizationCode2
Guided Real Image Dehazing using YCbCr Color SpaceCode2
Evaluating LLM Reasoning in the Operations Research Domain with ORQACode2
Anchor3DLane++: 3D Lane Detection via Sample-Adaptive Sparse 3D Anchor RegressionCode2
An OpenMind for 3D medical vision self-supervised learningCode2
Pinwheel-shaped Convolution and Scale-based Dynamic Loss for Infrared Small Target DetectionCode2
OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-TuningCode2
Where am I? Cross-View Geo-localization with Natural Language DescriptionsCode2
Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow MatchingCode2
WPMixer: Efficient Multi-Resolution Mixing for Long-Term Time Series ForecastingCode2
A Generalizable Anomaly Detection Method in Dynamic GraphsCode2
Show:102550
← PrevPage 130 of 13232Next →