SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 9151–9200 of 661,570 papers

Title | Status | Hype
Llama-VITS: Enhancing TTS Synthesis with Semantic Awareness | Code | 2
Learning Embeddings with Centroid Triplet Loss for Object Identification in Robotic Grasping | Code | 2
AiSAQ: All-in-Storage ANNS with Product Quantization for DRAM-free Information Retrieval | Code | 2
YOLC: You Only Look Clusters for Tiny Object Detection in Aerial Images | Code | 2
Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks | Code | 2
GeoSynth: Contextually-Aware High-Resolution Satellite Image Synthesis | Code | 2
Optimization Methods for Personalizing Large Language Models through Retrieval Augmentation | Code | 2
ColorMNet: A Memory-based Deep Spatial-Temporal Feature Propagation Network for Video Colorization | Code | 2
Policy-Guided Diffusion | Code | 2
Hash3D: Training-free Acceleration for 3D Generation | Code | 2
Magic-Boost: Boost 3D Generation with Multi-View Conditioned Diffusion | Code | 2
Autonomous Evaluation and Refinement of Digital Agents | Code | 2
RhythmMamba: Fast Remote Physiological Measurement with Arbitrary Length Videos | Code | 2
GOAT-Bench: A Benchmark for Multi-Modal Lifelong Navigation | Code | 2
SmartControl: Enhancing ControlNet for Handling Rough Visual Conditions | Code | 2
Robust Confidence Intervals in Stereo Matching using Possibility Theory | Code | 2
Test-Time Adaptation with SaLIP: A Cascade of SAM and CLIP for Zero shot Medical Image Segmentation | Code | 2
VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding? | Code | 2
Test-Time Zero-Shot Temporal Action Localization | Code | 2
Evaluating Mathematical Reasoning Beyond Accuracy | Code | 2
NAF-DPM: A Nonlinear Activation-Free Diffusion Probabilistic Model for Document Enhancement | Code | 2
TIM: A Time Interval Machine for Audio-Visual Action Recognition | Code | 2
ATFNet: Adaptive Time-Frequency Ensembled Network for Long-term Time Series Forecasting | Code | 2
Dual-Camera Smooth Zoom on Mobile Phones | Code | 2
DinoBloom: A Foundation Model for Generalizable Cell Embeddings in Hematology | Code | 2
Joint Reconstruction of 3D Human and Object via Contact-Based Refinement Transformer | Code | 2
VMambaMorph: a Multi-Modality Deformable Image Registration Framework based on Visual State Space Model with Cross-Scan Module | Code | 2
UniMD: Towards Unifying Moment Retrieval and Temporal Action Detection | Code | 2
Fantastic Animals and Where to Find Them: Segment Any Marine Animal with Dual SAM | Code | 2
3D Building Reconstruction from Monocular Remote Sensing Images with Multi-level Supervisions | Code | 2
Rethinking Diffusion Model for Multi-Contrast MRI Super-Resolution | Code | 2
Gaussian Shading: Provable Performance-Lossless Image Watermarking for Diffusion Models | Code | 2
LHU-Net: A Light Hybrid U-Net for Cost-Efficient, High-Performance Volumetric Medical Image Segmentation | Code | 2
Diffusion Time-step Curriculum for One Image to 3D Generation | Code | 2
Bridging the Gap Between End-to-End and Two-Step Text Spotting | Code | 2
ALERT: A Comprehensive Benchmark for Assessing Large Language Models' Safety through Red Teaming | Code | 2
MACM: Utilizing a Multi-Agent System for Condition Mining in Solving Complex Mathematical Problems | Code | 2
Aligning Diffusion Models by Optimizing Human Utility | Code | 2
OmniColor: A Global Camera Pose Optimization Approach of LiDAR-360Camera Fusion for Colorizing Point Clouds | Code | 2
InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise Optimization | Code | 2
Learning Instance-Aware Correspondences for Robust Multi-Instance Point Cloud Registration in Cluttered Scenes | Code | 2
Diffusion-RWKV: Scaling RWKV-Like Architectures for Diffusion Models | Code | 2
MedIAnomaly: A comparative study of anomaly detection in medical images | Code | 2
Dynamic Prompt Optimizing for Text-to-Image Generation | Code | 2
Score identity Distillation: Exponentially Fast Distillation of Pretrained Diffusion Models for One-Step Generation | Code | 2
Identity Decoupling for Multi-Subject Personalization of Text-to-Image Models | Code | 2
ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editing | Code | 2
Extract, Define, Canonicalize: An LLM-based Framework for Knowledge Graph Construction | Code | 2
Hypothesis Generation with Large Language Models | Code | 2
Idea23D: Collaborative LMM Agents Enable 3D Model Generation from Interleaved Multimodal Inputs | Code | 2
Page 184 of 13232