SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 76267650 of 474278 papers

TitleStatusHype
Towards Real-world Event-guided Low-light Video Enhancement and DeblurringCode2
SAM & SAM 2 in 3D Slicer: SegmentWithSAM Extension for Annotating Medical ImagesCode2
HPT++: Hierarchically Prompting Vision-Language Models with Multi-Granularity Knowledge Generation and Improved Structure ModelingCode2
LLM Defenses Are Not Robust to Multi-Turn Human Jailbreaks YetCode2
NeuroLM: A Universal Multi-task Foundation Model for Bridging the Gap between Language and EEG SignalsCode2
CURLoRA: Stable LLM Continual Fine-Tuning and Catastrophic Forgetting MitigationCode2
Training-Free Activation Sparsity in Large Language ModelsCode2
A Practitioner's Guide to Continual Multimodal PretrainingCode2
GR-MG: Leveraging Partially Annotated Data via Multi-Modal Goal-Conditioned PolicyCode2
MLR-Copilot: Autonomous Machine Learning Research based on Large Language Models AgentsCode2
Video-CCAM: Enhancing Video-Language Understanding with Causal Cross-Attention Masks for Short and Long VideosCode2
LLMs as Zero-shot Graph Learners: Alignment of GNN Representations with LLM Token EmbeddingsCode2
MSVM-UNet: Multi-Scale Vision Mamba UNet for Medical Image SegmentationCode2
MobileQuant: Mobile-friendly Quantization for On-device Language ModelsCode2
SceneDreamer360: Text-Driven 3D-Consistent Scene Generation with Panoramic Gaussian SplattingCode2
TripleMixer: A 3D Point Cloud Denoising Model for Adverse WeatherCode2
3D-RCNet: Learning from Transformer to Build a 3D Relational ConvNet for Hyperspectral Image ClassificationCode2
Segment Any Mesh: Zero-shot Mesh Part Segmentation via Lifting Segment Anything 2 to 3DCode2
DualAnoDiff: Dual-Interrelated Diffusion Model for Few-Shot Anomaly Image GenerationCode2
SpeechCraft: A Fine-grained Expressive Speech Dataset with Natural Language DescriptionCode2
WildFusion: Individual Animal Identification with Calibrated Similarity FusionCode2
Data-Driven Parametrization of Molecular Mechanics Force Fields for Expansive Chemical Space CoverageCode2
Power Scheduler: A Batch Size and Token Number Agnostic Learning Rate SchedulerCode2
LLM-PBE: Assessing Data Privacy in Large Language ModelsCode2
DeTPP: Leveraging Object Detection for Robust Long-Horizon Event PredictionCode2
Show:102550
← PrevPage 306 of 18972Next →