SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 89519000 of 661570 papers

TitleStatusHype
Learning Transferable Negative Prompts for Out-of-Distribution DetectionCode2
ClickDiffusion: Harnessing LLMs for Interactive Precise Image EditingCode2
Joint Reconstruction of 3D Human and Object via Contact-Based Refinement TransformerCode2
LAKE-RED: Camouflaged Images Generation by Latent Background Knowledge Retrieval-Augmented DiffusionCode2
Learning Instance-Aware Correspondences for Robust Multi-Instance Point Cloud Registration in Cluttered ScenesCode2
Test-Time Adaptation with SaLIP: A Cascade of SAM and CLIP for Zero shot Medical Image SegmentationCode2
SafeGen: Mitigating Sexually Explicit Content Generation in Text-to-Image ModelsCode2
MindBridge: A Cross-Subject Brain Decoding FrameworkCode2
Content-Adaptive Non-Local Convolution for Remote Sensing PansharpeningCode2
Inheritune: Training Smaller Yet More Attentive Language ModelsCode2
Latent Guard: a Safety Framework for Text-to-image GenerationCode2
TextHawk: Exploring Efficient Fine-Grained Perception of Multimodal Large Language ModelsCode2
LLM-Seg: Bridging Image Segmentation and Large Language Model ReasoningCode2
Confidential Federated ComputationsCode2
PuzzleFusion++: Auto-agglomerative 3D Fracture Assembly by Denoise and VerifyCode2
Point-In-Context: Understanding Point Cloud via In-Context LearningCode2
MambaMOS: LiDAR-based 3D Moving Object Segmentation with Motion-aware State Space ModelCode2
MAexp: A Generic Platform for RL-based Multi-Agent ExplorationCode2
Improving Sequential Recommendations with LLMsCode2
Retrieval-Augmented Generation-based Relation ExtractionCode2
TI2V-Zero: Zero-Shot Image Conditioning for Text-to-Video Diffusion ModelsCode2
Classifier-guided neural blind deconvolution: a physics-informed denoising module for bearing fault diagnosis under heavy noiseCode2
Kangaroo: Lossless Self-Speculative Decoding via Double Early ExitingCode2
3DHumanGAN: 3D-Aware Human Image Generation with 3D Pose MappingCode2
Causal Evaluation of Language ModelsCode2
Joint Signal Detection and Automatic Modulation Classification via Deep LearningCode2
PLeak: Prompt Leaking Attacks against Large Language Model ApplicationsCode2
Transcriptomics-guided Slide Representation Learning in Computational PathologyCode2
End-to-End Full-Page Optical Music Recognition for Pianoform Sheet MusicCode2
Identifying Functionally Important Features with End-to-End Sparse Dictionary LearningCode2
RoGs: Large Scale Road Surface Reconstruction with Meshgrid GaussianCode2
Advancing Spiking Neural Networks for Sequential Modeling with Central Pattern GeneratorsCode2
LM4LV: A Frozen Large Language Model for Low-level Vision TasksCode2
Sparse maximal update parameterization: A holistic approach to sparse training dynamicsCode2
Frustratingly Easy Test-Time Adaptation of Vision-Language ModelsCode2
Instruct-ReID++: Towards Universal Purpose Instruction-Guided Person Re-identificationCode2
Scaling Laws and Compute-Optimal Training Beyond Fixed Training DurationsCode2
Benchmarking and Improving Detail Image CaptionCode2
WorldGUI: An Interactive Benchmark for Desktop GUI Automation from Any Starting PointCode2
Medformer: A Multi-Granularity Patching Transformer for Medical Time-Series ClassificationCode2
TabPedia: Towards Comprehensive Visual Table Understanding with Concept SynergyCode2
DroneVis: Versatile Computer Vision Library for DronesCode2
Neural Optimal Transport with Lagrangian CostsCode2
Generative Pre-trained Speech Language Model with Efficient Hierarchical TransformerCode2
Parameter-Inverted Image Pyramid NetworksCode2
MoE Jetpack: From Dense Checkpoints to Adaptive Mixture of Experts for Vision TasksCode2
FRAG: Frequency Adapting Group for Diffusion Video EditingCode2
Towards Lifelong Learning of Large Language Models: A SurveyCode2
Needle In A Multimodal HaystackCode2
DafnyBench: A Benchmark for Formal Software VerificationCode2
Show:102550
← PrevPage 180 of 13232Next →