SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers

Showing 8801-8850 of 661,570 papers

Title | Status | Hype
EndoDAC: Efficient Adapting Foundation Model for Self-Supervised Depth Estimation from Any Endoscopic Camera | Code | 2
Learning Multi-Agent Communication from Graph Modeling Perspective | Code | 2
ADA-Track++: End-to-End Multi-Camera 3D Multi-Object Tracking with Alternating Detection and Association | Code | 2
EchoTracker: Advancing Myocardial Point Tracking in Echocardiography | Code | 2
Rethinking Prior Information Generation with CLIP for Few-Shot Segmentation | Code | 2
Seal-Tools: Self-Instruct Tool Learning Dataset for Agent Tuning and Detailed Benchmark | Code | 2
GREEN: a lightweight architecture using learnable wavelets and Riemannian geometry for biomarker exploration | Code | 2
Autonomous clustering by fast find of mass and distance peaks | Code | 2
FreeVA: Offline MLLM as Training-Free Video Assistant | Code | 2
OverlapMamba: Novel Shift State Space Model for LiDAR-based Place Recognition | Code | 2
GaussianVTON: 3D Human Virtual Try-ON via Multi-Stage Gaussian Splatting Editing with Image Prompting | Code | 2
Evaluation of Retrieval-Augmented Generation: A Survey | Code | 2
AdFlush: A Real-World Deployable Machine Learning Solution for Effective Advertisement and Web Tracker Prevention | Code | 2
CDFormer: When Degradation Prediction Embraces Diffusion Model for Blind Image Super-Resolution | Code | 2
DiffTF++: 3D-aware Diffusion Transformer for Large-Vocabulary 3D Generation | Code | 2
Zero-Shot Tokenizer Transfer | Code | 2
Transferable Neural Wavefunctions for Solids | Code | 2
RAID: A Shared Benchmark for Robust Evaluation of Machine-Generated Text Detectors | Code | 2
Localizing Task Information for Improved Model Merging and Compression | Code | 2
PHUDGE: Phi-3 as Scalable Judge | Code | 2
Learnable Item Tokenization for Generative Recommendation | Code | 2
BoQ: A Place is Worth a Bag of Learnable Queries | Code | 2
IPDnet: A Universal Direct-Path IPD Estimation Network for Sound Source Localization | Code | 2
Piccolo2: General Text Embedding with Multi-task Hybrid Loss Training | Code | 2
MRSegmentator: Multi-Modality Segmentation of 40 Classes in MRI and CT | Code | 2
Self-Consistent Recursive Diffusion Bridge for Medical Image Translation | Code | 2
Context-Guided Spatial Feature Reconstruction for Efficient Semantic Segmentation | Code | 2
What Can Natural Language Processing Do for Peer Review? | Code | 2
Linearizing Large Language Models | Code | 2
GreedyViG: Dynamic Axial Graph Construction for Efficient Vision GNNs | Code | 2
Learning A Spiking Neural Network for Efficient Image Deraining | Code | 2
Time Evidence Fusion Network: Multi-source View in Long-Term Time Series Forecasting | Code | 2
PLeak: Prompt Leaking Attacks against Large Language Model Applications | Code | 2
Modality-agnostic Domain Generalizable Medical Image Segmentation by Multi-Frequency in Multi-Scale Attention | Code | 2
State-Free Inference of State-Space Models: The Transfer Function Approach | Code | 2
Memory Mosaics | Code | 2
Rethinking Efficient and Effective Point-based Networks for Event Camera Classification and Regression: EventMamba | Code | 2
OpenFactCheck: Building, Benchmarking Customized Fact-Checking Systems and Evaluating the Factuality of Claims and LLMs | Code | 2
HMT: Hierarchical Memory Transformer for Long Context Language Processing | Code | 2
MasterWeaver: Taming Editability and Face Identity for Personalized Text-to-Image Generation | Code | 2
Self-Supervised Learning of Time Series Representation via Diffusion Process and Imputation-Interpolation-Forecasting Mask | Code | 2
CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts | Code | 2
FloorSet -- a VLSI Floorplanning Dataset with Design Constraints of Real-World SoCs | Code | 2
LMVD: A Large-Scale Multimodal Vlog Dataset for Depression Detection in the Wild | Code | 2
Memory-Space Visual Prompting for Efficient Vision-Language Fine-Tuning | Code | 2
Boosting Multimodal Large Language Models with Visual Tokens Withdrawal for Rapid Inference | Code | 2
Outlier-robust Kalman Filtering through Generalised Bayes | Code | 2
HMANet: Hybrid Multi-Axis Aggregation Network for Image Super-Resolution | Code | 2
Fishing for Magikarp: Automatically Detecting Under-trained Tokens in Large Language Models | Code | 2
Harnessing the Power of MLLMs for Transferable Text-to-Image Person ReID | Code | 2
Page 177 of 13,232