SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers

Showing 7701-7750 of 661570 papers

Title | Status | Hype
MAT-SED: A Masked Audio Transformer with Masked-Reconstruction Based Pre-training for Sound Event Detection | Code | 2
AgentCourt: Simulating Court with Adversarial Evolvable Lawyer Agents | Code | 2
SLCA++: Unleash the Power of Sequential Fine-tuning for Continual Learning with Pre-training | Code | 2
GOReloc: Graph-based Object-Level Relocalization for Visual SLAM | Code | 2
Snuffy: Efficient Whole Slide Image Classifier | Code | 2
Surgical SAM 2: Real-time Segment Anything in Surgical Video by Efficient Frame Pruning | Code | 2
HAIR: Hypernetworks-based All-in-One Image Restoration | Code | 2
Text2BIM: Generating Building Models Using a Large Language Model-based Multi-Agent Framework | Code | 2
SustainDC: Benchmarking for Sustainable Data Center Control | Code | 2
BAPLe: Backdoor Attacks on Medical Foundational Models using Prompt Learning | Code | 2
ChemVLM: Exploring the Power of Multimodal Large Language Models in Chemistry Area | Code | 2
MedTsLLM: Leveraging LLMs for Multimodal Medical Time Series Analysis | Code | 2
Robust Semi-supervised Multimodal Medical Image Segmentation via Cross Modality Collaboration | Code | 2
Causal Agent based on Large Language Model | Code | 2
Parallel Speculative Decoding with Adaptive Draft Length | Code | 2
Improving Synthetic Image Detection Towards Generalization: An Image Transformation Perspective | Code | 2
ReCLIP++: Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation | Code | 2
BooW-VTON: Boosting In-the-Wild Virtual Try-On via Mask-Free Pseudo Data Training | Code | 2
Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment | Code | 2
Efficient and Scalable Point Cloud Generation with Sparse Point-Voxel Diffusion Models | Code | 2
Strategy Game-Playing with Size-Constrained State Abstraction | Code | 2
Post-Training Sparse Attention with Double Sparsity | Code | 2
SSL: A Self-similarity Loss for Improving Generative Image Super-resolution | Code | 2
FuXi Weather: A data-to-forecast machine learning system for global weather | Code | 2
Cross-view image geo-localization with Panorama-BEV Co-Retrieval Network | Code | 2
ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation | Code | 2
In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation | Code | 2
Multi-Scale and Detail-Enhanced Segment Anything Model for Salient Object Detection | Code | 2
MMRole: A Comprehensive Framework for Developing and Evaluating Multimodal Role-Playing Agents | Code | 2
wav2graph: A Framework for Supervised Learning Knowledge Graph from Speech | Code | 2
mbrs: A Library for Minimum Bayes Risk Decoding | Code | 2
Trans-Tokenization and Cross-lingual Vocabulary Transfers: Language Adaptation of LLMs for Low-Resource NLP | Code | 2
EfficientRAG: Efficient Retriever for Multi-Hop Question Answering | Code | 2
Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters | Code | 2
Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks | Code | 2
RL-ADN: A High-Performance Deep Reinforcement Learning Environment for Optimal Energy Storage Systems Dispatch in Active Distribution Networks | Code | 2
L4DR: LiDAR-4DRadar Fusion for Weather-Robust 3D Object Detection | Code | 2
CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications | Code | 2
PoseMamba: Monocular 3D Human Pose Estimation with Bidirectional Global-Local Spatio-Temporal State Space Model | Code | 2
TrafficGPT: An LLM Approach for Open-Set Encrypted Traffic Classification | Code | 2
500xCompressor: Generalized Prompt Compression for Large Language Models | Code | 2
TF-Locoformer: Transformer with Local Modeling by Convolution for Speech Separation and Enhancement | Code | 2
Fact Finder -- Enhancing Domain Expertise of Large Language Models by Incorporating Knowledge Graphs | Code | 2
GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI | Code | 2
LumiGauss: Relightable Gaussian Splatting in the Wild | Code | 2
DaCapo: a modular deep learning framework for scalable 3D image segmentation | Code | 2
Stem-JEPA: A Joint-Embedding Predictive Architecture for Musical Stem Compatibility Estimation | Code | 2
YOWOv3: An Efficient and Generalized Framework for Human Action Detection and Recognition | Code | 2
XMainframe: A Large Language Model for Mainframe Modernization | Code | 2
ReDel: A Toolkit for LLM-Powered Recursive Multi-Agent Systems | Code | 2