SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 21251–21300 of 474278 papers

Title | Status | Hype
Symbolic Regression with a Learned Concept Library | Code | 1
MotionTTT: 2D Test-Time-Training Motion Estimation for 3D Motion Corrected MRI | Code | 1
Learning Discrete World Models for Heuristic Search | Code | 1
VernaCopter: Disambiguated Natural-Language-Driven Robot via Formal Specifications | Code | 1
From FDG to PSMA: A Hitchhiker's Guide to Multitracer, Multicenter Lesion Segmentation in PET/CT Imaging | Code | 1
Informative Subgraphs Aware Masked Auto-Encoder in Dynamic Graphs | Code | 1
Implicit Neural Representations with Fourier Kolmogorov-Arnold Networks | Code | 1
Block-Attention for Efficient RAG | Code | 1
MHAD: Multimodal Home Activity Dataset with Multi-Angle Videos and Synchronized Physiological Signals | Code | 1
SAM-OCTA2: Layer Sequence OCTA Segmentation with Fine-tuned Segment Anything Model 2 | Code | 1
Real-world Adversarial Defense against Patch Attacks based on Diffusion Model | Code | 1
Detecting Looted Archaeological Sites from Satellite Image Time Series | Code | 1
WeatherReal: A Benchmark Based on In-Situ Observations for Evaluating Weather Models | Code | 1
Federated Learning with Quantum Computing and Fully Homomorphic Encryption: A Novel Computing Paradigm Shift in Privacy-Preserving ML | Code | 1
Effective Pre-Training of Audio Transformers for Sound Event Detection | Code | 1
Associate Everything Detected: Facilitating Tracking-by-Detection to the Unknown | Code | 1
GenMapping: Unleashing the Potential of Inverse Perspective Mapping for Robust Online HD Map Construction | Code | 1
B4: Towards Optimal Assessment of Plausible Code Solutions with Plausible Tests | Code | 1
WheelPoser: Sparse-IMU Based Body Pose Estimation for Wheelchair Users | Code | 1
Anytime Continual Learning for Open Vocabulary Classification | Code | 1
ChangeChat: An Interactive Model for Remote Sensing Change Analysis via Multimodal Instruction Tuning | Code | 1
AnyBipe: An End-to-End Framework for Training and Deploying Bipedal Robots Guided by Large Language Models | Code | 1
Towards safe and tractable Gaussian process-based MPC: Efficient sampling within a sequential quadratic programming framework | Code | 1
Explaining Datasets in Words: Statistical Models with Natural Language Parameters | Code | 1
ODAQ: Open Dataset of Audio Quality - Benchmark on GitHub | Code | 1
TabKANet: Tabular Data Modeling with Kolmogorov-Arnold Network and Transformer | Code | 1
DomURLs_BERT: Pre-trained BERT-based Model for Malicious Domains and URLs Detection and Classification | Code | 1
xTED: Cross-Domain Adaptation via Diffusion-Based Trajectory Editing | Code | 1
Data Efficient Child-Adult Speaker Diarization with Simulated Conversations | Code | 1
Uncertainty Estimation by Density Aware Evidential Deep Learning | Code | 1
L3Cube-IndicQuest: A Benchmark Question Answering Dataset for Evaluating Knowledge of LLMs in Indic Context | Code | 1
AIPO: Improving Training Objective for Iterative Preference Optimization | Code | 1
DiffFAS: Face Anti-Spoofing via Generative Diffusion Models | Code | 1
PINNfluence: Influence Functions for Physics-Informed Neural Networks | Code | 1
What Should We Engineer in Prompts? Training Humans in Requirement-Driven LLM Use | Code | 1
Causal Transformer for Fusion and Pose Estimation in Deep Visual Inertial Odometry | Code | 1
Evaluating the Quality of Brain MRI Generators | Code | 1
ReCLAP: Improving Zero Shot Audio Classification by Describing Sounds | Code | 1
Learning incomplete factorization preconditioners for GMRES | Code | 1
Improving Virtual Try-On with Garment-focused Diffusion Models | Code | 1
WirelessAgent: Large Language Model Agents for Intelligent Wireless Networks | Code | 1
Estimating Atmospheric Variables from Digital Typhoon Satellite Images via Conditional Denoising Diffusion Models | Code | 1
Deep learning and machine learning techniques for head pose estimation: a survey | Code | 1
AudioBERT: Audio Knowledge Augmented Language Model | Code | 1
InvDesFlow: An AI-driven materials inverse design workflow to explore possible high-temperature superconductors | Code | 1
Fine-tuning Large Language Models for Entity Matching | Code | 1
SimMAT: Exploring Transferability from Vision Foundation Models to Any Image Modality | Code | 1
Click2Mask: Local Editing with Dynamic Mask Generation | Code | 1
Do Vision Foundation Models Enhance Domain Generalization in Medical Image Segmentation? | Code | 1
meds_reader: A fast and efficient EHR processing library | Code | 1
Page 426 of 9486