SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1990119950 of 474278 papers

TitleStatusHype
MetaMetrics-MT: Tuning Meta-Metrics for Machine Translation via Human Preference CalibrationCode1
TaxaBind: A Unified Embedding Space for Ecological ApplicationsCode1
PatternBoost: Constructions in Mathematics with a Little Help from AICode1
Rationale-Guided Retrieval Augmented Generation for Medical Question AnsweringCode1
A Lorentz-Equivariant Transformer for All of the LHCCode1
Automated Classification of Cell Shapes: A Comparative Evaluation of Shape DescriptorsCode1
Contrasting with Symile: Simple Model-Agnostic Representation Learning for Unlimited ModalitiesCode1
A Survey on Bundle Recommendation: Methods, Applications, and ChallengesCode1
Abstracted Shapes as Tokens -- A Generalizable and Interpretable Model for Time-series ClassificationCode1
Identify Backdoored Model in Federated Learning via Individual UnlearningCode1
Constant Acceleration FlowCode1
LogiCity: Advancing Neuro-Symbolic AI with Abstract Urban SimulationCode1
KAN-AD: Time Series Anomaly Detection with Kolmogorov-Arnold NetworksCode1
Beyond Utility: Evaluating LLM as RecommenderCode1
MIRFLEX: Music Information Retrieval Feature Library for ExtractionCode1
Nearest Neighbor Normalization Improves Multimodal RetrievalCode1
COSNet: A Novel Semantic Segmentation Network using Enhanced Boundaries in Cluttered ScenesCode1
Pedestrian Trajectory Prediction with Missing Data: Datasets, Imputation, and BenchmarkingCode1
Scaling Up Membership Inference: When and How Attacks Succeed on Large Language ModelsCode1
FRoundation: Are Foundation Models Ready for Face Recognition?Code1
Instruction-Tuning Llama-3-8B Excels in City-Scale Mobility PredictionCode1
EMGBench: Benchmarking Out-of-Distribution Generalization and Adaptation for ElectromyographyCode1
EDT: An Efficient Diffusion Transformer Framework Inspired by Human-like SketchingCode1
Enhancing Motion in Text-to-Video Generation with Decomposed Encoding and ConditioningCode1
Enhancing Chess Reinforcement Learning with Graph RepresentationCode1
GlotCC: An Open Broad-Coverage CommonCrawl Corpus and Pipeline for Minority LanguagesCode1
DiffPano: Scalable and Consistent Text to Panorama Generation with Spherical Epipolar-Aware DiffusionCode1
Constraint Back-translation Improves Complex Instruction Following of Large Language ModelsCode1
Reinforcement Learning Gradients as Vitamin for Online Finetuning Decision TransformersCode1
Graph Learning for Numeric PlanningCode1
Self-Ensembling Gaussian Splatting for Few-Shot Novel View SynthesisCode1
Muscles in Time: Learning to Understand Human Motion by Simulating Muscle ActivationsCode1
Automatically Learning Hybrid Digital Twins of Dynamical SystemsCode1
Local Superior Soups: A Catalyst for Model Merging in Cross-Silo Federated LearningCode1
RAGraph: A General Retrieval-Augmented Graph Learning FrameworkCode1
PSL: Rethinking and Improving Softmax Loss from Pairwise Perspective for RecommendationCode1
Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales?Code1
BitStack: Any-Size Compression of Large Language Models in Variable Memory EnvironmentsCode1
Text-DiFuse: An Interactive Multi-Modal Image Fusion Framework based on Text-modulated Diffusion ModelCode1
EZ-HOI: VLM Adaptation via Guided Prompt Learning for Zero-Shot HOI DetectionCode1
SeafloorAI: A Large-scale Vision-Language Dataset for Seafloor Geological SurveyCode1
Prospective Learning: Learning for a Dynamic FutureCode1
MLLA-UNet: Mamba-like Linear Attention in an Efficient U-Shape Model for Medical Image SegmentationCode1
AllClear: A Comprehensive Dataset and Benchmark for Cloud Removal in Satellite ImageryCode1
LLaMo: Large Language Model-based Molecular Graph AssistantCode1
AlphaTrans: A Neuro-Symbolic Compositional Approach for Repository-Level Code Translation and ValidationCode1
Understanding Generalizability of Diffusion Models Requires Rethinking the Hidden Gaussian StructureCode1
Zonal RL-RRT: Integrated RL-RRT Path Planning with Collision Probability and Zone ConnectivityCode1
SambaMixer: State of Health Prediction of Li-ion Batteries using Mamba State Space ModelsCode1
Show Me What and Where has Changed? Question Answering and Grounding for Remote Sensing Change DetectionCode1
Show:102550
← PrevPage 399 of 9486Next →