SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1805118100 of 474278 papers

TitleStatusHype
Accurate Pocket Identification for Binding-Site-Agnostic DockingCode1
CITER: Collaborative Inference for Efficient Large Language Model Decoding with Token-Level RoutingCode1
SurvHive: a package to consistently access multiple survival-analysis packagesCode1
UNIP: Rethinking Pre-trained Attention Patterns for Infrared Semantic SegmentationCode1
Hier-EgoPack: Hierarchical Egocentric Video Understanding with Diverse Task PerspectivesCode1
From Uncertain to Safe: Conformal Fine-Tuning of Diffusion Models for Safe PDE ControlCode1
Mind the Gap: Evaluating Patch Embeddings from General-Purpose and Histopathology Foundation Models for Cell Segmentation and ClassificationCode1
Developing multilingual speech synthesis system for Ojibwe, Mi'kmaq, and MaliseetCode1
MATCNN: Infrared and Visible Image Fusion Method Based on Multi-scale CNN with Attention TransformerCode1
IncepFormerNet: A multi-scale multi-head attention network for SSVEP classificationCode1
Unified Spatial-Temporal Edge-Enhanced Graph Networks for Pedestrian Trajectory PredictionCode1
SimBEV: A Synthetic Multi-Task Multi-Sensor Driving Data Generation Tool and DatasetCode1
Adaptive Self-improvement LLM Agentic System for ML Library DevelopmentCode1
From Words to Collisions: LLM-Guided Evaluation and Adversarial Generation of Safety-Critical Driving ScenariosCode1
FinRLlama: A Solution to LLM-Engineered Signals Challenge at FinRL Contest 2024Code1
Analytical Lyapunov Function Discovery: An RL-based Generative ApproachCode1
deCIFer: Crystal Structure Prediction from Powder Diffraction Data using Autoregressive Language ModelsCode1
Activation-Informed Merging of Large Language ModelsCode1
Combinatorial Optimization Perspective based Framework for Multi-behavior RecommendationCode1
Transformers Boost the Performance of Decision Trees on Tabular Data across Sample SizesCode1
Improved Training Technique for Latent Consistency ModelsCode1
AdaSVD: Adaptive Singular Value Decomposition for Large Language ModelsCode1
VILP: Imitation Learning with Latent Video PlanningCode1
Simulating Rumor Spreading in Social Networks using LLM AgentsCode1
Efficient and Scalable Density Functional Theory Hamiltonian Prediction through Adaptive SparsityCode1
Progressive Binarization with Semi-Structured Pruning for LLMsCode1
ML-Dev-Bench: Comparative Analysis of AI Agents on ML development workflowsCode1
Detecting Backdoor Samples in Contrastive Language Image PretrainingCode1
Adversarial Reasoning at Jailbreaking TimeCode1
UASTHN: Uncertainty-Aware Deep Homography Estimation for UAV Satellite-Thermal Geo-localizationCode1
Joint Localization and Activation Editing for Low-Resource Fine-TuningCode1
C codegen considered unnecessary: go directly to binary, do not pass C. Compilation of Julia code for deployment in model-based engineeringCode1
Logits are All We Need to Adapt Closed ModelsCode1
Fine-Tuning Discrete Diffusion Models with Policy Gradient MethodsCode1
A Wearable Device Dataset for Mental Health Assessment Using Laser Doppler Flowmetry and Fluorescence Spectroscopy SensorsCode1
VidSketch: Hand-drawn Sketch-Driven Video Generation with Diffusion ControlCode1
Robust-LLaVA: On the Effectiveness of Large-Scale Robust Image Encoders for Multi-modal Large Language ModelsCode1
Learning to Generate Unit Tests for Automated DebuggingCode1
FastKV: KV Cache Compression for Fast Long-Context Processing with Token-Selective PropagationCode1
A Probabilistic Inference Approach to Inference-Time Scaling of LLMs using Particle-Based Monte Carlo MethodsCode1
Trajectory World Models for Heterogeneous EnvironmentsCode1
SeizeIT2: Wearable Dataset Of Patients With Focal EpilepsyCode1
GNN-DT: Graph Neural Network Enhanced Decision Transformer for Efficient Optimization in Dynamic EnvironmentsCode1
Polynomial, trigonometric, and tropical activationsCode1
COVE: COntext and VEracity prediction for out-of-context imagesCode1
Partial Channel Network: Compute Fewer, Perform BetterCode1
Evolving Symbolic 3D Visual Grounder with Weakly Supervised ReflectionCode1
Learning Efficient Positional Encodings with Graph Neural NetworksCode1
FSPGD: Rethinking Black-box Attacks on Semantic SegmentationCode1
SimPER: A Minimalist Approach to Preference Alignment without HyperparametersCode1
Show:102550
← PrevPage 362 of 9486Next →