SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 62516300 of 177340 papers

TitleStatusHype
Adaptive Kalman-Informed TransformerCode2
PnPXAI: A Universal XAI Framework Providing Automatic Explanations Across Diverse Modalities and ModelsCode2
Learning Adaptive Parallel Reasoning with Language ModelsCode2
FuXi Weather: A data-to-forecast machine learning system for global weatherCode2
Starling: An I/O-Efficient Disk-Resident Graph Index Framework for High-Dimensional Vector Similarity Search on Data SegmentCode2
SAEs Are Good for Steering -- If You Select the Right FeaturesCode2
TurtleBench: Evaluating Top Language Models via Real-World Yes/No PuzzlesCode2
Forensics Adapter: Unleashing CLIP for Generalizable Face Forgery DetectionCode2
RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language ModelsCode2
Adaptive Latent-Space Constraints in Personalized FLCode2
UV-SAM: Adapting Segment Anything Model for Urban Village IdentificationCode2
SUBLLM: A Novel Efficient Architecture with Token Sequence Subsampling for LLMCode2
MedCLIP: Contrastive Learning from Unpaired Medical Images and TextCode2
AnimateAnything: Fine-Grained Open Domain Image Animation with Motion GuidanceCode2
CoMPaSS: Enhancing Spatial Understanding in Text-to-Image Diffusion ModelsCode2
SPECTRE: An FFT-Based Efficient Drop-In Replacement to Self-Attention for Long ContextsCode2
Memorizing TransformersCode2
Practical and Asymptotically Optimal Quantization of High-Dimensional Vectors in Euclidean Space for Approximate Nearest Neighbor SearchCode2
TechGPT-2.0: A large language model project to solve the task of knowledge graph constructionCode2
Composed Image Retrieval for Remote SensingCode2
VideoFlow: Exploiting Temporal Cues for Multi-frame Optical Flow EstimationCode2
Diffusion Policies as an Expressive Policy Class for Offline Reinforcement LearningCode2
Zero Bubble Pipeline ParallelismCode2
A Transformer-based representation-learning model with unified processing of multimodal input for clinical diagnosticsCode2
V2XPnP: Vehicle-to-Everything Spatio-Temporal Fusion for Multi-Agent Perception and PredictionCode2
Transferable Neural Wavefunctions for SolidsCode2
MoM: Linear Sequence Modeling with Mixture-of-MemoriesCode2
Medical Hallucinations in Foundation Models and Their Impact on HealthcareCode2
Towards Robust Multi-tab Website FingerprintingCode2
Self-Training with Direct Preference Optimization Improves Chain-of-Thought ReasoningCode2
Tip-Adapter: Training-free Adaption of CLIP for Few-shot ClassificationCode2
Accurate RNA 3D structure prediction using a language model-based deep learning approachCode2
Guidance with Spherical Gaussian Constraint for Conditional DiffusionCode2
TACO: Topics in Algorithmic COde generation datasetCode2
Data-Driven Parametrization of Molecular Mechanics Force Fields for Expansive Chemical Space CoverageCode2
Steerable Scene Generation with Post Training and Inference-Time SearchCode2
VFIMamba: Video Frame Interpolation with State Space ModelsCode2
CodeSteer: Symbolic-Augmented Language Models via Code/Text GuidanceCode2
PyGRF: An improved Python Geographical Random Forest model and case studies in public health and natural disastersCode2
geomstats: a Python Package for Riemannian Geometry in Machine LearningCode2
XCube: Large-Scale 3D Generative Modeling using Sparse Voxel HierarchiesCode2
PAM: Prompting Audio-Language Models for Audio Quality AssessmentCode2
Geometry-Complete Diffusion for 3D Molecule Generation and OptimizationCode2
GeoLRM: Geometry-Aware Large Reconstruction Model for High-Quality 3D Gaussian GenerationCode2
Libra: Building Decoupled Vision System on Large Language ModelsCode2
ALE-Bench: A Benchmark for Long-Horizon Objective-Driven Algorithm EngineeringCode2
EFFOcc: A Minimal Baseline for EFficient Fusion-based 3D Occupancy NetworkCode2
Towards Open Vocabulary Learning: A SurveyCode2
SuperFusion: Multilevel LiDAR-Camera Fusion for Long-Range HD Map GenerationCode2
BBTv2: Towards a Gradient-Free Future with Large Language ModelsCode2
Show:102550
← PrevPage 126 of 3547Next →