SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 11901–11950 of 474,278 papers

Title | Status | Hype
DALK: Dynamic Co-Augmentation of LLMs and KG to answer Alzheimer's Disease Questions with Scientific Literature | Code | 2
Initialization using Update Approximation is a Silver Bullet for Extremely Efficient Low-Rank Fine-Tuning | Code | 2
Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation | Code | 2
FairDiff: Fair Segmentation with Point-Image Diffusion | Code | 2
Meta-DETR: Image-Level Few-Shot Detection with Inter-Class Correlation Exploitation | Code | 2
Closed-Loop Supervised Fine-Tuning of Tokenized Traffic Models | Code | 2
AnySR: Realizing Image Super-Resolution as Any-Scale, Any-Resource | Code | 2
Enabling Efficient Equivariant Operations in the Fourier Basis via Gaunt Tensor Products | Code | 2
Playable Game Generation | Code | 2
ETAP: Event-based Tracking of Any Point | Code | 2
Moonbeam: A MIDI Foundation Model Using Both Absolute and Relative Music Attributes | Code | 2
A Survey on Hallucination in Large Vision-Language Models | Code | 2
MCVD: Masked Conditional Video Diffusion for Prediction, Generation, and Interpolation | Code | 2
#InsTag: Instruction Tagging for Analyzing Supervised Fine-tuning of Large Language Models | Code | 2
Scaling Large Motion Models with Million-Level Human Motions | Code | 2
MDFEND: Multi-domain Fake News Detection | Code | 2
RecLM: Recommendation Instruction Tuning | Code | 2
A Survey on 3D Egocentric Human Pose Estimation | Code | 2
MIBench: A Comprehensive Framework for Benchmarking Model Inversion Attack and Defense | Code | 2
CAR: Controllable Autoregressive Modeling for Visual Generation | Code | 2
Vision Foundation Models for Computed Tomography | Code | 2
Medical Image Segmentation with Domain Adaptation: A Survey | Code | 2
Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control Tasks | Code | 2
Newclid: A User-Friendly Replacement for AlphaGeometry | Code | 2
Fully-fused Multi-Layer Perceptrons on Intel Data Center GPUs | Code | 2
Day-Night Cross-domain Vehicle Re-identification | Code | 2
TabR: Tabular Deep Learning Meets Nearest Neighbors in 2023 | Code | 2
Active Prompting with Chain-of-Thought for Large Language Models | Code | 2
Continuous Diffusion Model for Language Modeling | Code | 2
XRec: Large Language Models for Explainable Recommendation | Code | 2
ShadowRefiner: Towards Mask-free Shadow Removal via Fast Fourier Transformer | Code | 2
MedAgent-Pro: Towards Evidence-based Multi-modal Medical Diagnosis via Reasoning Agentic Workflow | Code | 2
ConvSearch-R1: Enhancing Query Reformulation for Conversational Search with Reasoning via Reinforcement Learning | Code | 2
Progressive Representation Learning for Real-Time UAV Tracking | Code | 2
PG-SAG: Parallel Gaussian Splatting for Fine-Grained Large-Scale Urban Buildings Reconstruction via Semantic-Aware Grouping | Code | 2
GraphEdit: Large Language Models for Graph Structure Learning | Code | 2
BMInf: An Efficient Toolkit for Big Model Inference and Tuning | Code | 2
Simple Policy Optimization | Code | 2
Accelerating Large Language Model Decoding with Speculative Sampling | Code | 2
audino: A Modern Annotation Tool for Audio and Speech | Code | 2
Solving Dynamic Traveling Salesman Problems With Deep Reinforcement Learning | Code | 2
Structural Pruning for Diffusion Models | Code | 2
KernelWarehouse: Rethinking the Design of Dynamic Convolution | Code | 2
CrossPoint: Self-Supervised Cross-Modal Contrastive Learning for 3D Point Cloud Understanding | Code | 2
Physics Informed Distillation for Diffusion Models | Code | 2
SketchDeco: Decorating B&W Sketches with Colour | Code | 2
Mixture of Lookup Experts | Code | 2
Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents | Code | 2
Behavior Alignment: A New Perspective of Evaluating LLM-based Conversational Recommender Systems | Code | 2
UniMD: Towards Unifying Moment Retrieval and Temporal Action Detection | Code | 2
Page 239 of 9486