SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1260112650 of 177340 papers

TitleStatusHype
ETAP: Event-based Tracking of Any PointCode2
Moonbeam: A MIDI Foundation Model Using Both Absolute and Relative Music AttributesCode2
A Survey on Hallucination in Large Vision-Language ModelsCode2
MCVD: Masked Conditional Video Diffusion for Prediction, Generation, and InterpolationCode2
#InsTag: Instruction Tagging for Analyzing Supervised Fine-tuning of Large Language ModelsCode2
Scaling Large Motion Models with Million-Level Human MotionsCode2
MDFEND: Multi-domain Fake News DetectionCode2
RecLM: Recommendation Instruction TuningCode2
A Survey on 3D Egocentric Human Pose EstimationCode2
MIBench: A Comprehensive Framework for Benchmarking Model Inversion Attack and DefenseCode2
CAR: Controllable Autoregressive Modeling for Visual GenerationCode2
Vision Foundation Models for Computed TomographyCode2
Medical Image Segmentation with Domain Adaptation: A SurveyCode2
Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control TasksCode2
Newclid: A User-Friendly Replacement for AlphaGeometryCode2
Fully-fused Multi-Layer Perceptrons on Intel Data Center GPUsCode2
Day-Night Cross-domain Vehicle Re-identificationCode2
TabR: Tabular Deep Learning Meets Nearest Neighbors in 2023Code2
Active Prompting with Chain-of-Thought for Large Language ModelsCode2
Continuous Diffusion Model for Language ModelingCode2
XRec: Large Language Models for Explainable RecommendationCode2
ShadowRefiner: Towards Mask-free Shadow Removal via Fast Fourier TransformerCode2
MedAgent-Pro: Towards Evidence-based Multi-modal Medical Diagnosis via Reasoning Agentic WorkflowCode2
ConvSearch-R1: Enhancing Query Reformulation for Conversational Search with Reasoning via Reinforcement LearningCode2
Progressive Representation Learning for Real-Time UAV TrackingCode2
PG-SAG: Parallel Gaussian Splatting for Fine-Grained Large-Scale Urban Buildings Reconstruction via Semantic-Aware GroupingCode2
GraphEdit: Large Language Models for Graph Structure LearningCode2
BMInf: An Efficient Toolkit for Big Model Inference and TuningCode2
Simple Policy OptimizationCode2
Accelerating Large Language Model Decoding with Speculative SamplingCode2
audino: A Modern Annotation Tool for Audio and SpeechCode2
Solving Dynamic Traveling Salesman Problems With Deep Reinforcement LearningCode2
Structural Pruning for Diffusion ModelsCode2
KernelWarehouse: Rethinking the Design of Dynamic ConvolutionCode2
CrossPoint: Self-Supervised Cross-Modal Contrastive Learning for 3D Point Cloud UnderstandingCode2
Physics Informed Distillation for Diffusion ModelsCode2
SketchDeco: Decorating B&W Sketches with ColourCode2
Mixture of Lookup ExpertsCode2
Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web AgentsCode2
Behavior Alignment: A New Perspective of Evaluating LLM-based Conversational Recommender SystemsCode2
UniMD: Towards Unifying Moment Retrieval and Temporal Action DetectionCode2
MixVPR: Feature Mixing for Visual Place RecognitionCode2
Shape Preserving Facial Landmarks with Graph Attention NetworksCode2
Open-Vocabulary Camouflaged Object SegmentationCode2
Noise Modeling in One Hour: Minimizing Preparation Efforts for Self-supervised Low-Light RAW Image DenoisingCode2
Music Source Separation Based on a Lightweight Deep Learning Framework (DTTNET: DUAL-PATH TFC-TDF UNET)Code2
MM-Vet: Evaluating Large Multimodal Models for Integrated CapabilitiesCode2
The Truth is in There: Improving Reasoning in Language Models with Layer-Selective Rank ReductionCode2
Seg-R1: Segmentation Can Be Surprisingly Simple with Reinforcement LearningCode2
Leveraging Passage Embeddings for Efficient Listwise Reranking with Large Language ModelsCode2
Show:102550
← PrevPage 253 of 3547Next →