SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 72517300 of 661570 papers

TitleStatusHype
H3WB: Human3.6M 3D WholeBody Dataset and BenchmarkCode2
PointPillars: Fast Encoders for Object Detection from Point CloudsCode2
VTimeLLM: Empower LLM to Grasp Video MomentsCode2
SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator TrajectoriesCode2
Correlation-Guided Query-Dependency Calibration for Video Temporal GroundingCode2
Surg-3M: A Dataset and Foundation Model for Perception in Surgical SettingsCode2
Graph Diffusion Transformers for Multi-Conditional Molecular GenerationCode2
When and why vision-language models behave like bags-of-words, and what to do about it?Code2
CM-TTS: Enhancing Real Time Text-to-Speech Synthesis Efficiency through Weighted Samplers and Consistency ModelsCode2
FlexiDreamer: Single Image-to-3D Generation with FlexiCubesCode2
USP: Unified Self-Supervised Pretraining for Image Generation and UnderstandingCode2
Alpha^2: Discovering Logical Formulaic Alphas using Deep Reinforcement LearningCode2
FastInst: A Simple Query-Based Model for Real-Time Instance SegmentationCode2
Efficient and Robust 2D-to-BEV Representation Learning via Geometry-guided Kernel TransformerCode2
Mapping the Mind of an Instruction-based Image Editing using SMILECode2
MatteFormer: Transformer-Based Image Matting via Prior-TokensCode2
LLMGA: Multimodal Large Language Model based Generation AssistantCode2
Hydra: Bidirectional State Space Models Through Generalized Matrix MixersCode2
auton-survival: an Open-Source Package for Regression, Counterfactual Estimation, Evaluation and Phenotyping with Censored Time-to-Event DataCode2
AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific TuningCode2
FastMoE: A Fast Mixture-of-Expert Training SystemCode2
Driv3R: Learning Dense 4D Reconstruction for Autonomous DrivingCode2
Improving Image Restoration by Revisiting Global Information AggregationCode2
Efficient Face Super-Resolution via Wavelet-based Feature Enhancement NetworkCode2
AIGCBench: Comprehensive Evaluation of Image-to-Video Content Generated by AICode2
FLAT: Chinese NER Using Flat-Lattice TransformerCode2
RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language ModelsCode2
SynthVLM: High-Efficiency and High-Quality Synthetic Data for Vision Language ModelsCode2
Squeezeformer: An Efficient Transformer for Automatic Speech RecognitionCode2
SDFusion: Multimodal 3D Shape Completion, Reconstruction, and GenerationCode2
ControlVideo: Training-free Controllable Text-to-Video GenerationCode2
Open-Vocabulary Segmentation with Unpaired Mask-Text SupervisionCode2
Making Them Ask and Answer: Jailbreaking Large Language Models in Few Queries via Disguise and ReconstructionCode2
Forgetting Transformer: Softmax Attention with a Forget GateCode2
Tool-Planner: Task Planning with Clusters across Multiple ToolsCode2
PID: Physics-Informed Diffusion Model for Infrared Image GenerationCode2
Adversarial Attacks and Defenses on Text-to-Image Diffusion Models: A SurveyCode2
LibMOON: A Gradient-based MultiObjective OptimizatioN Library in PyTorchCode2
PrimeDepth: Efficient Monocular Depth Estimation with a Stable Diffusion PreimageCode2
SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion TransformerCode2
No More Adam: Learning Rate Scaling at Initialization is All You NeedCode2
DAMamba: Vision State Space Model with Dynamic Adaptive ScanCode2
LongSpec: Long-Context Speculative Decoding with Efficient Drafting and VerificationCode2
NNSVS: A Neural Network-Based Singing Voice Synthesis ToolkitCode2
MVBench: A Comprehensive Multi-modal Video Understanding BenchmarkCode2
VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and DatasetCode2
Hierarchical Open-vocabulary Universal Image SegmentationCode2
vid-TLDR: Training Free Token merging for Light-weight Video TransformerCode2
Densely Connected Parameter-Efficient Tuning for Referring Image SegmentationCode2
Guiding Language Models of Code with Global Context using MonitorsCode2
Show:102550
← PrevPage 146 of 13232Next →