SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers, 248,326 code links, 4,818 tasks

Papers

Showing 4801-4850 of 177,340 papers

Title | Status | Hype
Efficient and Robust 2D-to-BEV Representation Learning via Geometry-guided Kernel Transformer | Code | 2
Mapping the Mind of an Instruction-based Image Editing using SMILE | Code | 2
MatteFormer: Transformer-Based Image Matting via Prior-Tokens | Code | 2
LLMGA: Multimodal Large Language Model based Generation Assistant | Code | 2
Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers | Code | 2
auton-survival: an Open-Source Package for Regression, Counterfactual Estimation, Evaluation and Phenotyping with Censored Time-to-Event Data | Code | 2
AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning | Code | 2
FastMoE: A Fast Mixture-of-Expert Training System | Code | 2
Driv3R: Learning Dense 4D Reconstruction for Autonomous Driving | Code | 2
Improving Image Restoration by Revisiting Global Information Aggregation | Code | 2
Efficient Face Super-Resolution via Wavelet-based Feature Enhancement Network | Code | 2
AIGCBench: Comprehensive Evaluation of Image-to-Video Content Generated by AI | Code | 2
FLAT: Chinese NER Using Flat-Lattice Transformer | Code | 2
RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models | Code | 2
SynthVLM: High-Efficiency and High-Quality Synthetic Data for Vision Language Models | Code | 2
Squeezeformer: An Efficient Transformer for Automatic Speech Recognition | Code | 2
SDFusion: Multimodal 3D Shape Completion, Reconstruction, and Generation | Code | 2
ControlVideo: Training-free Controllable Text-to-Video Generation | Code | 2
Open-Vocabulary Segmentation with Unpaired Mask-Text Supervision | Code | 2
Making Them Ask and Answer: Jailbreaking Large Language Models in Few Queries via Disguise and Reconstruction | Code | 2
Forgetting Transformer: Softmax Attention with a Forget Gate | Code | 2
Tool-Planner: Task Planning with Clusters across Multiple Tools | Code | 2
PID: Physics-Informed Diffusion Model for Infrared Image Generation | Code | 2
Adversarial Attacks and Defenses on Text-to-Image Diffusion Models: A Survey | Code | 2
LibMOON: A Gradient-based MultiObjective OptimizatioN Library in PyTorch | Code | 2
PrimeDepth: Efficient Monocular Depth Estimation with a Stable Diffusion Preimage | Code | 2
SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer | Code | 2
No More Adam: Learning Rate Scaling at Initialization is All You Need | Code | 2
DAMamba: Vision State Space Model with Dynamic Adaptive Scan | Code | 2
LongSpec: Long-Context Speculative Decoding with Efficient Drafting and Verification | Code | 2
NNSVS: A Neural Network-Based Singing Voice Synthesis Toolkit | Code | 2
MVBench: A Comprehensive Multi-modal Video Understanding Benchmark | Code | 2
VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset | Code | 2
Hierarchical Open-vocabulary Universal Image Segmentation | Code | 2
vid-TLDR: Training Free Token merging for Light-weight Video Transformer | Code | 2
Densely Connected Parameter-Efficient Tuning for Referring Image Segmentation | Code | 2
Guiding Language Models of Code with Global Context using Monitors | Code | 2
dKV-Cache: The Cache for Diffusion Language Models | Code | 2
Scaling Down, LiTting Up: Efficient Zero-Shot Listwise Reranking with Seq2seq Encoder-Decoder Models | Code | 2
Diffusion Models Beat GANs on Image Synthesis | Code | 2
Towards Stable Test-Time Adaptation in Dynamic Wild World | Code | 2
beeFormer: Bridging the Gap Between Semantic and Interaction Similarity in Recommender Systems | Code | 2
Measuring Style Similarity in Diffusion Models | Code | 2
LangBridge: Multilingual Reasoning Without Multilingual Supervision | Code | 2
LLMs for Knowledge Graph Construction and Reasoning: Recent Capabilities and Future Opportunities | Code | 2
LEACE: Perfect linear concept erasure in closed form | Code | 2
SEBERTNets: Sequence Enhanced BERT Networks for Event Entity Extraction Tasks Oriented to the Finance Field | Code | 2
Graph-enhanced Large Language Models in Asynchronous Plan Reasoning | Code | 2
CLIP-Mamba: CLIP Pretrained Mamba Models with OOD and Hessian Evaluation | Code | 2
An OpenMind for 3D medical vision self-supervised learning | Code | 2
Page 97 of 3547