SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 1110111150 of 661570 papers

TitleStatusHype
Adapting a Language Model While Preserving its General KnowledgeCode2
DecompDiff: Diffusion Models with Decomposed Priors for Structure-Based Drug DesignCode2
NaturalSpeech: End-to-End Text to Speech Synthesis with Human-Level QualityCode2
General Detection-based Text Line RecognitionCode2
Prior Does Matter: Visual Navigation via Denoising Diffusion Bridge ModelsCode2
FunDiff: Diffusion Models over Function Spaces for Physics-Informed Generative ModelingCode2
Denoising Diffusion Models for Plug-and-Play Image RestorationCode2
OmniColor: A Global Camera Pose Optimization Approach of LiDAR-360Camera Fusion for Colorizing Point CloudsCode2
RemDet: Rethinking Efficient Model Design for UAV Object DetectionCode2
Pose2Sim: An open-source Python package for multiview markerless kinematicsCode2
Scaling Laws of Synthetic Images for Model Training ... for NowCode2
DrFuse: Learning Disentangled Representation for Clinical Multi-Modal Fusion with Missing Modality and Modal InconsistencyCode2
FABind+: Enhancing Molecular Docking through Improved Pocket Prediction and Pose GenerationCode2
Sky-image-based solar forecasting using deep learning with multi-location data: training models locally, globally or via transfer learning?Code2
Towards Realistic Low-resource Relation Extraction: A Benchmark with Empirical Baseline StudyCode2
MineLand: Simulating Large-Scale Multi-Agent Interactions with Limited Multimodal Senses and Physical NeedsCode2
DEGSTalk: Decomposed Per-Embedding Gaussian Fields for Hair-Preserving Talking Face SynthesisCode2
Neural Preset for Color Style TransferCode2
XNet: Wavelet-Based Low and High Frequency Fusion Networks for Fully- and Semi-Supervised Semantic Segmentation of Biomedical ImagesCode2
VecCity: A Taxonomy-guided Library for Map Entity Representation LearningCode2
Cross-Modal Implicit Relation Reasoning and Aligning for Text-to-Image Person RetrievalCode2
MambAttention: Mamba with Multi-Head Attention for Generalizable Single-Channel Speech EnhancementCode2
Where2comm: Communication-Efficient Collaborative Perception via Spatial Confidence MapsCode2
Long-Form Video-Language Pre-Training with Multimodal Temporal Contrastive LearningCode2
FastBlend: a Powerful Model-Free Toolkit Making Video Stylization EasierCode2
Neural Light Spheres for Implicit Image Stitching and View SynthesisCode2
The More You See in 2D the More You Perceive in 3DCode2
Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative ModelsCode2
Data Management For Training Large Language Models: A SurveyCode2
AIM: Adaptive Inference of Multi-Modal LLMs via Token Merging and PruningCode2
MTS-Mixers: Multivariate Time Series Forecasting via Factorized Temporal and Channel MixingCode2
Detect, Classify, Act: Categorizing Industrial Anomalies with Multi-Modal Large Language ModelsCode2
AdaSociety: An Adaptive Environment with Social Structures for Multi-Agent Decision-MakingCode2
PyTorch-IE: Fast and Reproducible Prototyping for Information ExtractionCode2
ProteinBERT: a universal deep-learning model of protein sequence and functionCode2
Large Language Model Instruction Following: A Survey of Progresses and ChallengesCode2
OV-Uni3DETR: Towards Unified Open-Vocabulary 3D Object Detection via Cycle-Modality PropagationCode2
On Deep Learning for Geometric and Semantic Scene Understanding Using On-Vehicle 3D LiDARCode2
Image as Set of PointsCode2
Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual DrawingCode2
Direct Inversion: Boosting Diffusion-based Editing with 3 Lines of CodeCode2
Flowformer: Linearizing Transformers with Conservation FlowsCode2
Scaled Decoupled DistillationCode2
Diffusion Models for Imperceptible and Transferable Adversarial AttackCode2
UniMed-CLIP: Towards a Unified Image-Text Pretraining Paradigm for Diverse Medical Imaging ModalitiesCode2
MTRAG: A Multi-Turn Conversational Benchmark for Evaluating Retrieval-Augmented Generation SystemsCode2
CausalGym: Benchmarking causal interpretability methods on linguistic tasksCode2
PuzzleVQA: Diagnosing Multimodal Reasoning Challenges of Language Models with Abstract Visual PatternsCode2
Rethinking the Open-Loop Evaluation of End-to-End Autonomous Driving in nuScenesCode2
Harnessing Vision Models for Time Series Analysis: A SurveyCode2
Show:102550
← PrevPage 223 of 13232Next →