SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 33513400 of 659983 papers

TitleStatusHype
MER 2024: Semi-Supervised Learning, Noise Robustness, and Open-Vocabulary Multimodal Emotion RecognitionCode3
DDColor: Towards Photo-Realistic Image Colorization via Dual DecodersCode3
PCDCNet: A Surrogate Model for Air Quality Forecasting with Physical-Chemical Dynamics and ConstraintsCode3
MACE: Mass Concept Erasure in Diffusion ModelsCode3
MAPE-PPI: Towards Effective and Efficient Protein-Protein Interaction Prediction via Microenvironment-Aware Protein EmbeddingCode3
TopoTune : A Framework for Generalized Combinatorial Complex Neural NetworksCode3
FlipSketch: Flipping Static Drawings to Text-Guided Sketch AnimationsCode3
DoWhy: An End-to-End Library for Causal InferenceCode3
Relative Pose Estimation through Affine Corrections of Monocular Depth PriorsCode3
DistiLLM: Towards Streamlined Distillation for Large Language ModelsCode3
TimeChat-Online: 80% Visual Tokens are Naturally Redundant in Streaming VideosCode3
R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy OptimizationCode3
Music2Latent: Consistency Autoencoders for Latent Audio CompressionCode3
Advanced Video Inpainting Using Optical Flow-Guided Efficient DiffusionCode3
MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly DetectionCode3
A Survey on the Memory Mechanism of Large Language Model based AgentsCode3
ACEGEN: Reinforcement learning of generative chemical agents for drug discoveryCode3
Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and PlanningCode3
RiNALMo: General-Purpose RNA Language Models Can Generalize Well on Structure Prediction TasksCode3
Embodied Understanding of Driving ScenariosCode3
Personalized Image Generation with Deep Generative Models: A Decade SurveyCode3
R1-ShareVL: Incentivizing Reasoning Capability of Multimodal Large Language Models via Share-GRPOCode3
Datasheet for the PileCode3
UniMERNet: A Universal Network for Real-World Mathematical Expression RecognitionCode3
imitation: Clean Imitation Learning ImplementationsCode3
Efficient Video Action Detection with Token Dropout and Context RefinementCode3
Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of GeneralizationCode3
LLM-Pruner: On the Structural Pruning of Large Language ModelsCode3
BTLM-3B-8K: 7B Parameter Performance in a 3B Parameter ModelCode3
HI-SLAM2: Geometry-Aware Gaussian SLAM for Fast Monocular Scene ReconstructionCode3
EfficientTrain++: Generalized Curriculum Learning for Efficient Visual Backbone TrainingCode3
White-Box Transformers via Sparse Rate ReductionCode3
SCSegamba: Lightweight Structure-Aware Vision Mamba for Crack Segmentation in StructuresCode3
Fine-Tuning Language Models from Human PreferencesCode3
GuardT2I: Defending Text-to-Image Models from Adversarial PromptsCode3
Diffree: Text-Guided Shape Free Object Inpainting with Diffusion ModelCode3
Beyond Specialization: Assessing the Capabilities of MLLMs in Age and Gender EstimationCode3
EvoTorch: Scalable Evolutionary Computation in PythonCode3
Leveraging Vision-Centric Multi-Modal Expertise for 3D Object DetectionCode3
Are We Done with MMLU?Code3
Does End-to-End Autonomous Driving Really Need Perception Tasks?Code3
Towards Realistic Scene Generation with LiDAR Diffusion ModelsCode3
Improved Modelling of Federated Datasets using Mixtures-of-Dirichlet-MultinomialsCode3
LightGBM: A Highly Efficient Gradient Boosting Decision TreeCode3
Faster Diffusion via Temporal Attention DecompositionCode3
A Comprehensive Survey on Composed Image RetrievalCode3
Animatable and Relightable Gaussians for High-fidelity Human Avatar ModelingCode3
AISafetyLab: A Comprehensive Framework for AI Safety Evaluation and ImprovementCode3
RefChecker: Reference-based Fine-grained Hallucination Checker and Benchmark for Large Language ModelsCode3
TextBox 2.0: A Text Generation Library with Pre-trained Language ModelsCode3
Show:102550
← PrevPage 68 of 13200Next →