SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers, 248,326 code links, 4,818 tasks

Papers

Showing 10451 to 10500 of 661570 papers

Title | Status | Hype
ADELIE: Aligning Large Language Models on Information Extraction | Code | 2
Web-Shepherd: Advancing PRMs for Reinforcing Web Agents | Code | 2
DeepPrivacy2: Towards Realistic Full-Body Anonymization | Code | 2
Pre-training Enhanced Spatial-temporal Graph Neural Network for Multivariate Time Series Forecasting | Code | 2
EasyText: Controllable Diffusion Transformer for Multilingual Text Rendering | Code | 2
Graph Condensation: A Survey | Code | 2
Large Language Model Enhanced Recommender Systems: A Survey | Code | 2
SF-Loc: A Visual Mapping and Geo-Localization System based on Sparse Visual Structure Frames | Code | 2
FinMME: Benchmark Dataset for Financial Multi-Modal Reasoning Evaluation | Code | 2
LLM-FE: Automated Feature Engineering for Tabular Data with LLMs as Evolutionary Optimizers | Code | 2
DriveTransformer: Unified Transformer for Scalable End-to-End Autonomous Driving | Code | 2
LLark: A Multimodal Instruction-Following Language Model for Music | Code | 2
The Sensory Neuron as a Transformer: Permutation-Invariant Neural Networks for Reinforcement Learning | Code | 2
Heterogeneity-Informed Meta-Parameter Learning for Spatiotemporal Time Series Forecasting | Code | 2
OMLT: Optimization & Machine Learning Toolkit | Code | 2
HiGPT: Heterogeneous Graph Language Model | Code | 2
Objects With Lighting: A Real-World Dataset for Evaluating Reconstruction and Rendering for Object Relighting | Code | 2
Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and Action | Code | 2
Rethinking Diverse Human Preference Learning through Principal Component Analysis | Code | 2
Generative Inbetweening through Frame-wise Conditions-Driven Video Generation | Code | 2
See More Details: Efficient Image Super-Resolution by Experts Mining | Code | 2
TAVA: Template-free Animatable Volumetric Actors | Code | 2
Code2Logic: Game-Code-Driven Data Synthesis for Enhancing VLMs General Reasoning | Code | 2
GEAR: An Efficient KV Cache Compression Recipe for Near-Lossless Generative Inference of LLM | Code | 2
Cross-View Referring Multi-Object Tracking | Code | 2
Gaussian Mixture Flow Matching Models | Code | 2
Quantus: An Explainable AI Toolkit for Responsible Evaluation of Neural Network Explanations and Beyond | Code | 2
Searching Latent Program Spaces | Code | 2
MSVM-UNet: Multi-Scale Vision Mamba UNet for Medical Image Segmentation | Code | 2
Equalized Focal Loss for Dense Long-Tailed Object Detection | Code | 2
DetZero: Rethinking Offboard 3D Object Detection with Long-term Sequential Point Clouds | Code | 2
A Touch, Vision, and Language Dataset for Multimodal Alignment | Code | 2
Latent Ewald summation for machine learning of long-range interactions | Code | 2
Exploring Contrastive Learning for Multimodal Detection of Misogynistic Memes | Code | 2
VLT: Vision-Language Transformer and Query Generation for Referring Segmentation | Code | 2
DeepDPM: Deep Clustering With an Unknown Number of Clusters | Code | 2
LayoutPrompter: Awaken the Design Ability of Large Language Models | Code | 2
Behind Maya: Building a Multilingual Vision Language Model | Code | 2
Alignment faking in large language models | Code | 2
Unleashing Unlabeled Data: A Paradigm for Cross-View Geo-Localization | Code | 2
CelebV-HQ: A Large-Scale Video Facial Attributes Dataset | Code | 2
Jodi: Unification of Visual Generation and Understanding via Joint Modeling | Code | 2
Dress Code: High-Resolution Multi-Category Virtual Try-On | Code | 2
Embodied-R: Collaborative Framework for Activating Embodied Spatial Reasoning in Foundation Models via Reinforcement Learning | Code | 2
AMAGO-2: Breaking the Multi-Task Barrier in Meta-Reinforcement Learning with Transformers | Code | 2
3D Student Splatting and Scooping | Code | 2
TJ4DRadSet: A 4D Radar Dataset for Autonomous Driving | Code | 2
Delta Decompression for MoE-based LLMs Compression | Code | 2
CMMLU: Measuring massive multitask language understanding in Chinese | Code | 2
SemViQA: A Semantic Question Answering System for Vietnamese Information Fact-Checking | Code | 2
Page 210 of 13232