SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 73267350 of 474278 papers

TitleStatusHype
EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion ControlCode2
GSPR: Multimodal Place Recognition Using 3D Gaussian Splatting for Autonomous DrivingCode2
Generative causal testing to bridge data-driven models and scientific theories in language neuroscienceCode2
Uncertainty Modelling and Robust Observer Synthesis using the Koopman OperatorCode2
Recent Advances in Speech Language Models: A SurveyCode2
MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU LanguagesCode2
PointAD: Comprehending 3D Anomalies from Points and Pixels for Zero-shot 3D Anomaly DetectionCode2
EnzymeFlow: Generating Reaction-specific Enzyme Catalytic Pockets through Flow Matching and Co-Evolutionary DynamicsCode2
CaRtGS: Computational Alignment for Real-Time Gaussian Splatting SLAMCode2
DAOcc: 3D Object Detection Assisted Multi-Sensor Fusion for 3D Occupancy PredictionCode2
FaithEval: Can Your Language Model Stay Faithful to Context, Even If "The Moon is Made of Marshmallows"Code2
RouterDC: Query-Based Router by Dual Contrastive Learning for Assembling Large Language ModelsCode2
HazyDet: Open-source Benchmark for Drone-view Object Detection with Depth-cues in Hazy ScenesCode2
End-to-end Piano Performance-MIDI to Score Conversion with TransformersCode2
Frequency Adaptive Normalization For Non-stationary Time Series ForecastingCode2
LLMEmb: Large Language Model Can Be a Good Embedding Generator for Sequential RecommendationCode2
Towards Robust Multimodal Sentiment Analysis with Incomplete DataCode2
QAEncoder: Towards Aligned Representation Learning in Question Answering SystemCode2
Robin3D: Improving 3D Large Language Model via Robust Instruction TuningCode2
PerCo (SD): Open Perceptual CompressionCode2
LexEval: A Comprehensive Chinese Legal Benchmark for Evaluating Large Language ModelsCode2
Beyond Prompts: Dynamic Conversational Benchmarking of Large Language ModelsCode2
Melody-Guided Music GenerationCode2
DeSTA2: Developing Instruction-Following Speech Language Model Without Speech Instruction-Tuning DataCode2
Procedure-Aware Surgical Video-language Pretraining with Hierarchical Knowledge AugmentationCode2
Show:102550
← PrevPage 294 of 18972Next →