SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1285112900 of 474278 papers

TitleStatusHype
PortaSpeech: Portable and High-Quality Generative Text-to-SpeechCode2
Perspective Fields for Single Image Camera CalibrationCode2
SGFormer: Simplifying and Empowering Transformers for Large-Graph RepresentationsCode2
GEOBench-VLM: Benchmarking Vision-Language Models for Geospatial TasksCode2
DPText-DETR: Towards Better Scene Text Detection with Dynamic Points in TransformerCode2
What Makes a Scene ? Scene Graph-based Evaluation and Feedback for Controllable GenerationCode2
A Noise-Robust Turn-Taking System for Real-World Dialogue Robots: A Field ExperimentCode2
CoMoSVC: Consistency Model-based Singing Voice ConversionCode2
Speech Model Pre-training for End-to-End Spoken Language UnderstandingCode2
AMUSE: Emotional Speech-driven 3D Body Animation via Disentangled Latent DiffusionCode2
Efficient Memory Management for Deep Neural Net InferenceCode2
2nd Place Solution for Waymo Open Dataset Challenge -- Real-time 2D Object DetectionCode2
Fiber: A Platform for Efficient Development and Distributed Training for Reinforcement Learning and Population-Based MethodsCode2
VidEgoThink: Assessing Egocentric Video Understanding Capabilities for Embodied AICode2
UniNet: A Contrastive Learning-guided Unified Framework with Feature Selection for Anomaly DetectionCode2
UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormerCode2
Out-of-sample scoring and automatic selection of causal estimatorsCode2
Linearly-evolved Transformer for Pan-sharpeningCode2
Geometric Clifford Algebra NetworksCode2
Diffusion Model Alignment Using Direct Preference OptimizationCode2
ConceptNet at SemEval-2017 Task 2: Extending Word Embeddings with Multilingual Relational KnowledgeCode2
PandaGPT: One Model To Instruction-Follow Them AllCode2
Side-channel analysis against ANSSI’s protected AES implementation on ARM: end-to-end attacks with multi-task learningCode2
Power Scheduler: A Batch Size and Token Number Agnostic Learning Rate SchedulerCode2
GS-CPR: Efficient Camera Pose Refinement via 3D Gaussian SplattingCode2
PixelLM: Pixel Reasoning with Large Multimodal ModelCode2
Description and Discussion on DCASE 2024 Challenge Task 2: First-Shot Unsupervised Anomalous Sound Detection for Machine Condition MonitoringCode2
Multi-Agent Large Language Models for Conversational Task-SolvingCode2
KnowCoder: Coding Structured Knowledge into LLMs for Universal Information ExtractionCode2
LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPTCode2
Towards Robust and Generalizable Lensless Imaging with Modular Learned ReconstructionCode2
CityNav: Language-Goal Aerial Navigation Dataset with Geographic InformationCode2
Accelerating Image Super-Resolution Networks with Pixel-Level ClassificationCode2
QDTrack: Quasi-Dense Similarity Learning for Appearance-Only Multiple Object TrackingCode2
F3D-Gaus: Feed-forward 3D-aware Generation on ImageNet with Cycle-Consistent Gaussian SplattingCode2
Make It Count: Text-to-Image Generation with an Accurate Number of ObjectsCode2
Learning 3D Human Pose Estimation from Dozens of Datasets using a Geometry-Aware Autoencoder to Bridge Between Skeleton FormatsCode2
3D-VisTA: Pre-trained Transformer for 3D Vision and Text AlignmentCode2
FatesGS: Fast and Accurate Sparse-View Surface Reconstruction using Gaussian Splatting with Depth-Feature ConsistencyCode2
Geolocation with Real Human Gameplay Data: A Large-Scale Dataset and Human-Like Reasoning FrameworkCode2
The dark side of the forces: assessing non-conservative force models for atomistic machine learningCode2
Evaluating Frontier Models for Dangerous CapabilitiesCode2
K-LITE: Learning Transferable Visual Models with External KnowledgeCode2
TIPO: Text to Image with Text Presampling for Prompt OptimizationCode2
Sequential Model-Based Optimization for General Algorithm ConfigurationCode2
Wasserstein Distance Rivals Kullback-Leibler Divergence for Knowledge DistillationCode2
SSLAM: Enhancing Self-Supervised Models with Audio Mixtures for Polyphonic SoundscapesCode2
M2I: From Factored Marginal Trajectory Prediction to Interactive PredictionCode2
A fully automatic AI system for tooth and alveolar bone segmentation from cone-beam CT imagesCode2
GDGB: A Benchmark for Generative Dynamic Text-Attributed Graph LearningCode2
Show:102550
← PrevPage 258 of 9486Next →