SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 72017250 of 661570 papers

TitleStatusHype
Knowledge Distillation in YOLOX-ViT for Side-Scan Sonar Object DetectionCode2
SEGAN: Speech Enhancement Generative Adversarial NetworkCode2
SeerAttention: Learning Intrinsic Sparse Attention in Your LLMsCode2
CPED: A Large-Scale Chinese Personalized and Emotional Dialogue Dataset for Conversational AICode2
3DGraphLLM: Combining Semantic Graphs and Large Language Models for 3D Scene UnderstandingCode2
Standing Between Past and Future: Spatio-Temporal Modeling for Multi-Camera 3D Multi-Object TrackingCode2
AutoSoccerPose: Automated 3D posture Analysis of Soccer Shot MovementsCode2
Global Estimation of Building-Integrated Facade and Rooftop Photovoltaic Potential by Integrating 3D Building Footprint and Spatio-Temporal DatasetsCode2
ByT5 model for massively multilingual grapheme-to-phoneme conversionCode2
Unwrapping The Black Box of Deep ReLU Networks: Interpretability, Diagnostics, and SimplificationCode2
StreetSurf: Extending Multi-view Implicit Surface Reconstruction to Street ViewsCode2
A Data-scalable Transformer for Medical Image Segmentation: Architecture, Model Efficiency, and BenchmarkCode2
Neural interval-censored survival regression with feature selectionCode2
DynamicRAG: Leveraging Outputs of Large Language Model as Feedback for Dynamic Reranking in Retrieval-Augmented GenerationCode2
DiffusionBERT: Improving Generative Masked Language Models with Diffusion ModelsCode2
Executing your Commands via Motion Diffusion in Latent SpaceCode2
NMS Strikes BackCode2
YOLOMG: Vision-based Drone-to-Drone Detection with Appearance and Pixel-Level Motion FusionCode2
DiffFace: Diffusion-based Face Swapping with Facial GuidanceCode2
CodeJudge: Evaluating Code Generation with Large Language ModelsCode2
Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model SizesCode2
Libriheavy: a 50,000 hours ASR corpus with punctuation casing and contextCode2
Temporally Consistent Unbalanced Optimal Transport for Unsupervised Action SegmentationCode2
One-Step Diffusion Distillation through Score Implicit MatchingCode2
Leveraging Reasoning Model Answers to Enhance Non-Reasoning Model CapabilityCode2
Efficient Speech Enhancement via Embeddings from Pre-trained Generative AudioencodersCode2
Learning local equivariant representations for quantum operatorsCode2
BYOL for Audio: Exploring Pre-trained General-purpose Audio RepresentationsCode2
Scaling Relationship on Learning Mathematical Reasoning with Large Language ModelsCode2
ETSformer: Exponential Smoothing Transformers for Time-series ForecastingCode2
Investigating Affective Use and Emotional Well-being on ChatGPTCode2
LLM Processes: Numerical Predictive Distributions Conditioned on Natural LanguageCode2
AnySat: One Earth Observation Model for Many Resolutions, Scales, and ModalitiesCode2
ZnTrack -- Data as CodeCode2
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model ParallelismCode2
Autonomous Improvement of Instruction Following Skills via Foundation ModelsCode2
MemoryBank: Enhancing Large Language Models with Long-Term MemoryCode2
FlashST: A Simple and Universal Prompt-Tuning Framework for Traffic PredictionCode2
Sailing AI by the Stars: A Survey of Learning from Rewards in Post-Training and Test-Time Scaling of Large Language ModelsCode2
SuperEdit: Rectifying and Facilitating Supervision for Instruction-Based Image EditingCode2
GauSS-MI: Gaussian Splatting Shannon Mutual Information for Active 3D ReconstructionCode2
Unified Continuous Generative ModelsCode2
Text-based Animatable 3D Avatars with Morphable Model AlignmentCode2
Miipher-2: A Universal Speech Restoration Model for Million-Hour Scale Data RestorationCode2
SAS-Bench: A Fine-Grained Benchmark for Evaluating Short Answer Scoring with Large Language ModelsCode2
Text-to-CadQuery: A New Paradigm for CAD Generation with Scalable Large Model CapabilitiesCode2
Boosting Global-Local Feature Matching via Anomaly Synthesis for Multi-Class Point Cloud Anomaly DetectionCode2
A Tutorial on Structural Identifiability of Epidemic Models Using StructuralIdentifiability.jlCode2
DexGarmentLab: Dexterous Garment Manipulation Environment with Generalizable PolicyCode2
RBF++: Quantifying and Optimizing Reasoning Boundaries across Measurable and Unmeasurable Capabilities for Chain-of-Thought ReasoningCode2
Show:102550
← PrevPage 145 of 13232Next →