SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1180111850 of 177340 papers

TitleStatusHype
Stable Neural Stochastic Differential Equations in Analyzing Irregular Time Series DataCode2
Triad: Empowering LMM-based Anomaly Detection with Vision Expert-guided Visual Tokenizer and Manufacturing ProcessCode2
OptMetaOpenFOAM: Large Language Model Driven Chain of Thought for Sensitivity Analysis and Parameter Optimization based on CFDCode2
Q-Bench: A Benchmark for General-Purpose Foundation Models on Low-level VisionCode2
Animate-A-Story: Storytelling with Retrieval-Augmented Video GenerationCode2
PhoneLM:an Efficient and Capable Small Language Model Family through Principled Pre-trainingCode2
KAGNNs: Kolmogorov-Arnold Networks meet Graph LearningCode2
TRAK: Attributing Model Behavior at ScaleCode2
Single-Stage Diffusion NeRF: A Unified Approach to 3D Generation and ReconstructionCode2
DIFFormer: Scalable (Graph) Transformers Induced by Energy Constrained DiffusionCode2
Long-Context Language Modeling with Parallel Context EncodingCode2
EndoDAC: Efficient Adapting Foundation Model for Self-Supervised Depth Estimation from Any Endoscopic CameraCode2
Diffusion Guidance Is a Controllable Policy Improvement OperatorCode2
FlexiEdit: Frequency-Aware Latent Refinement for Enhanced Non-Rigid EditingCode2
OpenForest: A data catalogue for machine learning in forest monitoringCode2
Multimodality Helps Unimodality: Cross-Modal Few-Shot Learning with Multimodal ModelsCode2
Unlocking Efficient Long-to-Short LLM Reasoning with Model MergingCode2
Search and Refine During Think: Autonomous Retrieval-Augmented Reasoning of LLMsCode2
Keeping Yourself is Important in Downstream Tuning Multimodal Large Language ModelCode2
LoQT: Low-Rank Adapters for Quantized PretrainingCode2
EgoExoLearn: A Dataset for Bridging Asynchronous Ego- and Exo-centric View of Procedural Activities in Real WorldCode2
NavRAG: Generating User Demand Instructions for Embodied Navigation through Retrieval-Augmented LLMCode2
Reason3D: Searching and Reasoning 3D Segmentation via Large Language ModelCode2
Visual Speech Recognition for Multiple Languages in the WildCode2
LaneSegNet: Map Learning with Lane Segment Perception for Autonomous DrivingCode2
Distillation-Free One-Step Diffusion for Real-World Image Super-ResolutionCode2
Efficient Reinforcement Finetuning via Adaptive Curriculum LearningCode2
Training Generative Image Super-Resolution Models by Wavelet-Domain Losses Enables Better Control of ArtifactsCode2
Writing in the Margins: Better Inference Pattern for Long Context RetrievalCode2
chemtrain-deploy: A parallel and scalable framework for machine learning potentials in million-atom MD simulationsCode2
Envision3D: One Image to 3D with Anchor Views InterpolationCode2
Full-Atom Peptide Design based on Multi-modal Flow MatchingCode2
Inter-subject Contrastive Learning for Subject Adaptive EEG-based Visual RecognitionCode2
HF-NeuS: Improved Surface Reconstruction Using High-Frequency DetailsCode2
SKIPP'D: a SKy Images and Photovoltaic Power Generation Dataset for Short-term Solar ForecastingCode2
LayoutGPT: Compositional Visual Planning and Generation with Large Language ModelsCode2
SCTNet: Single-Branch CNN with Transformer Semantic Information for Real-Time SegmentationCode2
Neural-Fly Enables Rapid Learning for Agile Flight in Strong WindsCode2
Neural Cloth SimulationCode2
AI Hospital: Benchmarking Large Language Models in a Multi-agent Medical Interaction SimulatorCode2
Multi-Scale and Detail-Enhanced Segment Anything Model for Salient Object DetectionCode2
A Case Study in CUDA Kernel Fusion: Implementing FlashAttention-2 on NVIDIA Hopper Architecture using the CUTLASS LibraryCode2
EmoSphere++: Emotion-Controllable Zero-Shot Text-to-Speech via Emotion-Adaptive Spherical VectorCode2
Striped Attention: Faster Ring Attention for Causal TransformersCode2
4D Contrastive Superflows are Dense 3D Representation LearnersCode2
Symbol as Points: Panoptic Symbol Spotting via Point-based RepresentationCode2
Personalized Large Language ModelsCode2
Shopping Queries Dataset: A Large-Scale ESCI Benchmark for Improving Product SearchCode2
Reasoning-Table: Exploring Reinforcement Learning for Table ReasoningCode2
STAR: A First-Ever Dataset and A Large-Scale Benchmark for Scene Graph Generation in Large-Size Satellite ImageryCode2
Show:102550
← PrevPage 237 of 3547Next →