SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 7401–7450 of 177340 papers

Title | Status | Hype
BBT-Fin: Comprehensive Construction of Chinese Financial Domain Pre-trained Language Model, Corpus and Benchmark | Code | 2
DiffusioNeRF: Regularizing Neural Radiance Fields with Denoising Diffusion Models | Code | 2
Language-Driven Representation Learning for Robotics | Code | 2
Learning stiff chemical kinetics using extended deep neural operators | Code | 2
AbstentionBench: Reasoning LLMs Fail on Unanswerable Questions | Code | 2
Efficient and Explicit Modelling of Image Hierarchies for Image Restoration | Code | 2
Multimodal Industrial Anomaly Detection via Hybrid Fusion | Code | 2
Run, Don't Walk: Chasing Higher FLOPS for Faster Neural Networks | Code | 2
DiffusionDepth: Diffusion Denoising Approach for Monocular Depth Estimation | Code | 2
V2V4Real: A Real-world Large-scale Dataset for Vehicle-to-Vehicle Cooperative Perception | Code | 2
Let 2D Diffusion Model Know 3D-Consistency for Robust Text-to-3D Generation | Code | 2
FreeNeRF: Improving Few-shot Neural Rendering with Free Frequency Regularization | Code | 2
Model scale versus domain knowledge in statistical forecasting of chaotic systems | Code | 2
Conditional Diffusion Models for Semantic 3D Brain MRI Synthesis | Code | 2
A Simple Framework for 3D Occupancy Estimation in Autonomous Driving | Code | 2
FreeDoM: Training-Free Energy-Guided Conditional Diffusion Model | Code | 2
Masked Image Training for Generalizable Deep Image Denoising | Code | 2
Anti-DreamBooth: Protecting users from personalized text-to-image synthesis | Code | 2
Learned Image Compression with Mixed Transformer-CNN Architectures | Code | 2
Label-Free Liver Tumor Segmentation | Code | 2
Learning Generative Structure Prior for Blind Text Image Super-resolution | Code | 2
DDP: Diffusion Model for Dense Visual Prediction | Code | 2
3D Line Mapping Revisited | Code | 2
OrienterNet: Visual Localization in 2D Public Maps with Neural Matching | Code | 2
DiffMimic: Efficient Motion Mimicking with Differentiable Physics | Code | 2
Inst-Inpaint: Instructing to Remove Objects with Diffusion Models | Code | 2
ETPNav: Evolving Topological Planning for Vision-Language Navigation in Continuous Environments | Code | 2
GEO: Generative Engine Optimization | Code | 2
CherryPicker: Semantic Skeletonization and Topological Reconstruction of Cherry Trees | Code | 2
Deep Image Matting: A Comprehensive Survey | Code | 2
Similarity search in the blink of an eye with compressed indices | Code | 2
Unifying and Personalizing Weakly-supervised Federated Medical Image Segmentation via Adaptive Representation and Aggregation | Code | 2
DiffusionRig: Learning Personalized Priors for Facial Appearance Editing | Code | 2
An Edit Friendly DDPM Noise Space: Inversion and Manipulations | Code | 2
VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset | Code | 2
ChatPLUG: Open-Domain Generative Dialogue System with Internet-Augmented Instruction Tuning for Digital Human | Code | 2
Unlocking Context Constraints of LLMs: Enhancing Context Efficiency of LLMs with Self-Information-Based Content Filtering | Code | 2
Language Models Enable Simple Systems for Generating Structured Views of Heterogeneous Data Lakes | Code | 2
LLM+P: Empowering Large Language Models with Optimal Planning Proficiency | Code | 2
Total-Recon: Deformable Scene Reconstruction for Embodied View Synthesis | Code | 2
TensoIR: Tensorial Inverse Rendering | Code | 2
TALLRec: An Effective and Efficient Tuning Framework to Align Large Language Model with Recommendation | Code | 2
TidyBot: Personalized Robot Assistance with Large Language Models | Code | 2
SAM & SAM 2 in 3D Slicer: SegmentWithSAM Extension for Annotating Medical Images | Code | 2
ControlNet-XS: Rethinking the Control of Text-to-Image Diffusion Models as Feedback-Control Systems | Code | 2
Active Retrieval Augmented Generation | Code | 2
Marsellus: A Heterogeneous RISC-V AI-IoT End-Node SoC with 2-to-8b DNN Acceleration and 30%-Boost Adaptive Body Biasing | Code | 2
Monocular Dynamic View Synthesis: A Reality Check | Code | 2
AgentCoder: Multi-Agent-based Code Generation with Iterative Testing and Optimisation | Code | 2
Improving Factuality and Reasoning in Language Models through Multiagent Debate | Code | 2
Page 149 of 3547