SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 43014350 of 661570 papers

TitleStatusHype
Grad: Guided Relation Diffusion Generation for Graph Augmentation in Graph Fraud DetectionCode3
ParetoQ: Scaling Laws in Extremely Low-bit LLM QuantizationCode3
Large Language Model based Long-tail Query Rewriting in Taobao SearchCode3
SealQA: Raising the Bar for Reasoning in Search-Augmented Language ModelsCode3
MuLan: Adapting Multilingual Diffusion Models for Hundreds of Languages with Negligible CostCode3
View Selection for 3D Captioning via Diffusion RankingCode3
Fast-MD: Fast Multi-Decoder End-to-End Speech Translation with Non-Autoregressive Hidden IntermediatesCode3
ART: Anonymous Region Transformer for Variable Multi-Layer Transparent Image GenerationCode3
Lossless and Near-Lossless Compression for Foundation ModelsCode3
StarWhisper Telescope: Agent-Based Observation Assistant System to Approach AI AstrophysicistCode3
CTNet: A Convolutional Transformer Network for EEG-Based Motor Imagery ClassificationCode3
Affordable AI Assistants with Knowledge Graph of ThoughtsCode3
The Elephant in the Room: Towards A Reliable Time-Series Anomaly Detection BenchmarkCode3
DDT: Decoupled Diffusion TransformerCode3
PhysTwin: Physics-Informed Reconstruction and Simulation of Deformable Objects from VideosCode3
Deep Learning and LLM-based Methods Applied to Stellar Lightcurve ClassificationCode3
Detecting hallucinations in large language models using semantic entropyCode3
The Curse of Multi-Modalities: Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and AudioCode3
BoostTrack: boosting the similarity measure and detection confidence for improved multiple object trackingCode3
SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-ScalingCode3
PanoHead: Geometry-Aware 3D Full-Head Synthesis in 360^Code3
VideoChat-R1: Enhancing Spatio-Temporal Perception via Reinforcement Fine-TuningCode3
Improving Dictionary Learning with Gated Sparse AutoencodersCode3
Open3D: A Modern Library for 3D Data ProcessingCode3
ATPrompt: Textual Prompt Learning with Embedded AttributesCode3
N-BEATS: Neural basis expansion analysis for interpretable time series forecastingCode3
Mip-Splatting: Alias-free 3D Gaussian SplattingCode3
Video Prediction Policy: A Generalist Robot Policy with Predictive Visual RepresentationsCode3
A Vision-Language Foundation Model to Enhance Efficiency of Chest X-ray InterpretationCode3
Scaling Rectified Flow Transformers for High-Resolution Image SynthesisCode3
MegaPairs: Massive Data Synthesis For Universal Multimodal RetrievalCode3
BasicVSR: The Search for Essential Components in Video Super-Resolution and BeyondCode3
Towards CausalGPT: A Multi-Agent Approach for Faithful Knowledge Reasoning via Promoting Causal Consistency in LLMsCode3
WHAM: Reconstructing World-grounded Humans with Accurate 3D MotionCode3
Block-NeRF: Scalable Large Scene Neural View SynthesisCode3
SemantiCodec: An Ultra Low Bitrate Semantic Audio Codec for General SoundCode3
Vision Transformers for Dense PredictionCode3
RepViT: Revisiting Mobile CNN From ViT PerspectiveCode3
MagicAnimate: Temporally Consistent Human Image Animation using Diffusion ModelCode3
CRAG -- Comprehensive RAG BenchmarkCode3
Major TOM: Expandable Datasets for Earth ObservationCode3
Uni-QSAR: an Auto-ML Tool for Molecular Property PredictionCode3
Optimal Variable Speed Limit Control Strategy on Freeway Segments under Fog ConditionsCode3
Towards General-purpose Infrastructure for Protecting Scientific Data Under StudyCode3
L1: Controlling How Long A Reasoning Model Thinks With Reinforcement LearningCode3
Genie: Generative Interactive EnvironmentsCode3
Exploring Regional Clues in CLIP for Zero-Shot Semantic SegmentationCode3
Efficiently Serving LLM Reasoning Programs with CertaindexCode3
SPO: Sequential Monte Carlo Policy OptimisationCode3
AgentStudio: A Toolkit for Building General Virtual AgentsCode3
Show:102550
← PrevPage 87 of 13232Next →