SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers, 248,326 code links, 4,818 tasks

Papers

Showing 7001 to 7050 of 661,570 papers

Title | Status | Hype
Graph Domain Adaptation: Challenges, Progress and Prospects | Code | 2
Learning GFlowNets from partial episodes for improved convergence and stability | Code | 2
Surrogate Learning in Meta-Black-Box Optimization: A Preliminary Study | Code | 2
Deep Reinforcement Learning Based Joint Downlink Beamforming and RIS Configuration in RIS-aided MU-MISO Systems Under Hardware Impairments and Imperfect CSI | Code | 2
StyleHEAT: One-Shot High-Resolution Editable Talking Face Generation via Pre-trained StyleGAN | Code | 2
Sparse4D v2: Recurrent Temporal Fusion with Sparse Model | Code | 2
Forecast-MAE: Self-supervised Pre-training for Motion Forecasting with Masked Autoencoders | Code | 2
Are Language Models Puzzle Prodigies? Algorithmic Puzzles Unveil Serious Challenges in Multimodal Reasoning | Code | 2
Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions | Code | 2
Tetrahedron Splatting for 3D Generation | Code | 2
Plane2Depth: Hierarchical Adaptive Plane Guidance for Monocular Depth Estimation | Code | 2
A Text-guided Protein Design Framework | Code | 2
OpenOmni: Large Language Models Pivot Zero-shot Omnimodal Alignment across Language with Real-time Self-Aware Emotional Speech Synthesis | Code | 2
Less is More: Masking Elements in Image Condition Features Avoids Content Leakages in Style Transfer Diffusion Models | Code | 2
Point-to-Box Network for Accurate Object Detection via Single Point Supervision | Code | 2
FRNet: Frustum-Range Networks for Scalable LiDAR Segmentation | Code | 2
The GENEA Challenge 2023: A large scale evaluation of gesture generation models in monadic and dyadic settings | Code | 2
MaIR: A Locality- and Continuity-Preserving Mamba for Image Restoration | Code | 2
FSTA-SNN: Frequency-based Spatial-Temporal Attention Module for Spiking Neural Networks | Code | 2
Advancing Plain Vision Transformer Towards Remote Sensing Foundation Model | Code | 2
Equivariant Ensembles and Regularization for Reinforcement Learning in Map-based Path Planning | Code | 2
Communication Learning in Multi-Agent Systems from Graph Modeling Perspective | Code | 2
Reasoning-Enhanced Healthcare Predictions with Knowledge Graph Community Retrieval | Code | 2
Towards Natural Image Matting in the Wild via Real-Scenario Prior | Code | 2
Generative Active Learning for Long-tailed Instance Segmentation | Code | 2
ODRL: A Benchmark for Off-Dynamics Reinforcement Learning | Code | 2
Expressive Text-to-Image Generation with Rich Text | Code | 2
Roboflow 100: A Rich, Multi-Domain Object Detection Benchmark | Code | 2
Look Gauss, No Pose: Novel View Synthesis using Gaussian Splatting without Accurate Pose Initialization | Code | 2
InPars: Data Augmentation for Information Retrieval using Large Language Models | Code | 2
Root Mean Square Layer Normalization | Code | 2
WINA: Weight Informed Neuron Activation for Accelerating Large Language Model Inference | Code | 2
MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues | Code | 2
PPFlow: Target-aware Peptide Design with Torsional Flow Matching | Code | 2
ARES: An Automated Evaluation Framework for Retrieval-Augmented Generation Systems | Code | 2
Particle Video Revisited: Tracking Through Occlusions Using Point Trajectories | Code | 2
FreeTumor: Large-Scale Generative Tumor Synthesis in Computed Tomography Images for Improving Tumor Recognition | Code | 2
UMBRAE: Unified Multimodal Brain Decoding | Code | 2
HiP-AD: Hierarchical and Multi-Granularity Planning with Deformable Attention for Autonomous Driving in a Single Decoder | Code | 2
LexEval: A Comprehensive Chinese Legal Benchmark for Evaluating Large Language Models | Code | 2
RankZephyr: Effective and Robust Zero-Shot Listwise Reranking is a Breeze! | Code | 2
TensorNet: Cartesian Tensor Representations for Efficient Learning of Molecular Potentials | Code | 2
Watch Every Step! LLM Agent Learning via Iterative Step-Level Process Refinement | Code | 2
FastCuRL: Curriculum Reinforcement Learning with Progressive Context Extension for Efficient Training R1-like Reasoning Models | Code | 2
Pix2Poly: A Sequence Prediction Method for End-to-end Polygonal Building Footprint Extraction from Remote Sensing Imagery | Code | 2
Multi-View Mesh Reconstruction with Neural Deferred Shading | Code | 2
Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction | Code | 2
Room impulse response reconstruction with physics-informed deep learning | Code | 2
Efficient4D: Fast Dynamic 3D Object Generation from a Single-view Video | Code | 2
MathCoder2: Better Math Reasoning from Continued Pretraining on Model-translated Mathematical Code | Code | 2