SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 29012950 of 659983 papers

TitleStatusHype
FireFlow: Fast Inversion of Rectified Flow for Image Semantic EditingCode3
CBraMod: A Criss-Cross Brain Foundation Model for EEG DecodingCode3
Normalizing Flows are Capable Generative ModelsCode3
BatchTopK Sparse AutoencodersCode3
GraphNeuralNetworks.jl: Deep Learning on Graphs with JuliaCode3
Towards Controllable Speech Synthesis in the Era of Large Language Models: A SurveyCode3
StarWhisper Telescope: Agent-Based Observation Assistant System to Approach AI AstrophysicistCode3
Around the World in 80 Timesteps: A Generative Approach to Global Visual GeolocationCode3
APOLLO: SGD-like Memory, AdamW-level PerformanceCode3
UniMLVG: Unified Framework for Multi-view Long Video Generation with Comprehensive Control Capabilities for Autonomous DrivingCode3
Aguvis: Unified Pure Vision Agents for Autonomous GUI InteractionCode3
Stereo Anywhere: Robust Zero-Shot Deep Stereo Matching Even Where Either Stereo or Mono FailCode3
Cubify Anything: Scaling Indoor 3D Object DetectionCode3
VisionZip: Longer is Better but Not Necessary in Vision Language ModelsCode3
Reinforcement Learning Enhanced LLMs: A SurveyCode3
Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth FusionCode3
ARC Prize 2024: Technical ReportCode3
PANGAEA: A Global and Inclusive Benchmark for Geospatial Foundation ModelsCode3
PaliGemma 2: A Family of Versatile VLMs for TransferCode3
From Individual to Society: A Survey on Social Simulation Driven by Large Language Model-based AgentsCode3
ChatTS: Aligning Time Series with LLMs via Synthetic Data for Enhanced Understanding and ReasoningCode3
TokenFlow: Unified Image Tokenizer for Multimodal Understanding and GenerationCode3
Prithvi-EO-2.0: A Versatile Multi-Temporal Foundation Model for Earth Observation ApplicationsCode3
Remote Sensing Temporal Vision-Language Models: A Comprehensive SurveyCode3
Advancing Speech Language Models by Scaling Supervised Fine-Tuning with Over 60,000 Hours of Synthetic Speech Dialogue DataCode3
HackSynth: LLM Agent and Evaluation Framework for Autonomous Penetration TestingCode3
Inspiring the Next Generation of Segment Anything Models: Comprehensively Evaluate SAM and SAM 2 with Diverse Prompts Towards Context-Dependent Concepts under Different ScenesCode3
MuLan: Adapting Multilingual Diffusion Models for Hundreds of Languages with Negligible CostCode3
Detection, Pose Estimation and Segmentation for Multiple Bodies: Closing the Virtuous CircleCode3
XQ-GAN: An Open-source Image Tokenization Framework for Autoregressive GenerationCode3
HUGSIM: A Real-Time, Photo-Realistic and Closed-Loop Simulator for Autonomous DrivingCode3
Towards Universal Soccer Video UnderstandingCode3
FoundIR: Unleashing Million-scale Training Data to Advance Foundation Models for Image RestorationCode3
emg2pose: A Large and Diverse Benchmark for Surface Electromyographic Hand Pose EstimationCode3
Advanced Video Inpainting Using Optical Flow-Guided Efficient DiffusionCode3
Speedy-Splat: Fast 3D Gaussian Splatting with Sparse Pixels and Sparse PrimitivesCode3
o1-Coder: an o1 Replication for CodingCode3
Scaling Transformers for Low-Bitrate High-Quality Speech CodingCode3
Auto-RAG: Autonomous Retrieval-Augmented Generation for Large Language ModelsCode3
Differentiable Voxel-based X-ray Rendering Improves Sparse-View 3D CBCT ReconstructionCode3
Cyber-Attack Technique Classification Using Two-Stage Trained Large Language ModelsCode3
ChatRex: Taming Multimodal LLM for Joint Perception and UnderstandingCode3
HI-SLAM2: Geometry-Aware Gaussian SLAM for Fast Monocular Scene ReconstructionCode3
TSD-SR: One-Step Diffusion with Target Score Distillation for Real-World Image Super-ResolutionCode3
Large Language Model-Brained GUI Agents: A SurveyCode3
CLOVER: Cross-Layer Orthogonal Vectors Pruning and Fine-TuningCode3
Star Attention: Efficient LLM Inference over Long SequencesCode3
On the Efficiency of NLP-Inspired Methods for Tabular Deep LearningCode3
A Distractor-Aware Memory for Visual Object Tracking with SAM2Code3
Pushing the Limits of Large Language Model Quantization via the Linearity TheoremCode3
Show:102550
← PrevPage 59 of 13200Next →