SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 31513200 of 659983 papers

TitleStatusHype
Inferflow: an Efficient and Highly Configurable Inference Engine for Large Language ModelsCode3
DF40: Toward Next-Generation Deepfake DetectionCode3
Rho-1: Not All Tokens Are What You NeedCode3
multiGradICON: A Foundation Model for Multimodal Medical Image RegistrationCode3
MANTIS: Interleaved Multi-Image Instruction TuningCode3
GIM: A Million-scale Benchmark for Generative Image Manipulation Detection and LocalizationCode3
HEST-1k: A Dataset for Spatial Transcriptomics and Histology Image AnalysisCode3
AlphaForge: A Framework to Mine and Dynamically Combine Formulaic Alpha FactorsCode3
YourMT3+: Multi-instrument Music Transcription with Enhanced Transformer Architectures and Cross-dataset Stem AugmentationCode3
EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything ModelCode3
Evaluation of Text-to-Video Generation Models: A Dynamics PerspectiveCode3
FlashDepth: Real-time Streaming Video Depth Estimation at 2K ResolutionCode3
OneRestore: A Universal Restoration Framework for Composite DegradationCode3
WorkArena++: Towards Compositional Planning and Reasoning-based Common Knowledge Work TasksCode3
Chat-Edit-3D: Interactive 3D Scene Editing via Text PromptsCode3
Unified Approach for Hedging Impermanent Loss of Liquidity ProvisionCode3
Neural Localizer Fields for Continuous 3D Human Pose and Shape EstimationCode3
rLLM: Relational Table Learning with LLMsCode3
WildGaussians: 3D Gaussian Splatting in the WildCode3
VISA: Reasoning Video Object Segmentation via Large Language ModelsCode3
Scaling Retrieval-Based Language Models with a Trillion-Token DatastoreCode3
Compact Language Models via Pruning and Knowledge DistillationCode3
PyABSA: A Modularized Framework for Reproducible Aspect-based Sentiment AnalysisCode3
Integer-Valued Training and Spike-Driven Inference Spiking Neural Network for High-performance and Energy-efficient Object DetectionCode3
MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for MedicineCode3
1.5-Pints Technical Report: Pretraining in Days, Not Months -- Your Language Model Thrives on Quality DataCode3
NeuFlow v2: High-Efficiency Optical Flow Estimation on Edge DevicesCode3
ABQ-LLM: Arbitrary-Bit Quantized Inference Acceleration for Large Language ModelsCode3
LoopSplat: Loop Closure by Registering 3D Gaussian SplatsCode3
Anim-Director: A Large Multimodal Model Powered Agent for Controllable Animation Video GenerationCode3
AnyGraph: Graph Foundation Model in the WildCode3
LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMsCode3
3DGS-LM: Faster Gaussian-Splatting Optimization with Levenberg-MarquardtCode3
PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language InstructionsCode3
TCSinger: Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style ControlCode3
Text2CAD: Generating Sequential CAD Models from Beginner-to-Expert Level Text PromptsCode3
ReMEmbR: Building and Reasoning Over Long-Horizon Spatio-Temporal Memory for Robot NavigationCode3
Results of the Big ANN: NeurIPS'23 competitionCode3
Diffusion Models are Evolutionary AlgorithmsCode3
ControlAR: Controllable Image Generation with Autoregressive ModelsCode3
CLoSD: Closing the Loop between Simulation and Diffusion for multi-task character controlCode3
DAWN: Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for Talking Head Video GenerationCode3
Scaling Diffusion Language Models via Adaptation from Autoregressive ModelsCode3
ZipNN: Lossless Compression for AI ModelsCode3
TEXGen: a Generative Diffusion Model for Mesh TexturesCode3
BIP3D: Bridging 2D Images and 3D Perception for Embodied IntelligenceCode3
Auto-RAG: Autonomous Retrieval-Augmented Generation for Large Language ModelsCode3
TokenFlow: Unified Image Tokenizer for Multimodal Understanding and GenerationCode3
TryOffAnyone: Tiled Cloth Generation from a Dressed PersonCode3
InterPLM: Discovering Interpretable Features in Protein Language Models via Sparse AutoencodersCode3
Show:102550
← PrevPage 64 of 13200Next →