SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers · 248,104 code links · 4,818 tasks

Papers

Showing 3151-3200 of 659,983 papers

Title | Status | Hype
Leveraging Self-Supervised Learning for Speaker Diarization | Code | 3
The T05 System for The VoiceMOS Challenge 2024: Transfer Learning from Deep Image Classifier to Naturalness MOS Prediction of High-Quality Synthetic Speech | Code | 3
ASFT: Aligned Supervised Fine-Tuning through Absolute Likelihood | Code | 3
Breaking reCAPTCHAv2 | Code | 3
Apollo: Band-sequence Modeling for High-Quality Audio Restoration | Code | 3
wgatools: an ultrafast toolkit for manipulating whole genome alignments | Code | 3
RT-DETRv3: Real-time End-to-End Object Detection with Hierarchical Dense Positive Supervision | Code | 3
Neural Message Passing Induced by Energy-Constrained Diffusion | Code | 3
SGFormer: Single-Layer Graph Transformers with Approximation-Free Linear Complexity | Code | 3
WhisperNER: Unified Open Named Entity and Speech Recognition | Code | 3
FlashSplat: 2D to 3D Gaussian Splatting Segmentation Solved Optimally | Code | 3
RePlay: a Recommendation Framework for Experimentation and Production Use | Code | 3
Agent Workflow Memory | Code | 3
Hi3D: Pursuing High-Resolution Image-to-3D Generation with Video Diffusion Models | Code | 3
StereoCrafter: Diffusion-based Generation of Long and High-fidelity Stereoscopic 3D from Monocular Videos | Code | 3
Alignment of Diffusion Models: Fundamentals, Challenges, and Future | Code | 3
One Policy to Run Them All: an End-to-end Learning Approach to Multi-Embodiment Locomotion | Code | 3
Robot Utility Models: General Policies for Zero-Shot Deployment in New Environments | Code | 3
BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec | Code | 3
HyperAgent: Generalist Software Engineering Agents to Solve Coding Tasks at Scale | Code | 3
Late Chunking: Contextual Chunk Embeddings Using Long-Context Embedding Models | Code | 3
Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP Researchers | Code | 3
DreamForge: Motion-Aware Autoregressive Video Generation for Multi-View Driving Scenes | Code | 3
VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation | Code | 3
Qihoo-T2X: An Efficient Proxy-Tokenized Diffusion Transformer for Text-to-Any-Task | Code | 3
Theory, Analysis, and Best Practices for Sigmoid Self-Attention | Code | 3
Attention Heads of Large Language Models: A Survey | Code | 3
The Role of Generative Systems in Historical Photography Management: A Case Study on Catalan Archives | Code | 3
Image Over Text: Transforming Formula Recognition Evaluation with Character Detection Matching | Code | 3
LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via a Hybrid Architecture | Code | 3
EPRecon: An Efficient Framework for Real-Time Panoptic 3D Reconstruction from Monocular Video | Code | 3
LinFusion: 1 GPU, 1 Minute, 16K Image | Code | 3
Affordance-based Robot Manipulation with Flow Matching | Code | 3
ComfyBench: Benchmarking LLM-based Agents in ComfyUI for Autonomously Designing Collaborative AI Systems | Code | 3
ContextCite: Attributing Model Generation to Context | Code | 3
TinyAgent: Function Calling at the Edge | Code | 3
Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model | Code | 3
VisionTS: Visual Masked Autoencoders Are Free-Lunch Zero-Shot Time Series Forecasters | Code | 3
CTNet: A Convolutional Transformer Network for EEG-Based Motor Imagery Classification | Code | 3
SAM2Point: Segment Any 3D as Videos in Zero-shot and Promptable Manners | Code | 3
LLaVA-MoD: Making LLaVA Tiny via MoE Knowledge Distillation | Code | 3
InstanSeg: an embedding-based instance segmentation algorithm optimized for accurate, efficient and portable cell segmentation | Code | 3
LRP4RAG: Detecting Hallucinations in Retrieval-Augmented Generation via Layer-wise Relevance Propagation | Code | 3
The Mamba in the Llama: Distilling and Accelerating Hybrid Models | Code | 3
OctFusion: Octree-based Diffusion Models for 3D Shape Generation | Code | 3
A Survey of Camouflaged Object Detection and Beyond | Code | 3
SWE-bench-java: A GitHub Issue Resolving Benchmark for Java | Code | 3
Foundation Models for Music: A Survey | Code | 3
Recent Event Camera Innovations: A Survey | Code | 3
LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs | Code | 3