SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 39514000 of 177340 papers

TitleStatusHype
HumanVid: Demystifying Training Data for Camera-controllable Human Image AnimationCode3
Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human SupervisionCode3
PromptDresser: Improving the Quality and Controllability of Virtual Try-On via Generative Textual Prompt and Prompt-aware MaskCode3
Interpretable Differencing of Machine Learning ModelsCode3
Enhancing End-to-End Autonomous Driving with Latent World ModelCode3
GNM: A General Navigation Model to Drive Any RobotCode3
Cut and Learn for Unsupervised Object Detection and Instance SegmentationCode3
FedLLM-Bench: Realistic Benchmarks for Federated Learning of Large Language ModelsCode3
Differentiable Voxel-based X-ray Rendering Improves Sparse-View 3D CBCT ReconstructionCode3
COCOLA: Coherence-Oriented Contrastive Learning of Musical Audio RepresentationsCode3
From human experts to machines: An LLM supported approach to ontology and knowledge graph constructionCode3
vAttention: Dynamic Memory Management for Serving LLMs without PagedAttentionCode3
VideoMind: A Chain-of-LoRA Agent for Long Video ReasoningCode3
AP-LDM: Attentive and Progressive Latent Diffusion Model for Training-Free High-Resolution Image GenerationCode3
Advancing Speech Language Models by Scaling Supervised Fine-Tuning with Over 60,000 Hours of Synthetic Speech Dialogue DataCode3
Deep Learning Alternatives of the Kolmogorov Superposition TheoremCode3
Transolver: A Fast Transformer Solver for PDEs on General GeometriesCode3
FNSPID: A Comprehensive Financial News Dataset in Time SeriesCode3
An Improved RaftStereo Trained with A Mixed Dataset for the Robust Vision Challenge 2022Code3
In-Context Learning for Extreme Multi-Label ClassificationCode3
SpeechGPT: Empowering Large Language Models with Intrinsic Cross-Modal Conversational AbilitiesCode3
M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language ModelsCode3
ConvNeXt V2: Co-designing and Scaling ConvNets with Masked AutoencodersCode3
SoccerNet Game State Reconstruction: End-to-End Athlete Tracking and Identification on a MinimapCode3
Cold Diffusion: Inverting Arbitrary Image Transforms Without NoiseCode3
Long-Context Autoregressive Video Modeling with Next-Frame PredictionCode3
ID-Animator: Zero-Shot Identity-Preserving Human Video GenerationCode3
Gaussian Splatting on the Move: Blur and Rolling Shutter Compensation for Natural Camera MotionCode3
Data-Copilot: Bridging Billions of Data and Humans with Autonomous WorkflowCode3
Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset TransferCode3
Consistency Models Made EasyCode3
Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP ResearchersCode3
UniTraj: A Unified Framework for Scalable Vehicle Trajectory PredictionCode3
Scalable Optimization in the Modular NormCode3
SupeRANSAC: One RANSAC to Rule Them AllCode3
Wordflow: Social Prompt Engineering for Large Language ModelsCode3
HackSynth: LLM Agent and Evaluation Framework for Autonomous Penetration TestingCode3
Visible-Thermal Tiny Object Detection: A Benchmark Dataset and BaselinesCode3
Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language ModelsCode3
Face Anonymization Made SimpleCode3
Locating and Editing Factual Associations in GPTCode3
OmDet: Large-scale vision-language multi-dataset pre-training with multimodal detection networkCode3
DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular VideosCode3
ImageInWords: Unlocking Hyper-Detailed Image DescriptionsCode3
Flow Q-LearningCode3
MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge GraphsCode3
CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, Controllability and CompatibilityCode3
Lyra: An Efficient and Speech-Centric Framework for Omni-CognitionCode3
The Ninth NTIRE 2024 Efficient Super-Resolution Challenge ReportCode3
LongVU: Spatiotemporal Adaptive Compression for Long Video-Language UnderstandingCode3
Show:102550
← PrevPage 80 of 3547Next →