SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 16011650 of 177339 papers

TitleStatusHype
Kornia-rs: A Low-Level 3D Computer Vision Library In RustCode4
Metric3D: Towards Zero-shot Metric 3D Prediction from A Single ImageCode4
DeepFaceLab: Integrated, flexible and extensible face-swapping frameworkCode4
PromptFix: You Prompt and We Fix the PhotoCode4
Tiny Time Mixers (TTMs): Fast Pre-trained Models for Enhanced Zero/Few-Shot Forecasting of Multivariate Time SeriesCode4
A Survey on Deep Stereo Matching in the TwentiesCode4
EdgeTAM: On-Device Track Anything ModelCode4
Improving Retrieval-Augmented Generation in Medicine with Iterative Follow-up QuestionsCode4
Towards Real-World Blind Face Restoration with Generative Facial PriorCode4
HuaTuo: Tuning LLaMA Model with Chinese Medical KnowledgeCode4
DynamiCrafter: Animating Open-domain Images with Video Diffusion PriorsCode4
"Do Anything Now": Characterizing and Evaluating In-The-Wild Jailbreak Prompts on Large Language ModelsCode4
Benchopt: Reproducible, efficient and collaborative optimization benchmarksCode4
Couler: Unified Machine Learning Workflow Optimization in CloudCode4
N-Grammer: Augmenting Transformers with latent n-gramsCode4
Senna: Bridging Large Vision-Language Models and End-to-End Autonomous DrivingCode4
SZTU-CMU at MER2024: Improving Emotion-LLaMA with Conv-Attention for Multimodal Emotion RecognitionCode4
SpeechColab Leaderboard: An Open-Source Platform for Automatic Speech Recognition EvaluationCode4
AgentLite: A Lightweight Library for Building and Advancing Task-Oriented LLM Agent SystemCode4
HLLM: Enhancing Sequential Recommendations via Hierarchical Large Language Models for Item and User ModelingCode4
EasyRAG: Efficient Retrieval-Augmented Generation Framework for Automated Network OperationsCode4
AutoWebGLM: A Large Language Model-based Web Navigating AgentCode4
Prompt-to-Prompt Image Editing with Cross Attention ControlCode4
QuIP#: Even Better LLM Quantization with Hadamard Incoherence and Lattice CodebooksCode4
Differential Privacy: What is all the noise about?Code4
Gated Delta Networks: Improving Mamba2 with Delta RuleCode4
Diffusion Models: A Comprehensive Survey of Methods and ApplicationsCode4
mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and VideoCode4
Retrieval-Augmented Generation for Knowledge-Intensive NLP TasksCode4
LettuceDetect: A Hallucination Detection Framework for RAG ApplicationsCode4
InternVideo: General Video Foundation Models via Generative and Discriminative LearningCode4
Optimizing Prompts for Text-to-Image GenerationCode4
DAMO-YOLO : A Report on Real-Time Object Detection DesignCode4
ViTMatte: Boosting Image Matting with Pretrained Plain Vision TransformersCode4
DeepInverse: A Python package for solving imaging inverse problems with deep learningCode4
Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-EvolutionCode4
Holistic Evaluation of Language ModelsCode4
Seed-Coder: Let the Code Model Curate Data for ItselfCode4
FullStack Bench: Evaluating LLMs as Full Stack CodersCode4
Motion Capture Dataset for Practical Use of AI-based Motion Editing and StylizationCode4
The Platonic Representation HypothesisCode4
MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation ExpertsCode4
ArchiSound: Audio Generation with DiffusionCode4
Aligning benchmark datasets for table structure recognitionCode4
Instruction Tuning with GPT-4Code4
depyf: Open the Opaque Box of PyTorch Compiler for Machine Learning ResearchersCode4
TOPIQ: A Top-down Approach from Semantics to Distortions for Image Quality AssessmentCode4
ChatHaruhi: Reviving Anime Character in Reality via Large Language ModelCode4
Dataverse: Open-Source ETL (Extract, Transform, Load) Pipeline for Large Language ModelsCode4
FLASC: A Flare-Sensitive Clustering AlgorithmCode4
Show:102550
← PrevPage 33 of 3547Next →