SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 16011650 of 659983 papers

TitleStatusHype
Don't Ignore Dual Logic Ability of LLMs while Privatizing: A Data-Intensive Analysis in Medical DomainCode4
DreamGaussian: Generative Gaussian Splatting for Efficient 3D Content CreationCode4
An Empirical Study of Instruction-tuning Large Language Models in ChineseCode4
Unifying the Perspectives of NLP and Software Engineering: A Survey on Language Models for CodeCode4
OpenProteinSet: Training data for structural biology at scaleCode4
OpenAGI: When LLM Meets Domain ExpertsCode4
Efficient Parameter Optimisation for Quantum Kernel Alignment: A Sub-sampling Approach in Variational TrainingCode4
PIN-SLAM: LiDAR SLAM Using a Point-Based Implicit Neural Representation for Achieving Global Map ConsistencyCode4
Open-MAGVIT2: An Open-Source Project Toward Democratizing Auto-regressive Visual GenerationCode4
MIGC: Multi-Instance Generation Controller for Text-to-Image SynthesisCode4
Bryndza at ClimateActivism 2024: Stance, Target and Hate Event Detection via Retrieval-Augmented GPT-4 and LLaMACode4
AgentOhana: Design Unified Data and Training Pipeline for Effective Agent LearningCode4
shapiq: Shapley Interactions for Machine LearningCode4
ResAdapter: Domain Consistent Resolution Adapter for Diffusion ModelsCode4
Tiny Machine Learning: Progress and FuturesCode4
End-to-End Autonomous Driving through V2X CooperationCode4
MiniGPT4-Video: Advancing Multimodal LLMs for Video Understanding with Interleaved Visual-Textual TokensCode4
SkyReels-A1: Expressive Portrait Animation in Video Diffusion TransformersCode4
PLLaVA : Parameter-free LLaVA Extension from Images to Videos for Video Dense CaptioningCode4
Direct Training High-Performance Deep Spiking Neural Networks: A Review of Theories and MethodsCode4
LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression ToolkitCode4
Morphological Prototyping for Unsupervised Slide Representation Learning in Computational PathologyCode4
OmniGlue: Generalizable Feature Matching with Foundation Model GuidanceCode4
LLMs Meet Multimodal Generation and Editing: A SurveyCode4
Grokfast: Accelerated Grokking by Amplifying Slow GradientsCode4
HelpSteer2: Open-source dataset for training top-performing reward modelsCode4
Nemotron-4 340B Technical ReportCode4
Improving Multi-modal Recommender Systems by Denoising and Aligning Multi-modal Content and User FeedbackCode4
Unleashing the Emergent Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-CollaborationCode4
Colossal-AI: A Unified Deep Learning System For Large-Scale Parallel TrainingCode4
SSL4EO-L: Datasets and Foundation Models for Landsat ImageryCode4
Continual Learning with Pre-Trained Models: A SurveyCode4
MAVIS: Mathematical Visual Instruction Tuning with an Automatic Data EngineCode4
Tarsier: Recipes for Training and Evaluating Large Video Description ModelsCode4
Wavelet Convolutions for Large Receptive FieldsCode4
A Survey of Attacks on Large Vision-Language Models: Resources, Advances, and Future TrendsCode4
Stable-Hair: Real-World Hair Transfer via Diffusion ModelCode4
Timer: Generative Pre-trained Transformers Are Large Time Series ModelsCode4
Time-MoE: Billion-Scale Time Series Foundation Models with Mixture of ExpertsCode4
Whisper in Medusa's Ear: Multi-head Efficient Decoding for Transformer-based ASRCode4
Improving Data Augmentation-based Cross-Speaker Style Transfer for TTS with Singing Voice, Style Filtering, and F0 MatchingCode4
Fully Open Source Moxin-7B Technical ReportCode4
The Thousand Brains Project: A New Paradigm for Sensorimotor IntelligenceCode4
GPT4Scene: Understand 3D Scenes from Videos with Vision-Language ModelsCode4
VLog: Video-Language Models by Generative Retrieval of Narration VocabularyCode4
Light-R1: Curriculum SFT, DPO and RL for Long COT from Scratch and BeyondCode4
Kornia-rs: A Low-Level 3D Computer Vision Library In RustCode4
Metric3D: Towards Zero-shot Metric 3D Prediction from A Single ImageCode4
DeepFaceLab: Integrated, flexible and extensible face-swapping frameworkCode4
PromptFix: You Prompt and We Fix the PhotoCode4
Show:102550
← PrevPage 33 of 13200Next →