SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 18011850 of 659983 papers

TitleStatusHype
ChatDoctor: A Medical Chat Model Fine-Tuned on a Large Language Model Meta-AI (LLaMA) Using Medical Domain KnowledgeCode4
A Survey on Large Language Models for RecommendationCode4
Segment Anything in Medical ImagesCode4
mPLUG-Owl: Modularization Empowers Large Language Models with MultimodalityCode4
The Ideal Continual Learner: An Agent That Never ForgetsCode4
OK-Robot: What Really Matters in Integrating Open-Knowledge Models for RoboticsCode4
The Segment Anything Model (SAM) for Remote Sensing Applications: From Zero to One ShotCode4
Turning Whisper into Real-Time Transcription SystemCode4
EasyJailbreak: A Unified Framework for Jailbreaking Large Language ModelsCode4
Neural general circulation models optimized to predict satellite-based precipitation observationsCode4
Don't Ignore Dual Logic Ability of LLMs while Privatizing: A Data-Intensive Analysis in Medical DomainCode4
DreamGaussian: Generative Gaussian Splatting for Efficient 3D Content CreationCode4
An Empirical Study of Instruction-tuning Large Language Models in ChineseCode4
Unifying the Perspectives of NLP and Software Engineering: A Survey on Language Models for CodeCode4
OpenProteinSet: Training data for structural biology at scaleCode4
OpenAGI: When LLM Meets Domain ExpertsCode4
Efficient Parameter Optimisation for Quantum Kernel Alignment: A Sub-sampling Approach in Variational TrainingCode4
PIN-SLAM: LiDAR SLAM Using a Point-Based Implicit Neural Representation for Achieving Global Map ConsistencyCode4
Open-MAGVIT2: An Open-Source Project Toward Democratizing Auto-regressive Visual GenerationCode4
MIGC: Multi-Instance Generation Controller for Text-to-Image SynthesisCode4
Bryndza at ClimateActivism 2024: Stance, Target and Hate Event Detection via Retrieval-Augmented GPT-4 and LLaMACode4
AgentOhana: Design Unified Data and Training Pipeline for Effective Agent LearningCode4
shapiq: Shapley Interactions for Machine LearningCode4
ResAdapter: Domain Consistent Resolution Adapter for Diffusion ModelsCode4
Tiny Machine Learning: Progress and FuturesCode4
End-to-End Autonomous Driving through V2X CooperationCode4
MiniGPT4-Video: Advancing Multimodal LLMs for Video Understanding with Interleaved Visual-Textual TokensCode4
SkyReels-A1: Expressive Portrait Animation in Video Diffusion TransformersCode4
PLLaVA : Parameter-free LLaVA Extension from Images to Videos for Video Dense CaptioningCode4
Direct Training High-Performance Deep Spiking Neural Networks: A Review of Theories and MethodsCode4
LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression ToolkitCode4
Morphological Prototyping for Unsupervised Slide Representation Learning in Computational PathologyCode4
OmniGlue: Generalizable Feature Matching with Foundation Model GuidanceCode4
LLMs Meet Multimodal Generation and Editing: A SurveyCode4
Grokfast: Accelerated Grokking by Amplifying Slow GradientsCode4
HelpSteer2: Open-source dataset for training top-performing reward modelsCode4
Nemotron-4 340B Technical ReportCode4
Improving Multi-modal Recommender Systems by Denoising and Aligning Multi-modal Content and User FeedbackCode4
Unleashing the Emergent Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-CollaborationCode4
Colossal-AI: A Unified Deep Learning System For Large-Scale Parallel TrainingCode4
SSL4EO-L: Datasets and Foundation Models for Landsat ImageryCode4
Continual Learning with Pre-Trained Models: A SurveyCode4
MAVIS: Mathematical Visual Instruction Tuning with an Automatic Data EngineCode4
Tarsier: Recipes for Training and Evaluating Large Video Description ModelsCode4
Wavelet Convolutions for Large Receptive FieldsCode4
A Survey of Attacks on Large Vision-Language Models: Resources, Advances, and Future TrendsCode4
Stable-Hair: Real-World Hair Transfer via Diffusion ModelCode4
Timer: Generative Pre-trained Transformers Are Large Time Series ModelsCode4
Time-MoE: Billion-Scale Time Series Foundation Models with Mixture of ExpertsCode4
Whisper in Medusa's Ear: Multi-head Efficient Decoding for Transformer-based ASRCode4
Show:102550
← PrevPage 37 of 13200Next →