SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 1075110800 of 661570 papers

TitleStatusHype
BakedAvatar: Baking Neural Fields for Real-Time Head Avatar SynthesisCode2
On the Road with GPT-4V(ision): Early Explorations of Visual-Language Model on Autonomous DrivingCode2
LLaVA-Plus: Learning to Use Tools for Creating Multimodal AgentsCode2
A Survey on Hallucination in Large Language Models: Principles, Taxonomy, Challenges, and Open QuestionsCode2
Agent Lumos: Unified and Modular Training for Open-Source Language AgentsCode2
A differentiable brain simulator bridging brain simulation and brain-inspired computingCode2
BeLLM: Backward Dependency Enhanced Large Language Model for Sentence EmbeddingsCode2
High-Performance Transformers for Table Structure Recognition Need Early ConvolutionsCode2
CellPhoneDB v5: inferring cell-cell communication from single-cell multiomics dataCode2
Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Prior for Zero-shot Speaker AdaptationCode2
Euclidean, Projective, Conformal: Choosing a Geometric Algebra for Equivariant TransformersCode2
NExT-Chat: An LMM for Chat, Detection and SegmentationCode2
Rethinking Benchmark and Contamination for Language Models with Rephrased SamplesCode2
Neuro-GPT: Towards A Foundation Model for EEGCode2
Black-Box Prompt Optimization: Aligning Large Language Models without Model TrainingCode2
A Survey of Large Language Models AttributionCode2
Towards Garment Sewing Pattern Reconstruction from a Single ImageCode2
A Foundation Model for Music InformaticsCode2
Language Models are Super Mario: Absorbing Abilities from Homologous Models as a Free LunchCode2
PhoGPT: Generative Pre-training for VietnameseCode2
Can LLMs Follow Simple Rules?Code2
GLaMM: Pixel Grounding Large Multimodal ModelCode2
QECO: A QoE-Oriented Computation Offloading Algorithm based on Deep Reinforcement Learning for Mobile Edge ComputingCode2
MFTCoder: Boosting Code LLMs with Multitask Fine-TuningCode2
Simplifying Transformer BlocksCode2
EmerNeRF: Emergent Spatial-Temporal Scene Decomposition via Self-SupervisionCode2
Medical Image Segmentation with Domain Adaptation: A SurveyCode2
Large Language Models Illuminate a Progressive Pathway to Artificial Healthcare Assistant: A ReviewCode2
PPI++: Efficient Prediction-Powered InferenceCode2
Diffusion Models for Reinforcement Learning: A SurveyCode2
Adapting Frechet Audio Distance for Generative Music EvaluationCode2
ProAgent: From Robotic Process Automation to Agentic Process AutomationCode2
TopicGPT: A Prompt-based Topic Modeling FrameworkCode2
Instruction Distillation Makes Large Language Models Efficient Zero-shot RankersCode2
JADE: A Linguistics-based Safety Evaluation Platform for Large Language ModelsCode2
OpenForest: A data catalogue for machine learning in forest monitoringCode2
SoulChat: Improving LLMs' Empathy, Listening, and Comfort Abilities through Fine-tuning with Multi-turn Empathy ConversationsCode2
Efficient LLM Inference on CPUsCode2
Low-latency Real-time Voice Conversion on CPUCode2
What's In My Big Data?Code2
SEINE: Short-to-Long Video Diffusion Model for Generative Transition and PredictionCode2
CapsFusion: Rethinking Image-Text Data at ScaleCode2
ZoomNeXt: A Unified Collaborative Pyramid Network for Camouflaged Object DetectionCode2
Mathematical Introduction to Deep Learning: Methods, Implementations, and TheoryCode2
Modular Boundaries in Recurrent Neural NetworksCode2
TransXNet: Learning Both Global and Local Dynamics with a Dual Dynamic Token Mixer for Visual RecognitionCode2
Evaluating Large Language Models: A Comprehensive SurveyCode2
Large Trajectory Models are Scalable Motion Predictors and PlannersCode2
Battle of the Backbones: A Large-Scale Comparison of Pretrained Models across Computer Vision TasksCode2
Atom: Low-bit Quantization for Efficient and Accurate LLM ServingCode2
Show:102550
← PrevPage 216 of 13232Next →