SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 19011950 of 659983 papers

TitleStatusHype
Video-LaVIT: Unified Video-Language Pre-training with Decoupled Visual-Motional TokenizationCode4
ActiveAnno3D -- An Active Learning Framework for Multi-Modal 3D Object DetectionCode4
InstanceDiffusion: Instance-level Control for Image GenerationCode4
Timer: Generative Pre-trained Transformers Are Large Time Series ModelsCode4
VM-UNet: Vision Mamba UNet for Medical Image SegmentationCode4
DiffEditor: Boosting Accuracy and Flexibility on Diffusion-based Image EditingCode4
LLM-Enhanced Data ManagementCode4
Image Fusion via Vision-Language ModelCode4
Parameter-Efficient Fine-Tuning for Pre-Trained Vision Models: A SurveyCode4
Boximator: Generating Rich and Controllable Motions for Video SynthesisCode4
Nomic Embed: Training a Reproducible Long Context Text EmbedderCode4
KTO: Model Alignment as Prospect Theoretic OptimizationCode4
Large Language Models for Time Series: A SurveyCode4
A Comprehensive Survey on 3D Content GenerationCode4
Lightweight Pixel Difference Networks for Efficient Visual Representation LearningCode4
Recurrent Partial Kernel Network for Efficient Optical Flow EstimationCode4
AnimateLCM: Computation-Efficient Personalized Style Video Generation without Personalized Video DataCode4
Agile But Safe: Learning Collision-Free High-Speed Legged LocomotionCode4
I Think, Therefore I am: Benchmarking Awareness of Large Language Models Using AwareBenchCode4
Proactive Detection of Voice Cloning with Localized WatermarkingCode4
InstructIR: High-Quality Image Restoration Following Human InstructionsCode4
Continual Learning with Pre-Trained Models: A SurveyCode4
ServerlessLLM: Low-Latency Serverless Inference for Large Language ModelsCode4
SegMamba: Long-range Sequential Modeling Mamba For 3D Medical Image SegmentationCode4
OK-Robot: What Really Matters in Integrating Open-Knowledge Models for RoboticsCode4
Orion-14B: Open-source Multilingual Large Language ModelsCode4
Knowledge Fusion of Large Language ModelsCode4
GPAvatar: Generalizable and Precise Head Avatar from Image(s)Code4
PIN-SLAM: LiDAR SLAM Using a Point-Based Implicit Neural Representation for Achieving Global Map ConsistencyCode4
ReFT: Reasoning with Reinforced Fine-TuningCode4
Contrastive Preference Optimization: Pushing the Boundaries of LLM Performance in Machine TranslationCode4
Transformer for Object Re-Identification: A SurveyCode4
Scalable 3D Panoptic Segmentation As Superpoint Graph ClusteringCode4
Efficient Deformable ConvNets: Rethinking Dynamic and Sparse Operator for Vision ApplicationsCode4
TRIPS: Trilinear Point Splatting for Real-Time Radiance Field RenderingCode4
TOFU: A Task of Fictitious Unlearning for LLMsCode4
TrustLLM: Trustworthiness in Large Language ModelsCode4
Tiny Time Mixers (TTMs): Fast Pre-trained Models for Enhanced Zero/Few-Shot Forecasting of Multivariate Time SeriesCode4
Mixtral of ExpertsCode4
CRUXEval: A Benchmark for Code Reasoning, Understanding and ExecutionCode4
Efficient Parameter Optimisation for Quantum Kernel Alignment: A Sub-sampling Approach in Variational TrainingCode4
LLaMA Pro: Progressive LLaMA with Block ExpansionCode4
GPT-4V(ision) is a Generalist Web Agent, if GroundedCode4
LLM Maybe LongLM: Self-Extend LLM Context Window Without TuningCode4
PAIR Diffusion: A Comprehensive Multimodal Object-Level Image EditorCode4
V?: Guided Visual Search as a Core Mechanism in Multimodal LLMsCode4
Video Understanding with Large Language Models: A SurveyCode4
Fast Inference of Mixture-of-Experts Language Models with OffloadingCode4
LISA++: An Improved Baseline for Reasoning Segmentation with Large Language ModelCode4
G-LLaVA: Solving Geometric Problem with Multi-Modal Large Language ModelCode4
Show:102550
← PrevPage 39 of 13200Next →