SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 86018650 of 661570 papers

TitleStatusHype
Frustratingly Easy Test-Time Adaptation of Vision-Language ModelsCode2
REPARO: Compositional 3D Assets Generation with Differentiable 3D Layout AlignmentCode2
Color Shift Estimation-and-Correction for Image EnhancementCode2
TransVIP: Speech to Speech Translation System with Voice and Isochrony PreservationCode2
ViG: Linear-complexity Visual Sequence Learning with Gated Linear AttentionCode2
DiG: Scalable and Efficient Diffusion Models with Gated Linear AttentionCode2
Safe Multi-Agent Reinforcement Learning with Bilevel Optimization in Autonomous DrivingCode2
Online Merging Optimizers for Boosting Rewards and Mitigating Tax in AlignmentCode2
Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction TuningCode2
AutoPSV: Automated Process-Supervised VerifierCode2
NoteLLM-2: Multimodal Large Representation Models for RecommendationCode2
Multi-Behavior Generative RecommendationCode2
BioDiscoveryAgent: An AI Agent for Designing Genetic Perturbation ExperimentsCode2
Memorize What Matters: Emergent Scene Decomposition from MultitraverseCode2
LoRA-XS: Low-Rank Adaptation with Extremely Small Number of ParametersCode2
Saturn: Sample-efficient Generative Molecular Design using Memory ManipulationCode2
Spectral-Refiner: Accurate Fine-Tuning of Spatiotemporal Fourier Neural Operator for Turbulent FlowsCode2
DMPlug: A Plug-in Method for Solving Inverse Problems with Diffusion ModelsCode2
Motion-Agent: A Conversational Framework for Human Motion Generation with LLMsCode2
Reason3D: Searching and Reasoning 3D Segmentation via Large Language ModelCode2
Empowering Large Language Models to Set up a Knowledge Retrieval Indexer via Self-LearningCode2
TokenUnify: Scalable Autoregressive Visual Pre-training with Mixture Token PredictionCode2
Position: Foundation Agents as the Paradigm Shift for Decision MakingCode2
MultiOOD: Scaling Out-of-Distribution Detection for Multiple ModalitiesCode2
Any-step Dynamics Model Improves Future Predictions for Online and Offline Reinforcement LearningCode2
EASI-Tex: Edge-Aware Mesh Texturing from Single ImageCode2
Autoformalizing Euclidean GeometryCode2
Are Self-Attentions Effective for Time Series Forecasting?Code2
A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model TrainingCode2
Zero-Shot Video Semantic Segmentation based on Pre-Trained Diffusion ModelsCode2
VoCoT: Unleashing Visually Grounded Multi-Step Reasoning in Large Multi-Modal ModelsCode2
DC-Gaussian: Improving 3D Gaussian Splatting for Reflective Dash Cam VideosCode2
Content-Style Decoupling for Unsupervised Makeup Transfer without Generating Pseudo Ground TruthCode2
M^3CoT: A Novel Benchmark for Multi-Domain Multi-step Multi-modal Chain-of-ThoughtCode2
Multi-Modal UAV Detection, Classification and Tracking Algorithm -- Technical Report for CVPR 2024 UG2 ChallengeCode2
Decomposing the Neurons: Activation Sparsity via Mixture of Experts for Continual Test Time AdaptationCode2
AdaFisher: Adaptive Second Order Optimization via Fisher InformationCode2
LoQT: Low-Rank Adapters for Quantized PretrainingCode2
A Survey of Multimodal Large Language Model from A Data-centric PerspectiveCode2
Medical MLLM is Vulnerable: Cross-Modality Jailbreak and Mismatched Attacks on Medical Multimodal Large Language ModelsCode2
Splat-SLAM: Globally Optimized RGB-only SLAM with 3D GaussiansCode2
Crafting Interpretable Embeddings by Asking LLMs QuestionsCode2
KG-FIT: Knowledge Graph Fine-Tuning Upon Open-World KnowledgeCode2
MambaTS: Improved Selective State Space Models for Long-term Time Series ForecastingCode2
REACT: Real-time Efficiency and Accuracy Compromise for Tradeoffs in Scene Graph GenerationCode2
DynRefer: Delving into Region-level Multi-modality Tasks via Dynamic ResolutionCode2
Continuous Temporal Domain GeneralizationCode2
MoEUT: Mixture-of-Experts Universal TransformersCode2
Diffusion-based Reinforcement Learning via Q-weighted Variational Policy OptimizationCode2
Underwater Image Enhancement by Diffusion Model with Customized CLIP-ClassifierCode2
Show:102550
← PrevPage 173 of 13232Next →