SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 8551–8600 of 661570 papers

| Title | Status | Hype |
| --- | --- | --- |
| Enhancing Large Vision Language Models with Self-Training on Image Comprehension | Code | 2 |
| Easy Problems That LLMs Get Wrong | Code | 2 |
| Group Robust Preference Optimization in Reward-free RLHF | Code | 2 |
| Promptus: Can Prompts Streaming Replace Video Streaming with Stable Diffusion | Code | 2 |
| ANAH: Analytical Annotation of Hallucinations in Large Language Models | Code | 2 |
| Recurrent neural network wave functions for Rydberg atom arrays on kagome lattice | Code | 2 |
| N-Dimensional Gaussians for Fitting of High Dimensional Functions | Code | 2 |
| STHN: Deep Homography Estimation for UAV Thermal Geo-localization with Satellite Imagery | Code | 2 |
| Fully-inductive Node Classification on Arbitrary Graphs | Code | 2 |
| Improving the Training of Rectified Flows | Code | 2 |
| LLaMEA: A Large Language Model Evolutionary Algorithm for Automatically Generating Metaheuristics | Code | 2 |
| All-In-One Medical Image Restoration via Task-Adaptive Routing | Code | 2 |
| DeMamba: AI-Generated Video Detection on Million-Scale GenVideo Benchmark | Code | 2 |
| Open-Set Domain Adaptation for Semantic Segmentation | Code | 2 |
| OccSora: 4D Occupancy Generation Models as World Simulators for Autonomous Driving | Code | 2 |
| Bridging Model-Based Optimization and Generative Modeling via Conservative Fine-Tuning of Diffusion Models | Code | 2 |
| Self-Exploring Language Models: Active Preference Elicitation for Online Alignment | Code | 2 |
| NeRF On-the-go: Exploiting Uncertainty for Distractor-free NeRFs in the Wild | Code | 2 |
| CheXpert Plus: Augmenting a Large Chest X-ray Dataset with Text Radiology Reports, Patient Demographics and Additional Image Formats | Code | 2 |
| SketchDeco: Decorating B&W Sketches with Colour | Code | 2 |
| Flow Priors for Linear Inverse Problems via Iterative Corrupted Trajectory Matching | Code | 2 |
| VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos | Code | 2 |
| RNAFlow: RNA Structure & Sequence Design via Inverse Folding-Based Flow Matching | Code | 2 |
| Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models | Code | 2 |
| CtrlA: Adaptive Retrieval-Augmented Generation via Inherent Control | Code | 2 |
| Benchmarking and Improving Detail Image Caption | Code | 2 |
| Compressing Large Language Models using Low Rank and Low Precision Decomposition | Code | 2 |
| Enhancing Zero-Shot Facial Expression Recognition by LLM Knowledge Transfer | Code | 2 |
| Can Graph Learning Improve Planning in LLM-based Agents? | Code | 2 |
| Matryoshka Query Transformer for Large Vision-Language Models | Code | 2 |
| ViG: Linear-complexity Visual Sequence Learning with Gated Linear Attention | Code | 2 |
| Hardware-Aware Parallel Prompt Decoding for Memory-Efficient Acceleration of LLM Inference | Code | 2 |
| SleepFM: Multi-modal Representation Learning for Sleep Across Brain Activity, ECG and Respiratory Signals | Code | 2 |
| FreeSplat: Generalizable 3D Gaussian Splatting Towards Free-View Synthesis of Indoor Scenes | Code | 2 |
| Seeing the Image: Prioritizing Visual Correlation by Contrastive Alignment | Code | 2 |
| XTrack: Multimodal Training Boosts RGB-X Video Object Trackers | Code | 2 |
| Safe Multi-Agent Reinforcement Learning with Bilevel Optimization in Autonomous Driving | Code | 2 |
| Long Context is Not Long at All: A Prospector of Long-Dependency Data for Large Language Models | Code | 2 |
| Frustratingly Easy Test-Time Adaptation of Vision-Language Models | Code | 2 |
| Instruct-ReID++: Towards Universal Purpose Instruction-Guided Person Re-identification | Code | 2 |
| Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations | Code | 2 |
| Online Merging Optimizers for Boosting Rewards and Mitigating Tax in Alignment | Code | 2 |
| Color Shift Estimation-and-Correction for Image Enhancement | Code | 2 |
| MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance | Code | 2 |
| Deform3DGS: Flexible Deformation for Fast Surgical Scene Reconstruction with Gaussian Splatting | Code | 2 |
| FlashST: A Simple and Universal Prompt-Tuning Framework for Traffic Prediction | Code | 2 |
| SoundCTM: Unifying Score-based and Consistency Models for Full-band Text-to-Sound Generation | Code | 2 |
| FASTopic: Pretrained Transformer is a Fast, Adaptive, Stable, and Transferable Topic Model | Code | 2 |
| Dataset Regeneration for Sequential Recommendation | Code | 2 |
| Adapting Pre-Trained Vision Models for Novel Instance Detection and Segmentation | Code | 2 |
Page 172 of 13232