SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 15511600 of 659983 papers

TitleStatusHype
Restructuring Vector Quantization with the Rotation TrickCode4
Story-Adapter: A Training-free Iterative Framework for Long Story VisualizationCode4
Timer-XL: Long-Context Transformers for Unified Time Series ForecastingCode4
Choices are More Important than Efforts: LLM Enables Efficient Multi-Agent ExplorationCode4
shapiq: Shapley Interactions for Machine LearningCode4
OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction DataCode4
Evaluating Deep Regression Models for WSI-Based Gene-Expression PredictionCode4
Posterior-Mean Rectified Flow: Towards Minimum MSE Photo-Realistic Image RestorationCode4
Old Optimizer, New Norm: An AnthologyCode4
Replace Anyone in VideosCode4
Scaling Proprioceptive-Visual Learning with Heterogeneous Pre-trained TransformersCode4
Data-Prep-Kit: getting your data ready for LLM application developmentCode4
Lotus: Diffusion-based Visual Foundation Model for High-quality Dense PredictionCode4
VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language ModelsCode4
Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Vision-Language ModelsCode4
Whisper in Medusa's Ear: Multi-head Efficient Decoding for Transformer-based ASRCode4
Time-MoE: Billion-Scale Time Series Foundation Models with Mixture of ExpertsCode4
Zero-shot forecasting of chaotic systemsCode4
KISS-Matcher: Fast and Robust Point Cloud Registration RevisitedCode4
Video-XL: Extra-Long Vision Language Model for Hour-Scale Video UnderstandingCode4
HLLM: Enhancing Sequential Recommendations via Hierarchical Large Language Models for Item and User ModelingCode4
StoryMaker: Towards Holistic Consistent Characters in Text-to-image GenerationCode4
Fine-Tuning Image-Conditional Diffusion Models is Easier than You ThinkCode4
UltimateDO: An Efficient Framework to Marry Occupancy Prediction with 3D Object Detection via Channel2heightCode4
Kolmogorov-Arnold TransformerCode4
On the limits of agency in agent-based modelsCode4
Evaluating Pre-trained Convolutional Neural Networks and Foundation Models as Feature Extractors for Content-based Medical Image RetrievalCode4
Windows Agent Arena: Evaluating Multi-Modal OS Agents at ScaleCode4
GeoCalib: Learning Single-image Calibration with Geometric OptimizationCode4
RealisDance: Equip controllable character animation with realistic handsCode4
Open-MAGVIT2: An Open-Source Project Toward Democratizing Auto-regressive Visual GenerationCode4
One-Shot Diffusion Mimicker for Handwritten Text GenerationCode4
xLAM: A Family of Large Action Models to Empower AI Agent SystemsCode4
iText2KG: Incremental Knowledge Graphs Construction Using Large Language ModelsCode4
MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding BenchmarkCode4
LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QACode4
Large Language Model-Based Agents for Software Engineering: A SurveyCode4
OLMoE: Open Mixture-of-Experts Language ModelsCode4
Diffusion Policy Policy OptimizationCode4
IGEV++: Iterative Multi-range Geometry Encoding Volumes for Stereo MatchingCode4
CrisperWhisper: Accurate Timestamps on Verbatim Speech TranscriptionsCode4
Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of EncodersCode4
MegActor-Σ: Unlocking Flexible Mixed-Modal Control in Portrait Animation with Diffusion TransformerCode4
Text2SQL is Not Enough: Unifying AI and Databases with TAGCode4
Relationships are Complicated! An Analysis of Relationships Between Datasets on the WebCode4
EmbodiedSAM: Online Segment Any 3D Thing in Real TimeCode4
SZTU-CMU at MER2024: Improving Emotion-LLaMA with Conv-Attention for Multimodal Emotion RecognitionCode4
RUMI: Rummaging Using Mutual InformationCode4
DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree SearchCode4
FancyVideo: Towards Dynamic and Consistent Video Generation via Cross-frame Textual GuidanceCode4
Show:102550
← PrevPage 32 of 13200Next →