SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 96519700 of 661570 papers

TitleStatusHype
BAGS: Blur Agnostic Gaussian Splatting through Multi-Scale Kernel ModelingCode2
JAX-SPH: A Differentiable Smoothed Particle Hydrodynamics FrameworkCode2
LLMs in the Imaginarium: Tool Learning through Simulated Trial and ErrorCode2
Large Language Models are In-Context Molecule LearnersCode2
AUFormer: Vision Transformers are Parameter-Efficient Facial Action Unit DetectorsCode2
Online Adaptation of Language Models with a Memory of Amortized ContextsCode2
Mastering Memory Tasks with World ModelsCode2
QAQ: Quality Adaptive Quantization for LLM KV CacheCode2
Active Generalized Category DiscoveryCode2
An Item is Worth a Prompt: Versatile Image Editing with Disentangled ControlCode2
Backtracing: Retrieving the Cause of the QueryCode2
Extend Your Own Correspondences: Unsupervised Distant Point Cloud Registration by Progressive Distance ExtensionCode2
Mamba4Rec: Towards Efficient Sequential Recommendation with Selective State Space ModelsCode2
Task Attribute Distance for Few-Shot Learning: Theoretical Analysis and ApplicationsCode2
MeaCap: Memory-Augmented Zero-shot Image CaptioningCode2
Learning to Decode Collaboratively with Multiple Language ModelsCode2
Apollo: A Lightweight Multilingual Medical LLM towards Democratizing Medical AI to 6B PeopleCode2
MolNexTR: A Generalized Deep Learning Model for Molecular Image RecognitionCode2
ShortGPT: Layers in Large Language Models are More Redundant Than You ExpectCode2
VastTrack: Vast Category Visual Object TrackingCode2
DPOT: Auto-Regressive Denoising Operator Transformer for Large-Scale PDE Pre-TrainingCode2
GPTopic: Dynamic and Interactive Topic RepresentationsCode2
An L-BFGS-B approach for linear and nonlinear system identification under _1 and group-Lasso regularizationCode2
NoiseCollage: A Layout-Aware Text-to-Image Diffusion Model Based on Noise Cropping and MergingCode2
Diffusion-based Generative Prior for Low-Complexity MIMO Channel EstimationCode2
Are Language Models Puzzle Prodigies? Algorithmic Puzzles Unveil Serious Challenges in Multimodal ReasoningCode2
MADTP: Multimodal Alignment-Guided Dynamic Token Pruning for Accelerating Vision-Language TransformerCode2
Towards Measuring and Modeling "Culture" in LLMs: A SurveyCode2
FinReport: Explainable Stock Earnings Forecasting via News Factor Analyzing ModelCode2
Interactive Continual Learning: Fast and Slow ThinkingCode2
PPFlow: Target-aware Peptide Design with Torsional Flow MatchingCode2
Android in the Zoo: Chain-of-Action-Thought for GUI AgentsCode2
Learning without Exact Guidance: Updating Large-scale High-resolution Land Cover Maps from Low-resolution Historical LabelsCode2
Semantic Human Mesh Reconstruction with TexturesCode2
InjecAgent: Benchmarking Indirect Prompt Injections in Tool-Integrated Large Language Model AgentsCode2
ESM All-Atom: Multi-scale Protein Language Model for Unified Molecular ModelingCode2
What do we learn from inverting CLIP models?Code2
TESTAM: A Time-Enhanced Spatio-Temporal Attention Model with Mixture of ExpertsCode2
PointCore: Efficient Unsupervised Point Cloud Anomaly Detector Using Local-Global FeaturesCode2
Multi-perspective Improvement of Knowledge Graph Completion with Large Language ModelsCode2
Trainable Fractional Fourier TransformCode2
Large language models surpass human experts in predicting neuroscience resultsCode2
VTG-GPT: Tuning-Free Zero-Shot Video Temporal Grounding with GPTCode2
Learning to Solve Job Shop Scheduling under UncertaintyCode2
xT: Nested Tokenization for Larger Context in Large ImagesCode2
MiM-ISTD: Mamba-in-Mamba for Efficient Infrared Small Target DetectionCode2
Birbal: An efficient 7B instruct-model fine-tuned with curated datasetsCode2
Wukong: Towards a Scaling Law for Large-Scale RecommendationCode2
PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker RecordingsCode2
Making Pre-trained Language Models Great on Tabular PredictionCode2
Show:102550
← PrevPage 194 of 13232Next →