SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 5701–5750 of 661,570 papers

Title | Status | Hype
Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning | Code | 2
DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech | Code | 2
GPTScore: Evaluate as You Desire | Code | 2
Towards Learning Universal Hyperparameter Optimizers with Transformers | Code | 2
Towards Building the Federated GPT: Federated Instruction Tuning | Code | 2
GaussRender: Learning 3D Occupancy with Gaussian Rendering | Code | 2
Prompting for Numerical Sequences: A Case Study on Market Comment Generation | Code | 2
Learning Efficient Convolutional Networks through Network Slimming | Code | 2
SEAL: Steerable Reasoning Calibration of Large Language Models for Free | Code | 2
I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders | Code | 2
Multi-Modal Fusion Transformer for End-to-End Autonomous Driving | Code | 2
Harnessing Administrative Data Inventories to Create a Reliable Transnational Reference Database for Crop Type Monitoring | Code | 2
Online Decision Transformer | Code | 2
Synthesizing Anyone, Anywhere, in Any Pose | Code | 2
Learning Transferable Visual Models From Natural Language Supervision | Code | 2
When StyleGAN Meets Stable Diffusion: a W+ Adapter for Personalized Image Generation | Code | 2
Adapting Frechet Audio Distance for Generative Music Evaluation | Code | 2
Linearizing Large Language Models | Code | 2
UniGen: A Unified Framework for Textual Dataset Generation Using Large Language Models | Code | 2
Demystifying AI Platform Design for Distributed Inference of Next-Generation LLM models | Code | 2
SpA-Former: Transformer image shadow detection and removal via spatial attention | Code | 2
Layer-Condensed KV Cache for Efficient Inference of Large Language Models | Code | 2
DYffusion: A Dynamics-informed Diffusion Model for Spatiotemporal Forecasting | Code | 2
Kernel Neural Optimal Transport | Code | 2
Extract, Define, Canonicalize: An LLM-based Framework for Knowledge Graph Construction | Code | 2
WizMap: Scalable Interactive Visualization for Exploring Large Machine Learning Embeddings | Code | 2
Actuarial Applications of Natural Language Processing Using Transformers: Case Studies for Using Text Features in an Actuarial Context | Code | 2
Graph-based Neural Weather Prediction for Limited Area Modeling | Code | 2
MOROCCO: Model Resource Comparison Framework | Code | 2
LesionLocator: Zero-Shot Universal Tumor Segmentation and Tracking in 3D Whole-Body Imaging | Code | 2
Dilated Neighborhood Attention Transformer | Code | 2
Vakyansh: ASR Toolkit for Low Resource Indic languages | Code | 2
Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention | Code | 2
Surg-3M: A Dataset and Foundation Model for Perception in Surgical Settings | Code | 2
Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration | Code | 2
KaLM-Embedding-V2: Superior Training Techniques and Data Inspire A Versatile Embedding Model | Code | 2
Unifying Unsupervised Graph-Level Anomaly Detection and Out-of-Distribution Detection: A Benchmark | Code | 2
CodeS: Towards Building Open-source Language Models for Text-to-SQL | Code | 2
TransNormerLLM: A Faster and Better Large Language Model with Improved TransNormer | Code | 2
Learning Embeddings with Centroid Triplet Loss for Object Identification in Robotic Grasping | Code | 2
DiffDock-PP: Rigid Protein-Protein Docking with Diffusion Models | Code | 2
Number it: Temporal Grounding Videos like Flipping Manga | Code | 2
Two Heads are Better Than One: Test-time Scaling of Multi-agent Collaborative Reasoning | Code | 2
Meta-DT: Offline Meta-RL as Conditional Sequence Modeling with World Model Disentanglement | Code | 2
Score Jacobian Chaining: Lifting Pretrained 2D Diffusion Models for 3D Generation | Code | 2
RSRefSeg: Referring Remote Sensing Image Segmentation with Foundation Models | Code | 2
A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegrained Image Generation | Code | 2
Photoreal Scene Reconstruction from an Egocentric Device | Code | 2
How Much are Large Language Models Contaminated? A Comprehensive Survey and the LLMSanitize Library | Code | 2
TabLLM: Few-shot Classification of Tabular Data with Large Language Models | Code | 2