SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 11,201–11,250 of 661,570 papers

Title | Status | Hype
SSLRec: A Self-Supervised Learning Framework for Recommendation | Code | 2
LLM As DBA | Code | 2
Follow Anything: Open-set detection, tracking, and following in real-time | Code | 2
Flexible Isosurface Extraction for Gradient-Based Mesh Optimization | Code | 2
PoseBusters: AI-based docking methods fail to generate physically valid poses or generalise to novel sequences | Code | 2
YOLO-MS: Rethinking Multi-Scale Representation Learning for Real-time Object Detection | Code | 2
Fuzz4All: Universal Fuzzing with Large Language Models | Code | 2
PUG: Photorealistic and Semantically Controllable Synthetic Data for Representation Learning | Code | 2
Cumulative Reasoning with Large Language Models | Code | 2
LATR: 3D Lane Detection from Monocular Images with Transformer | Code | 2
FocalFormer3D : Focusing on Hard Instance for 3D Object Detection | Code | 2
3D-VisTA: Pre-trained Transformer for 3D Vision and Text Alignment | Code | 2
SimplyRetrieve: A Private and Lightweight Retrieval-Centric Generative AI Tool | Code | 2
Shepherd: A Critic for Language Model Generation | Code | 2
Fine-tuning Multimodal LLMs to Follow Zero-shot Demonstrative Instructions | Code | 2
AgentSims: An Open-Source Sandbox for Large Language Model Evaluation | Code | 2
ConDistFL: Conditional Distillation for Federated Learning from Partially Annotated Data | Code | 2
PokerKit: A Comprehensive Python Library for Fine-Grained Multi-Variant Poker Game Simulations | Code | 2
UniversalNER: Targeted Distillation from Large Language Models for Open Named Entity Recognition | Code | 2
TinyLVLM-eHub: Towards Comprehensive and Efficient Evaluation for Large Vision-Language Models | Code | 2
Zhongjing: Enhancing the Chinese Medical Capabilities of Large Language Model through Expert Feedback and Real-world Multi-turn Dialogue | Code | 2
SynJax: Structured Probability Distributions for JAX | Code | 2
AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning | Code | 2
Dual Aggregation Transformer for Image Super-Resolution | Code | 2
Make Explicit Calibration Implicit: Calibrate Denoiser Instead of the Noise Model | Code | 2
Spanish Pre-trained BERT Model and Evaluation Data | Code | 2
Automatically Correcting Large Language Models: Surveying the landscape of diverse self-correction strategies | Code | 2
Early Detection and Localization of Pancreatic Cancer by Label-Free Tumor Synthesis | Code | 2
EduChat: A Large-Scale Language Model-based Chatbot System for Intelligent Education | Code | 2
PowerSimulationsDynamics.jl -- An Open Source Modeling Package for Modern Power Systems with Inverter-Based Resources | Code | 2
Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP | Code | 2
Towards Generalist Foundation Model for Radiology by Leveraging Web-scale 2D&3D Medical Data | Code | 2
FB-BEV: BEV Representation from Forward-Backward View Transformations | Code | 2
MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities | Code | 2
UniSim: A Neural Closed-Loop Sensor Simulator | Code | 2
Scaling Relationship on Learning Mathematical Reasoning with Large Language Models | Code | 2
ConceptLab: Creative Concept Generation using VLM-Guided Diffusion Prior Constraints | Code | 2
The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World | Code | 2
DETR Doesn't Need Multi-Scale or Locality Design | Code | 2
From Sparse to Soft Mixtures of Experts | Code | 2
Flows: Building Blocks of Reasoning and Collaborating AI | Code | 2
Hybrid-SORT: Weak Cues Matter for Online Multi-Object Tracking | Code | 2
AnyLoc: Towards Universal Visual Place Recognition | Code | 2
DriveAdapter: Breaking the Coupling Barrier of Perception and Planning in End-to-End Autonomous Driving | Code | 2
FLatten Transformer: Vision Transformer using Focused Linear Attention | Code | 2
UniVTG: Towards Unified Video-Language Temporal Grounding | Code | 2
MovieChat: From Dense Token to Sparse Memory for Long Video Understanding | Code | 2
LP-MusicCaps: LLM-Based Pseudo Music Captioning | Code | 2
All-In-One Metrical And Functional Structure Analysis With Neighborhood Attentions on Demixed Audio | Code | 2
VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design | Code | 2