SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 76517700 of 661570 papers

TitleStatusHype
DiffusionTrack: Diffusion Model For Multi-Object TrackingCode2
IT3D: Improved Text-to-3D Generation with Explicit View SynthesisCode2
Knowledge Graph Prompting for Multi-Document Question AnsweringCode2
PromptIR: Prompting for All-in-One Image RestorationCode2
Diffuse, Attend, and Segment: Unsupervised Zero-Shot Segmentation using Stable DiffusionCode2
FastSurfer-HypVINN: Automated sub-segmentation of the hypothalamus and adjacent structures on high-resolutional brain MRICode2
DARWIN Series: Domain Specific Large Language Models for Natural ScienceCode2
Selective Prompt Anchoring for Code GenerationCode2
Learning Pattern-Specific Experts for Time Series Forecasting Under Patch-level Distribution ShiftCode2
Pushing Mixture of Experts to the Limit: Extremely Parameter Efficient MoE for Instruction TuningCode2
Temporal Action Localization with Enhanced Instant DiscriminabilityCode2
PyMOLfold: Interactive Protein and Ligand Structure Prediction in PyMOLCode2
MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Grammatical Error CorrectionCode2
Commands as AI ConversationsCode2
HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier TransformCode2
Grasp-Anything: Large-scale Grasp Dataset from Foundation ModelsCode2
Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured SparsityCode2
RMT: Retentive Networks Meet Vision TransformersCode2
Detect Everything with Few ExamplesCode2
Detecting and Grounding Multi-Modal Media Manipulation and BeyondCode2
Improving CLIP Fine-tuning PerformanceCode2
PandaGuard: Systematic Evaluation of LLM Safety against Jailbreaking AttacksCode2
GenSim: Generating Robotic Simulation Tasks via Large Language ModelsCode2
Interpreting CLIP's Image Representation via Text-Based DecompositionCode2
Colmap-PCD: An Open-source Tool for Fine Image-to-point cloud RegistrationCode2
TopoMLP: A Simple yet Strong Pipeline for Driving Topology ReasoningCode2
A Semantic Invariant Robust Watermark for Large Language ModelsCode2
Survey on Factuality in Large Language Models: Knowledge, Retrieval and Domain-SpecificityCode2
Im4D: High-Fidelity and Real-Time Novel View Synthesis for Dynamic ScenesCode2
OmniControl: Control Any Joint at Any Time for Human Motion GenerationCode2
Generative Adversarial Training for Text-to-Speech Synthesis Based on Raw Phonetic Input and Explicit Prosody ModellingCode2
DeltaSpace: A Semantic-aligned Feature Space for Flexible Text-guided Image EditingCode2
Character-LLM: A Trainable Agent for Role-PlayingCode2
Few-Shot Learning Patterns in Financial Time-Series for Trend-Following StrategiesCode2
Reflection-Tuning: Data Recycling Improves LLM Instruction-TuningCode2
PromptCBLUE: A Chinese Prompt Tuning Benchmark for the Medical DomainCode2
CapsFusion: Rethinking Image-Text Data at ScaleCode2
TopicGPT: A Prompt-based Topic Modeling FrameworkCode2
Simplifying Transformer BlocksCode2
Instruction Distillation Makes Large Language Models Efficient Zero-shot RankersCode2
GLaMM: Pixel Grounding Large Multimodal ModelCode2
Neuro-GPT: Towards A Foundation Model for EEGCode2
A Survey of Large Language Models AttributionCode2
NExT-Chat: An LMM for Chat, Detection and SegmentationCode2
On the Road with GPT-4V(ision): Early Explorations of Visual-Language Model on Autonomous DrivingCode2
Semi-Supervised Domain Generalizable Person Re-IdentificationCode2
Ant Colony Sampling with GFlowNets for Combinatorial OptimizationCode2
To See is to Believe: Prompting GPT-4V for Better Visual Instruction TuningCode2
Spyx: A Library for Just-In-Time Compiled Optimization of Spiking Neural NetworksCode2
Neural General Circulation Models for Weather and ClimateCode2
Show:102550
← PrevPage 154 of 13232Next →