SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 5051-5100 of 661,570 papers

Title | Status | Hype
MOOSE-Chem: Large Language Models for Rediscovering Unseen Chemistry Scientific Hypotheses | Code | 2
Source-Free Domain Adaptation with Frozen Multimodal Foundation Model | Code | 2
CRA-PCN: Point Cloud Completion with Intra- and Inter-level Cross-Resolution Transformers | Code | 2
TimeLMs: Diachronic Language Models from Twitter | Code | 2
string2string: A Modern Python Library for String-to-String Algorithms | Code | 2
Advancing the Evaluation of Traditional Chinese Language Models: Towards a Comprehensive Benchmark Suite | Code | 2
Spectrally Pruned Gaussian Fields with Neural Compensation | Code | 2
BIG-Bench Extra Hard | Code | 2
What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective | Code | 2
Chain of Hindsight Aligns Language Models with Feedback | Code | 2
MiraGe: Editable 2D Images using Gaussian Splatting | Code | 2
Maverick: Efficient and Accurate Coreference Resolution Defying Recent Trends | Code | 2
Vision-aided UAV navigation and dynamic obstacle avoidance using gradient-based B-spline trajectory optimization | Code | 2
Deep learning-driven pulmonary artery and vein segmentation reveals demography-associated vasculature anatomical differences | Code | 2
A Novel State Space Model with Local Enhancement and State Sharing for Image Fusion | Code | 2
The Dark Side of Function Calling: Pathways to Jailbreaking Large Language Models | Code | 2
Spiking Diffusion Models | Code | 2
Putting People in their Place: Monocular Regression of 3D People in Depth | Code | 2
MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance | Code | 2
PnLCalib: Sports Field Registration via Points and Lines Optimization | Code | 2
XHand: Real-time Expressive Hand Avatar | Code | 2
FedGraph: A Research Library and Benchmark for Federated Graph Learning | Code | 2
UniTR: A Unified and Efficient Multi-Modal Transformer for Bird's-Eye-View Representation | Code | 2
ZenSVI: An Open-Source Software for the Integrated Acquisition, Processing and Analysis of Street View Imagery Towards Scalable Urban Science | Code | 2
ProteinGym: Large-Scale Benchmarks for Protein Fitness Prediction and Design | Code | 2
Editing Models with Task Arithmetic | Code | 2
Learning Video Representations from Large Language Models | Code | 2
Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives | Code | 2
Model-free quantification of completeness, uncertainties, and outliers in atomistic machine learning using information theory | Code | 2
Masked Face Recognition Dataset and Application | Code | 2
LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models | Code | 2
Semantic Image Synthesis via Diffusion Models | Code | 2
Multimodal Garment Designer: Human-Centric Latent Diffusion Models for Fashion Image Editing | Code | 2
Generating 3D Molecules for Target Protein Binding | Code | 2
FurnitureBench: Reproducible Real-World Benchmark for Long-Horizon Complex Manipulation | Code | 2
Isotropic Correlation Models for the Cross-Section of Equity Returns | Code | 2
Large Language Model with Region-guided Referring and Grounding for CT Report Generation | Code | 2
QAEncoder: Towards Aligned Representation Learning in Question Answering System | Code | 2
Neural-Driven Image Editing | Code | 2
Rethinking Negative Instances for Generative Named Entity Recognition | Code | 2
Act3D: 3D Feature Field Transformers for Multi-Task Robotic Manipulation | Code | 2
Space Group Informed Transformer for Crystalline Materials Generation | Code | 2
SFFNet: A Wavelet-Based Spatial and Frequency Domain Fusion Network for Remote Sensing Segmentation | Code | 2
Fourier Neural Operator with Learned Deformations for PDEs on General Geometries | Code | 2
KVCache Cache in the Wild: Characterizing and Optimizing KVCache Cache at a Large Cloud Provider | Code | 2
Deep Video Prior for Video Consistency and Propagation | Code | 2
Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity | Code | 2
Towards Large-Scale Training of Pathology Foundation Models | Code | 2
Explicit Differentiable Slicing and Global Deformation for Cardiac Mesh Reconstruction | Code | 2
MinVIS: A Minimal Video Instance Segmentation Framework without Video-based Training | Code | 2