SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 78017850 of 177340 papers

TitleStatusHype
Domain Adaptive and Generalizable Network Architectures and Training Strategies for Semantic Image SegmentationCode2
TroL: Traversal of Layers for Large Language and Vision ModelsCode2
One-for-More: Continual Diffusion Model for Anomaly DetectionCode2
GeneOH Diffusion: Towards Generalizable Hand-Object Interaction Denoising via Denoising DiffusionCode2
EmoBench: Evaluating the Emotional Intelligence of Large Language ModelsCode2
Swin UNETR: Swin Transformers for Semantic Segmentation of Brain Tumors in MRI ImagesCode2
Text2HOI: Text-guided 3D Motion Generation for Hand-Object InteractionCode2
DexTrack: Towards Generalizable Neural Tracking Control for Dexterous Manipulation from Human ReferencesCode2
ICDAR 2021 Competition on Scientific Literature ParsingCode2
Scale-invariant Learning by Physics InversionCode2
Probabilistic Warp Consistency for Weakly-Supervised Semantic CorrespondencesCode2
Differentiable Voxelization and Mesh MorphingCode2
MacroHFT: Memory Augmented Context-aware Reinforcement Learning On High Frequency TradingCode2
Visual Text Processing: A Comprehensive Review and Unified EvaluationCode2
QuoTA: Query-oriented Token Assignment via CoT Query Decouple for Long Video ComprehensionCode2
Transformers learn in-context by gradient descentCode2
Meent: Differentiable Electromagnetic Simulator for Machine LearningCode2
Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video UnderstandingCode2
Video Super-Resolution Transformer with Masked Inter&Intra-Frame AttentionCode2
AdaptCLIP: Adapting CLIP for Universal Visual Anomaly DetectionCode2
4D Spatio-Temporal ConvNets: Minkowski Convolutional Neural NetworksCode2
Plug and Play Language Models: A Simple Approach to Controlled Text GenerationCode2
Third Time's the Charm? Image and Video Editing with StyleGAN3Code2
PPI++: Efficient Prediction-Powered InferenceCode2
Adapting Segment Anything Model for Change Detection in HR Remote Sensing ImagesCode2
Black-Box Tuning for Language-Model-as-a-ServiceCode2
Managing FAIR Knowledge Graphs as Polyglot Data End Points: A Benchmark based on the rdf2pg Framework and Plant Biology DataCode2
3D-GRAND: A Million-Scale Dataset for 3D-LLMs with Better Grounding and Less HallucinationCode2
Spatio-Temporal Self-Supervised Learning for Traffic Flow PredictionCode2
Contrastive Decoding: Open-ended Text Generation as OptimizationCode2
DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text SpottingCode2
MER 2023: Multi-label Learning, Modality Robustness, and Semi-Supervised LearningCode2
DreamBench++: A Human-Aligned Benchmark for Personalized Image GenerationCode2
FreshLLMs: Refreshing Large Language Models with Search Engine AugmentationCode2
EpiLearn: A Python Library for Machine Learning in Epidemic ModelingCode2
3D Point Cloud Compression with Recurrent Neural Network and Image Compression MethodsCode2
It Takes Two to Tango: Directly Optimizing for Constrained Synthesizability in Generative Molecular DesignCode2
FlagVNE: A Flexible and Generalizable Reinforcement Learning Framework for Network Resource AllocationCode2
NIKI: Neural Inverse Kinematics with Invertible Neural Networks for 3D Human Pose and Shape EstimationCode2
Large-scale and Fine-grained Vision-language Pre-training for Enhanced CT Image UnderstandingCode2
FreeTraj: Tuning-Free Trajectory Control in Video Diffusion ModelsCode2
DV-3DLane: End-to-end Multi-modal 3D Lane Detection with Dual-view RepresentationCode2
MG-LLaVA: Towards Multi-Granularity Visual Instruction TuningCode2
Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language ModelsCode2
Scattered Mixture-of-Experts ImplementationCode2
EgoVideo: Exploring Egocentric Foundation Model and Downstream AdaptationCode2
DEX-TTS: Diffusion-based EXpressive Text-to-Speech with Style Modeling on Time VariabilityCode2
RoboUniView: Visual-Language Model with Unified View Representation for Robotic ManipulationCode2
Dynamic Spatial Sparsification for Efficient Vision Transformers and Convolutional Neural NetworksCode2
Odd-One-Out: Anomaly Detection by Comparing with NeighborsCode2
Show:102550
← PrevPage 157 of 3547Next →