SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 94019450 of 661570 papers

TitleStatusHype
BiFormer: Vision Transformer with Bi-Level Routing AttentionCode2
MakeItTalk: Speaker-Aware Talking-Head AnimationCode2
Lite DETR : An Interleaved Multi-Scale Encoder for Efficient DETRCode2
Learning Optical Flow and Scene Flow with Bidirectional Camera-LiDAR FusionCode2
CtrlA: Adaptive Retrieval-Augmented Generation via Inherent ControlCode2
Mamba-360: Survey of State Space Models as Transformer Alternative for Long Sequence Modelling: Methods, Applications, and ChallengesCode2
Scalable Zero-shot Entity Linking with Dense Entity RetrievalCode2
MonoDETR: Depth-guided Transformer for Monocular 3D Object DetectionCode2
RS-Agent: Automating Remote Sensing Tasks through Intelligent AgentCode2
RhythmFormer: Extracting Patterned rPPG Signals based on Periodic Sparse AttentionCode2
Navigate through Enigmatic Labyrinth A Survey of Chain of Thought Reasoning: Advances, Frontiers and FutureCode2
Learning-to-Cache: Accelerating Diffusion Transformer via Layer CachingCode2
MiVOLO: Multi-input Transformer for Age and Gender EstimationCode2
Graph Neural Networks in Supply Chain Analytics and Optimization: Concepts, Perspectives, Dataset and BenchmarksCode2
Exact: Exploring Space-Time Perceptive Clues for Weakly Supervised Satellite Image Time Series Semantic SegmentationCode2
RetinaFace: Single-Shot Multi-Level Face Localisation in the WildCode2
RainMamba: Enhanced Locality Learning with State Space Models for Video DerainingCode2
Mergenetic: a Simple Evolutionary Model Merging LibraryCode2
Augmented Object Intelligence with XR-ObjectsCode2
TART: A plug-and-play Transformer module for task-agnostic reasoningCode2
Hulk: A Universal Knowledge Translator for Human-Centric TasksCode2
Face Swap via Diffusion ModelCode2
Segment anything model 2: an application to 2D and 3D medical imagesCode2
Point-Bind & Point-LLM: Aligning Point Cloud with Multi-modality for 3D Understanding, Generation, and Instruction FollowingCode2
A Comparative Study on Reasoning Patterns of OpenAI's o1 ModelCode2
RenderMe-360: A Large Digital Asset Library and Benchmarks Towards High-fidelity Head AvatarsCode2
Query-Dependent Video Representation for Moment Retrieval and Highlight DetectionCode2
SMILEtrack: SiMIlarity LEarning for Occlusion-Aware Multiple Object TrackingCode2
SceneRF: Self-Supervised Monocular 3D Scene Reconstruction with Radiance FieldsCode2
Augraphy: A Data Augmentation Library for Document ImagesCode2
EE-Tuning: An Economical yet Scalable Solution for Tuning Early-Exit Large Language ModelsCode2
Neural Implicit Representation for Building Digital Twins of Unknown Articulated ObjectsCode2
Drag Your Noise: Interactive Point-based Editing via Diffusion Semantic PropagationCode2
MotionChain: Conversational Motion Controllers via Multimodal PromptsCode2
Behavior Trees Enable Structured Programming of Language Model AgentsCode2
Counterfactual Learning on Graphs: A SurveyCode2
HILCodec: High-Fidelity and Lightweight Neural Audio CodecCode2
Full Page Handwriting Recognition via Image to Sequence ExtractionCode2
F-LMM: Grounding Frozen Large Multimodal ModelsCode2
DreamDiffusion: Generating High-Quality Images from Brain EEG SignalsCode2
Improved Multi-Task Brain Tumour Segmentation with Synthetic Data AugmentationCode2
moolib: A Platform for Distributed RLCode2
COCO-O: A Benchmark for Object Detectors under Natural Distribution ShiftsCode2
Think-on-Graph 2.0: Deep and Faithful Large Language Model Reasoning with Knowledge-guided Retrieval Augmented GenerationCode2
DiffLoc: Diffusion Model for Outdoor LiDAR LocalizationCode2
PropertyGPT: LLM-driven Formal Verification of Smart Contracts through Retrieval-Augmented Property GenerationCode2
SSL: A Self-similarity Loss for Improving Generative Image Super-resolutionCode2
SOLO: A Single Transformer for Scalable Vision-Language ModelingCode2
UNetMamba: An Efficient UNet-Like Mamba for Semantic Segmentation of High-Resolution Remote Sensing ImagesCode2
LLaSM: Large Language and Speech ModelCode2
Show:102550
← PrevPage 189 of 13232Next →