SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers

Showing 4701-4750 of 177,340 papers

Title | Status | Hype
DisCO: Reinforcing Large Reasoning Models with Discriminative Constrained Optimization | Code | 2
Erwin: A Tree-based Hierarchical Transformer for Large-scale Physical Systems | Code | 2
TripleMixer: A 3D Point Cloud Denoising Model for Adverse Weather | Code | 2
Mamba Meets Financial Markets: A Graph-Mamba Approach for Stock Price Prediction | Code | 2
Audio-Synchronized Visual Animation | Code | 2
InfoBatch: Lossless Training Speed Up by Unbiased Dynamic Data Pruning | Code | 2
SHViT: Single-Head Vision Transformer with Memory Efficient Macro Design | Code | 2
LLaVAR: Enhanced Visual Instruction Tuning for Text-Rich Image Understanding | Code | 2
Devils in Middle Layers of Large Vision-Language Models: Interpreting, Detecting and Mitigating Object Hallucinations via Attention Lens | Code | 2
MaskBit: Embedding-free Image Generation via Bit Tokens | Code | 2
True Knowledge Comes from Practice: Aligning LLMs with Embodied Environments via Reinforcement Learning | Code | 2
Emulating Self-attention with Convolution for Efficient Image Super-Resolution | Code | 2
GuardReasoner: Towards Reasoning-based LLM Safeguards | Code | 2
RFWave: Multi-band Rectified Flow for Audio Waveform Reconstruction | Code | 2
PPSURF: Combining Patches and Point Convolutions for Detailed Surface Reconstruction | Code | 2
Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Refuse | Code | 2
Matryoshka Query Transformer for Large Vision-Language Models | Code | 2
Change Guiding Network: Incorporating Change Prior to Guide Change Detection in Remote Sensing Imagery | Code | 2
DiffusionInst: Diffusion Model for Instance Segmentation | Code | 2
Joint Physical-Digital Facial Attack Detection Via Simulating Spoofing Clues | Code | 2
Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation | Code | 2
MedNeXt: Transformer-driven Scaling of ConvNets for Medical Image Segmentation | Code | 2
Non-stationary Transformers: Exploring the Stationarity in Time Series Forecasting | Code | 2
In-Context Language Learning: Architectures and Algorithms | Code | 2
Wave-Mamba: Wavelet State Space Model for Ultra-High-Definition Low-Light Image Enhancement | Code | 2
Fin-GAN: forecasting and classifying financial time series via generative adversarial networks | Code | 2
INSTRUCTEVAL: Towards Holistic Evaluation of Instruction-Tuned Large Language Models | Code | 2
SpaceByte: Towards Deleting Tokenization from Large Language Modeling | Code | 2
Realistic Rainy Weather Simulation for LiDARs in CARLA Simulator | Code | 2
Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering | Code | 2
Adaptive Optimizers with Sparse Group Lasso for Neural Networks in CTR Prediction | Code | 2
Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data | Code | 2
KoSBi: A Dataset for Mitigating Social Bias Risks Towards Safer Large Language Model Application | Code | 2
When Attention Sink Emerges in Language Models: An Empirical View | Code | 2
SciLitLLM: How to Adapt LLMs for Scientific Literature Understanding | Code | 2
CFAT: Unleashing Triangular Windows for Image Super-resolution | Code | 2
Towards Fast, Accurate and Stable 3D Dense Face Alignment | Code | 2
Diffusion Models for Adversarial Purification | Code | 2
Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations | Code | 2
RAG-Instruct: Boosting LLMs with Diverse Retrieval-Augmented Instructions | Code | 2
Samba: A Unified Mamba-based Framework for General Salient Object Detection | Code | 2
Centralized Feature Pyramid for Object Detection | Code | 2
DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models | Code | 2
An End-to-End Structure with Novel Position Mechanism and Improved EMD for Stock Forecasting | Code | 2
PartCraft: Crafting Creative Objects by Parts | Code | 2
Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters | Code | 2
A Novel Transformer Based Semantic Segmentation Scheme for Fine-Resolution Remote Sensing Images | Code | 2
Large Language Models Are Zero-Shot Time Series Forecasters | Code | 2
TinyViM: Frequency Decoupling for Tiny Hybrid Vision Mamba | Code | 2
On-device Sora: Enabling Training-Free Diffusion-based Text-to-Video Generation for Mobile Devices | Code | 2
Page 95 of 3547