SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 61516200 of 661570 papers

TitleStatusHype
NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Radiance FieldsCode2
Query2CAD: Generating CAD models using natural language queriesCode2
Magic Mirror: ID-Preserved Video Generation in Video Diffusion TransformersCode2
What is the Role of Small Models in the LLM Era: A SurveyCode2
Methods for Detoxification of Texts for the Russian LanguageCode2
NeW CRFs: Neural Window Fully-connected CRFs for Monocular Depth EstimationCode2
GTA: A Benchmark for General Tool AgentsCode2
Chat-Scene: Bridging 3D Scene and Large Language Models with Object IdentifiersCode2
Massive Values in Self-Attention Modules are the Key to Contextual Knowledge UnderstandingCode2
Sketch and Refine: Towards Fast and Accurate Lane DetectionCode2
Texture-Preserving Diffusion Models for High-Fidelity Virtual Try-OnCode2
DEA-Net: Single image dehazing based on detail-enhanced convolution and content-guided attentionCode2
FairyGen: Storied Cartoon Video from a Single Child-Drawn CharacterCode2
MonoSplat: Generalizable 3D Gaussian Splatting from Monocular Depth Foundation ModelsCode2
Pair-VPR: Place-Aware Pre-training and Contrastive Pair Classification for Visual Place Recognition with Vision TransformersCode2
DisCO: Reinforcing Large Reasoning Models with Discriminative Constrained OptimizationCode2
Erwin: A Tree-based Hierarchical Transformer for Large-scale Physical SystemsCode2
TripleMixer: A 3D Point Cloud Denoising Model for Adverse WeatherCode2
Mamba Meets Financial Markets: A Graph-Mamba Approach for Stock Price PredictionCode2
Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory SharpeningCode2
Audio-Synchronized Visual AnimationCode2
InfoBatch: Lossless Training Speed Up by Unbiased Dynamic Data PruningCode2
SHViT: Single-Head Vision Transformer with Memory Efficient Macro DesignCode2
LLaVAR: Enhanced Visual Instruction Tuning for Text-Rich Image UnderstandingCode2
Devils in Middle Layers of Large Vision-Language Models: Interpreting, Detecting and Mitigating Object Hallucinations via Attention LensCode2
MaskBit: Embedding-free Image Generation via Bit TokensCode2
True Knowledge Comes from Practice: Aligning LLMs with Embodied Environments via Reinforcement LearningCode2
Emulating Self-attention with Convolution for Efficient Image Super-ResolutionCode2
GuardReasoner: Towards Reasoning-based LLM SafeguardsCode2
RFWave: Multi-band Rectified Flow for Audio Waveform ReconstructionCode2
PPSURF: Combining Patches and Point Convolutions for Detailed Surface ReconstructionCode2
Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to RefuseCode2
Matryoshka Query Transformer for Large Vision-Language ModelsCode2
Change Guiding Network: Incorporating Change Prior to Guide Change Detection in Remote Sensing ImageryCode2
DiffusionInst: Diffusion Model for Instance SegmentationCode2
Joint Physical-Digital Facial Attack Detection Via Simulating Spoofing CluesCode2
Learn to Rectify the Bias of CLIP for Unsupervised Semantic SegmentationCode2
MedNeXt: Transformer-driven Scaling of ConvNets for Medical Image SegmentationCode2
Non-stationary Transformers: Exploring the Stationarity in Time Series ForecastingCode2
In-Context Language Learning: Architectures and AlgorithmsCode2
Wave-Mamba: Wavelet State Space Model for Ultra-High-Definition Low-Light Image EnhancementCode2
Fin-GAN: forecasting and classifying financial time series via generative adversarial networksCode2
INSTRUCTEVAL: Towards Holistic Evaluation of Instruction-Tuned Large Language ModelsCode2
SpaceByte: Towards Deleting Tokenization from Large Language ModelingCode2
Realistic Rainy Weather Simulation for LiDARs in CARLA SimulatorCode2
Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question AnsweringCode2
Adaptive Optimizers with Sparse Group Lasso for Neural Networks in CTR PredictionCode2
Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean DataCode2
KoSBi: A Dataset for Mitigating Social Bias Risks Towards Safer Large Language Model ApplicationCode2
When Attention Sink Emerges in Language Models: An Empirical ViewCode2
Show:102550
← PrevPage 124 of 13232Next →