SOTAVerified

Inductive Bias

Papers

Showing 651675 of 1529 papers

TitleStatusHype
Hidden Synergy: L_1 Weight Normalization and 1-Path-Norm Regularization0
Changing the Training Data Distribution to Reduce Simplicity Bias Improves In-distribution Generalization0
Learning Syntax Without Planting Trees: Understanding When and Why Transformers Generalize HierarchicallyCode0
SPARO: Selective Attention for Robust and Compositional Transformer Encodings for VisionCode0
Are Biological Systems More Intelligent Than Artificial Intelligence?0
BMapEst: Estimation of Brain Tissue Probability Maps using a Differentiable MRI SimulatorCode0
Uncertainty in latent representations of variational autoencoders optimized for visual tasksCode0
Masked Latent Transformer with the Random Masking Ratio to Advance the Diagnosis of Dental FluorosisCode0
Spectral Convolutional Conditional Neural ProcessesCode0
Do LLMs Think Fast and Slow? A Causal Study on Sentiment AnalysisCode0
A provable control of sensitivity of neural networks through a direct parameterization of the overall bi-LipschitznessCode0
In-Context Translation: Towards Unifying Image Recognition, Processing, and Generation0
On Computational Modeling of Sleep-Wake Cycle0
Technical Report: The Graph Spectral Token -- Enhancing Graph Transformers with Spectral InformationCode0
Efficient Learnable Collaborative Attention for Single Image Super-Resolution0
GvT: A Graph-based Vision Transformer with Talking-Heads Utilizing Sparsity, Trained from Scratch on Small Datasets0
Graph Neural Networks for Electric and Hydraulic Data Fusion to Enhance Short-term Forecasting of Pumped-storage Hydroelectricity0
Structured Initialization for Attention in Vision TransformersCode0
Unveiling Divergent Inductive Biases of LLMs on Temporal DataCode0
Harnessing The Power of Attention For Patch-Based Biomedical Image Classification0
Learning to Rank Patches for Unbiased Image Redundancy ReductionCode0
Track Everything Everywhere Fast and Robustly0
Incorporating Exponential Smoothing into MLP: A Simple but Effective Sequence ModelCode0
Laplacian-guided Entropy Model in Neural Codec with Blur-dissipated Synthesis0
Role of Locality and Weight Sharing in Image-Based Tasks: A Sample Complexity Separation between CNNs, LCNs, and FCNs0
Show:102550
← PrevPage 27 of 62Next →

No leaderboard results yet.