SOTAVerified

ANDHRA Bandersnatch: Training Neural Networks to Predict Parallel Realities

2024-11-28 · Code Available

Venkata Satya Sai Ajay Daliparthi


Abstract

Inspired by the Many-Worlds Interpretation (MWI), this work introduces a novel neural network architecture that splits the same input signal into parallel branches at each layer, utilizing a Hyper Rectified Activation, referred to as ANDHRA. The branches do not merge; they form separate network paths, leading to multiple network heads for output prediction. For a network with a branching factor of 2 at three levels, the total number of heads is 2^3 = 8. The individual heads are trained jointly by combining their respective loss values. However, the proposed architecture requires additional parameters and memory during training due to the added branches. During inference, experimental results on CIFAR-10/100 demonstrate that one individual head outperforms the baseline accuracy, achieving a statistically significant improvement with equal parameters and computational cost.
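The branching scheme described above (factor 2 at three levels, giving 2^3 = 8 heads trained on a combined loss) can be sketched as follows. This is a minimal illustration, not the paper's implementation: the ANDHRA activation is not specified here, so a plain ReLU block stands in for it, and the channel widths, block layout, and class names are all hypothetical.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class BranchingNetSketch(nn.Module):
    """Illustrative sketch: at each of `levels` stages, every existing
    path splits into `factor` branches that never merge, yielding
    factor**levels independent classification heads."""

    def __init__(self, num_classes=10, width=16, levels=3, factor=2):
        super().__init__()
        self.stem = nn.Conv2d(3, width, 3, padding=1)
        self.stages = nn.ModuleList()
        paths = 1
        for _ in range(levels):
            # One conv block per branch at this level (ReLU stands in
            # for the unspecified ANDHRA activation).
            self.stages.append(nn.ModuleList([
                nn.Sequential(
                    nn.Conv2d(width, width, 3, padding=1),
                    nn.BatchNorm2d(width),
                    nn.ReLU(),
                    nn.MaxPool2d(2),
                )
                for _ in range(paths * factor)
            ]))
            paths *= factor
        # One linear head per final path.
        self.heads = nn.ModuleList(
            nn.Linear(width, num_classes) for _ in range(paths)
        )

    def forward(self, x):
        feats = [F.relu(self.stem(x))]
        for blocks in self.stages:
            split = len(blocks) // len(feats)  # branches per path
            feats = [blocks[i * split + j](f)
                     for i, f in enumerate(feats)
                     for j in range(split)]
        # Global average pool, then one logit tensor per head.
        return [head(f.mean(dim=(2, 3)))
                for head, f in zip(self.heads, feats)]

model = BranchingNetSketch()
outputs = model(torch.randn(2, 3, 32, 32))  # 8 heads of shape (2, 10)
# Joint training combines the per-head losses, e.g. by summation:
# loss = sum(F.cross_entropy(out, targets) for out in outputs)
```

With factor 2 and three levels the forward pass produces 8 logit tensors, and the combined loss trains all branches jointly, matching the head count stated in the abstract.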

Benchmark Results

| Dataset | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| CIFAR-10 | ABNet-2G-R3-Combined | Percentage correct | 96.38 | | Unverified |
| CIFAR-10 | ABNet-2G-R3 | Percentage correct | 96.09 | | Unverified |
| CIFAR-10 | ABNet-2G-R2 | Percentage correct | 95.90 | | Unverified |
| CIFAR-10 | ABNet-2G-R1 | Percentage correct | 95.54 | | Unverified |
| CIFAR-10 | ABNet-2G-R0 | Percentage correct | 94.12 | | Unverified |
| CIFAR-100 | ABNet-2G-R3-Combined | Percentage correct | 82.78 | | Unverified |
| CIFAR-100 | ABNet-2G-R3 | Percentage correct | 80.83 | | Unverified |
| CIFAR-100 | ABNet-2G-R2 | Percentage correct | 80.35 | | Unverified |
| CIFAR-100 | ABNet-2G-R1 | Percentage correct | 78.79 | | Unverified |
| CIFAR-100 | ABNet-2G-R0 | Percentage correct | 73.93 | | Unverified |