SOTAVerified

Playing the Game of 2048

Papers

Showing 125 of 57 papers

TitleStatusHype
Train Short, Test Long: Attention with Linear Biases Enables Input Length ExtrapolationCode2
PFGM++: Unlocking the Potential of Physics-Inspired Generative ModelsCode2
GSPMD: General and Scalable Parallelization for ML Computation GraphsCode2
Long Short-Term Transformer for Online Action DetectionCode1
Pavementscapes: a large-scale hierarchical image dataset for asphalt pavement damage segmentationCode1
Reconstruction of Perceived Images from fMRI Patterns and Semantic Brain Exploration using Instance-Conditioned GANsCode1
Planning in Stochastic Environments with a Learned ModelCode1
Spatial-Separated Curve Rendering Network for Efficient and High-Resolution Image HarmonizationCode1
Optimistic Temporal Difference Learning for 2048Code1
Efficient Human Pose Estimation via 3D Event Point CloudCode1
Faster Person Re-IdentificationCode1
On Reinforcement Learning for the Game of 2048Code1
Parallel Context Windows for Large Language ModelsCode1
Playing 2048 With Reinforcement LearningCode0
Distributed Deep Reinforcement Learning: Learn how to play Atari games in 21 minutesCode0
Decoder-tailored Polar Code Design Using the Genetic AlgorithmCode0
Perceptual Similarity for Measuring Decision-Making Style and Policy Diversity in GamesCode0
Scaling Distributed Training of Flood-Filling Networks on HPC Infrastructure for Brain MappingCode0
Characterizing the Efficiency vs. Accuracy Trade-off for Long-Context NLP ModelsCode0
GShard: Scaling Giant Models with Conditional Computation and Automatic ShardingCode0
Genetic Algorithm-based Polar Code Construction for the AWGN ChannelCode0
Beyond 512 Tokens: Siamese Multi-depth Transformer-based Hierarchical Encoder for Long-Form Document MatchingCode0
Mastering 2048 with Delayed Temporal Coherence Learning, Multi-Stage Weight Promotion, Redundant Encoding and Carousel ShapingCode0
ImageNet Training in MinutesCode0
MG-WFBP: Merging Gradients Wisely for Efficient Communication in Distributed Deep LearningCode0
Show:102550
← PrevPage 1 of 3Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Stochastic MuzeroAverage Score500,000Unverified
2AlphaZero (With Simulator)Average Score500,000Unverified
3MuZeroAverage Score300,000Unverified
4Beam SearchAverage Score1,024Unverified
5DQN (1000 episodes)Average Score256Unverified