SOTAVerified

Playing the Game of 2048

Papers

Showing 150 of 57 papers

TitleStatusHype
Perceptual Similarity for Measuring Decision-Making Style and Policy Diversity in GamesCode0
PFGM++: Unlocking the Potential of Physics-Inspired Generative ModelsCode2
Parallel Context Windows for Large Language ModelsCode1
On Reinforcement Learning for the Game of 2048Code1
Pavementscapes: a large-scale hierarchical image dataset for asphalt pavement damage segmentationCode1
Rapid Person Re-Identification via Sub-space Consistency Regularization0
Efficient Human Pose Estimation via 3D Event Point CloudCode1
You Can't Count on Luck: Why Decision Transformers and RvS Fail in Stochastic Environments0
Characterizing the Efficiency vs. Accuracy Trade-off for Long-Context NLP ModelsCode0
Pathways: Asynchronous Distributed Dataflow for ML0
Reconstruction of Perceived Images from fMRI Patterns and Semantic Brain Exploration using Instance-Conditioned GANsCode1
The Economics of Orbit Use: Open Access, External Costs, and Runaway Debris Growth0
DVHN: A Deep Hashing Framework for Large-scale Vehicle Re-identification0
Optimistic Temporal Difference Learning for 2048Code1
Playing 2048 With Reinforcement LearningCode0
Why Out-of-distribution Detection in CNNs Does Not Like Mahalanobis -- and What to Use Instead0
Planning in Stochastic Environments with a Learned ModelCode1
Spatial-Separated Curve Rendering Network for Efficient and High-Resolution Image HarmonizationCode1
Train Short, Test Long: Attention with Linear Biases Enables Input Length ExtrapolationCode2
Scalable Reverse Image Search Engine for NASAWorldview0
Accelerating Markov Random Field Inference with Uncertainty Quantification0
Long Short-Term Transformer for Online Action DetectionCode1
M6-T: Exploring Sparse Expert Models and Beyond0
GSPMD: General and Scalable Parallelization for ML Computation GraphsCode2
Objective-Based Hierarchical Clustering of Deep Embedding Vectors0
Training EfficientNets at Supercomputer Scale: 83% ImageNet Top-1 Accuracy in One Hour0
Dense Dual-Path Network for Real-time Semantic Segmentation0
An Investigation of Feature Selection and Transfer Learning for Writer-Independent Offline Handwritten Signature Verification0
LANNS: A Web-Scale Approximate Nearest Neighbor Lookup System0
Faster Person Re-IdentificationCode1
GShard: Scaling Giant Models with Conditional Computation and Automatic ShardingCode0
Interference Cancellation Based Channel Estimation for Massive MIMO Systems with Time Shifted Pilots0
Beyond 512 Tokens: Siamese Multi-depth Transformer-based Hierarchical Encoder for Long-Form Document MatchingCode0
Improving BPSO-based feature selection applied to offline WI handwritten signature verification through overfitting control0
Efficient Video Semantic Segmentation with Labels Propagation and Refinement0
MG-WFBP: Merging Gradients Wisely for Efficient Communication in Distributed Deep LearningCode0
Real-Time Semantic Segmentation via Multiply Spatial Fusion Network0
A Hardware-Efficient ADMM-Based SVM Training Algorithm for Edge Computing0
Scaling Distributed Training of Flood-Filling Networks on HPC Infrastructure for Brain MappingCode0
Towards Sampling from Nondirected Probabilistic Graphical models using a D-Wave Quantum Annealer0
Decoder-tailored Polar Code Design Using the Genetic AlgorithmCode0
Genetic Algorithm-based Polar Code Construction for the AWGN ChannelCode0
An Interpretable Machine Vision Approach to Human Activity Recognition using Photoplethysmograph Sensor Data0
A Hitchhiker's Guide On Distributed Training of Deep Neural Networks0
Improving Electron Micrograph Signal-to-Noise with an Atrous Convolutional Encoder-DecoderCode0
Highly Scalable Deep Learning Training System with Mixed-Precision: Training ImageNet in Four Minutes0
An Affective Robot Companion for Assisting the Elderly in a Cognitive Game Scenario0
Distributed Deep Reinforcement Learning: Learn how to play Atari games in 21 minutesCode0
Seeded Ising Model and Statistical Natures of Human Iris Templates0
ImageNet Training in MinutesCode0
Show:102550
← PrevPage 1 of 2Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Stochastic MuzeroAverage Score500,000Unverified
2AlphaZero (With Simulator)Average Score500,000Unverified
3MuZeroAverage Score300,000Unverified
4Beam SearchAverage Score1,024Unverified
5DQN (1000 episodes)Average Score256Unverified