SOTAVerified

Playing the Game of 2048

Papers

Showing 1-50 of 57 papers

| Title | Status | Hype |
| --- | --- | --- |
| PFGM++: Unlocking the Potential of Physics-Inspired Generative Models | Code | 2 |
| Train Short, Test Long: Attention with Linear Biases Enables Input Length Extrapolation | Code | 2 |
| GSPMD: General and Scalable Parallelization for ML Computation Graphs | Code | 2 |
| On Reinforcement Learning for the Game of 2048 | Code | 1 |
| Parallel Context Windows for Large Language Models | Code | 1 |
| Pavementscapes: a large-scale hierarchical image dataset for asphalt pavement damage segmentation | Code | 1 |
| Efficient Human Pose Estimation via 3D Event Point Cloud | Code | 1 |
| Reconstruction of Perceived Images from fMRI Patterns and Semantic Brain Exploration using Instance-Conditioned GANs | Code | 1 |
| Optimistic Temporal Difference Learning for 2048 | Code | 1 |
| Planning in Stochastic Environments with a Learned Model | Code | 1 |
| Spatial-Separated Curve Rendering Network for Efficient and High-Resolution Image Harmonization | Code | 1 |
| Long Short-Term Transformer for Online Action Detection | Code | 1 |
| Faster Person Re-Identification | Code | 1 |
| Perceptual Similarity for Measuring Decision-Making Style and Policy Diversity in Games | Code | 0 |
| Rapid Person Re-Identification via Sub-space Consistency Regularization | | 0 |
| You Can't Count on Luck: Why Decision Transformers and RvS Fail in Stochastic Environments | | 0 |
| Characterizing the Efficiency vs. Accuracy Trade-off for Long-Context NLP Models | Code | 0 |
| Pathways: Asynchronous Distributed Dataflow for ML | | 0 |
| The Economics of Orbit Use: Open Access, External Costs, and Runaway Debris Growth | | 0 |
| DVHN: A Deep Hashing Framework for Large-scale Vehicle Re-identification | | 0 |
| Playing 2048 With Reinforcement Learning | Code | 0 |
| Why Out-of-distribution Detection in CNNs Does Not Like Mahalanobis -- and What to Use Instead | | 0 |
| Scalable Reverse Image Search Engine for NASA Worldview | | 0 |
| Accelerating Markov Random Field Inference with Uncertainty Quantification | | 0 |
| M6-T: Exploring Sparse Expert Models and Beyond | | 0 |
| Objective-Based Hierarchical Clustering of Deep Embedding Vectors | | 0 |
| Training EfficientNets at Supercomputer Scale: 83% ImageNet Top-1 Accuracy in One Hour | | 0 |
| Dense Dual-Path Network for Real-time Semantic Segmentation | | 0 |
| LANNS: A Web-Scale Approximate Nearest Neighbor Lookup System | | 0 |
| An Investigation of Feature Selection and Transfer Learning for Writer-Independent Offline Handwritten Signature Verification | | 0 |
| GShard: Scaling Giant Models with Conditional Computation and Automatic Sharding | Code | 0 |
| Interference Cancellation Based Channel Estimation for Massive MIMO Systems with Time Shifted Pilots | | 0 |
| Beyond 512 Tokens: Siamese Multi-depth Transformer-based Hierarchical Encoder for Long-Form Document Matching | Code | 0 |
| Improving BPSO-based feature selection applied to offline WI handwritten signature verification through overfitting control | | 0 |
| Efficient Video Semantic Segmentation with Labels Propagation and Refinement | | 0 |
| MG-WFBP: Merging Gradients Wisely for Efficient Communication in Distributed Deep Learning | Code | 0 |
| Real-Time Semantic Segmentation via Multiply Spatial Fusion Network | | 0 |
| A Hardware-Efficient ADMM-Based SVM Training Algorithm for Edge Computing | | 0 |
| Scaling Distributed Training of Flood-Filling Networks on HPC Infrastructure for Brain Mapping | Code | 0 |
| Towards Sampling from Nondirected Probabilistic Graphical models using a D-Wave Quantum Annealer | | 0 |
| Decoder-tailored Polar Code Design Using the Genetic Algorithm | Code | 0 |
| Genetic Algorithm-based Polar Code Construction for the AWGN Channel | Code | 0 |
| An Interpretable Machine Vision Approach to Human Activity Recognition using Photoplethysmograph Sensor Data | | 0 |
| A Hitchhiker's Guide On Distributed Training of Deep Neural Networks | | 0 |
| Improving Electron Micrograph Signal-to-Noise with an Atrous Convolutional Encoder-Decoder | Code | 0 |
| Highly Scalable Deep Learning Training System with Mixed-Precision: Training ImageNet in Four Minutes | | 0 |
| An Affective Robot Companion for Assisting the Elderly in a Cognitive Game Scenario | | 0 |
| Distributed Deep Reinforcement Learning: Learn how to play Atari games in 21 minutes | Code | 0 |
| Seeded Ising Model and Statistical Natures of Human Iris Templates | | 0 |
| ImageNet Training in Minutes | Code | 0 |

Benchmark Results

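| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Stochastic MuZero | Average Score | 500,000 | | Unverified |
| 2 | AlphaZero (With Simulator) | Average Score | 500,000 | | Unverified |
| 3 | MuZero | Average Score | 300,000 | | Unverified |
| 4 | Beam Search | Average Score | 1,024 | | Unverified |
| 5 | DQN (1000 episodes) | Average Score | 256 | | Unverified |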