SOTAVerified

Playing the Game of 2048

Papers

Showing 1-50 of 57 papers

Title | Status | Hype
PFGM++: Unlocking the Potential of Physics-Inspired Generative Models | Code | 2
GSPMD: General and Scalable Parallelization for ML Computation Graphs | Code | 2
Train Short, Test Long: Attention with Linear Biases Enables Input Length Extrapolation | Code | 2
Faster Person Re-Identification | Code | 1
Pavementscapes: a large-scale hierarchical image dataset for asphalt pavement damage segmentation | Code | 1
Parallel Context Windows for Large Language Models | Code | 1
Optimistic Temporal Difference Learning for 2048 | Code | 1
Spatial-Separated Curve Rendering Network for Efficient and High-Resolution Image Harmonization | Code | 1
Reconstruction of Perceived Images from fMRI Patterns and Semantic Brain Exploration using Instance-Conditioned GANs | Code | 1
Planning in Stochastic Environments with a Learned Model | Code | 1
On Reinforcement Learning for the Game of 2048 | Code | 1
Long Short-Term Transformer for Online Action Detection | Code | 1
Efficient Human Pose Estimation via 3D Event Point Cloud | Code | 1
Scaling Distributed Training of Flood-Filling Networks on HPC Infrastructure for Brain Mapping | Code | 0
Beyond 512 Tokens: Siamese Multi-depth Transformer-based Hierarchical Encoder for Long-Form Document Matching | Code | 0
Characterizing the Efficiency vs. Accuracy Trade-off for Long-Context NLP Models | Code | 0
Decoder-tailored Polar Code Design Using the Genetic Algorithm | Code | 0
Distributed Deep Reinforcement Learning: Learn how to play Atari games in 21 minutes | Code | 0
Genetic Algorithm-based Polar Code Construction for the AWGN Channel | Code | 0
GShard: Scaling Giant Models with Conditional Computation and Automatic Sharding | Code | 0
ImageNet Training in Minutes | Code | 0
Improving Electron Micrograph Signal-to-Noise with an Atrous Convolutional Encoder-Decoder | Code | 0
Mastering 2048 with Delayed Temporal Coherence Learning, Multi-Stage Weight Promotion, Redundant Encoding and Carousel Shaping | Code | 0
MG-WFBP: Merging Gradients Wisely for Efficient Communication in Distributed Deep Learning | Code | 0
Perceptual Similarity for Measuring Decision-Making Style and Policy Diversity in Games | Code | 0
Playing 2048 With Reinforcement Learning | Code | 0
Towards Efficient and Exact MAP-Inference for Large Scale Discrete Computer Vision Problems via Combinatorial Optimization | - | 0
Interference Cancellation Based Channel Estimation for Massive MIMO Systems with Time Shifted Pilots | - | 0
LANNS: A Web-Scale Approximate Nearest Neighbor Lookup System | - | 0
Learning Word Embeddings from the Portuguese Twitter Stream: A Study of some Practical Aspects | - | 0
DVHN: A Deep Hashing Framework for Large-scale Vehicle Re-identification | - | 0
Towards Sampling from Nondirected Probabilistic Graphical models using a D-Wave Quantum Annealer | - | 0
Accelerating Markov Random Field Inference with Uncertainty Quantification | - | 0
Multi-Stage Temporal Difference Learning for 2048-like Games | - | 0
Objective-Based Hierarchical Clustering of Deep Embedding Vectors | - | 0
Dense Dual-Path Network for Real-time Semantic Segmentation | - | 0
The Economics of Orbit Use: Open Access, External Costs, and Runaway Debris Growth | - | 0
Automatic Node Selection for Deep Neural Networks using Group Lasso Regularization | - | 0
A Parallel Algorithm for Exact Bayesian Structure Discovery in Bayesian Networks | - | 0
Pathways: Asynchronous Distributed Dataflow for ML | - | 0
An Investigation of Feature Selection and Transfer Learning for Writer-Independent Offline Handwritten Signature Verification | - | 0
Why Out-of-distribution Detection in CNNs Does Not Like Mahalanobis -- and What to Use Instead | - | 0
An Interpretable Machine Vision Approach to Human Activity Recognition using Photoplethysmograph Sensor Data | - | 0
An Affective Robot Companion for Assisting the Elderly in a Cognitive Game Scenario | - | 0
You Can't Count on Luck: Why Decision Transformers and RvS Fail in Stochastic Environments | - | 0
Rapid Person Re-Identification via Sub-space Consistency Regularization | - | 0
Real-Time Semantic Segmentation via Multiply Spatial Fusion Network | - | 0
A Hitchhiker's Guide On Distributed Training of Deep Neural Networks | - | 0
Scalable Reverse Image Search Engine for NASA Worldview | - | 0
M6-T: Exploring Sparse Expert Models and Beyond | - | 0
Page 1 of 2

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Stochastic MuZero | Average Score | 500,000 | - | Unverified
2 | AlphaZero (With Simulator) | Average Score | 500,000 | - | Unverified
3 | MuZero | Average Score | 300,000 | - | Unverified
4 | Beam Search | Average Score | 1,024 | - | Unverified
5 | DQN (1000 episodes) | Average Score | 256 | - | Unverified