SOTAVerified

ListOps

Papers

Showing 11-20 of 22 papers

Title | Status | Hype
Simplified State Space Layers for Sequence Modeling | Code | 2
Training Discrete Deep Generative Models via Gapped Straight-Through Estimator | Code | 1
Dynamic Token Normalization Improves Vision Transformers | Code | 1
ORCHARD: A Benchmark For Measuring Systematic Generalization of Multi-Hierarchical Reasoning | Code | 0
Efficiently Modeling Long Sequences with Structured State Spaces | Code | 1
The Neural Data Router: Adaptive Control Flow in Transformers Improves Systematic Generalization | Code | 1
Adaptive Control Flow in Transformers Improves Systematic Generalization | - | 0
Going Beyond Linear Transformers with Recurrent Fast Weight Programmers | Code | 1
Modeling Hierarchical Structures with Continuous Recursive Neural Networks | Code | 1
Long Range Arena: A Benchmark for Efficient Transformers | Code | 1

No leaderboard results yet.