| Estimating Contamination via Perplexity: Quantifying Memorisation in Language Model Evaluation | Sep 19, 2023 | Language Model EvaluationLanguage Modeling | CodeCode Available | 1 |
| ERNIE 3.0 Titan: Exploring Larger-scale Knowledge Enhanced Pre-training for Language Understanding and Generation | Dec 23, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Entropy-Regularized Token-Level Policy Optimization for Language Agent Reinforcement | Feb 9, 2024 | Code GenerationDecision Making | CodeCode Available | 1 |
| Adaptive Contrastive Search: Uncertainty-Guided Decoding for Open-Ended Text Generation | Jul 26, 2024 | DiversityLanguage Modeling | CodeCode Available | 1 |
| Learning Approximate Inference Networks for Structured Prediction | Mar 9, 2018 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Chess as a Testbed for Language Model State Tracking | Feb 26, 2021 | Game of ChessLanguage Modeling | CodeCode Available | 1 |
| Epidemic Modeling with Generative Agents | Jul 11, 2023 | Decision MakingLanguage Modeling | CodeCode Available | 1 |
| Learning Contextual Representations for Semantic Parsing with Generation-Augmented Pre-Training | Dec 18, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Learning from Unlabeled 3D Environments for Vision-and-Language Navigation | Aug 24, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| EscapeBench: Pushing Language Models to Think Outside the Box | Dec 18, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |