| AdaBelief Optimizer: Adapting Stepsizes by the Belief in Observed Gradients | Oct 15, 2020 | Image Classification | Code Available | 2 | 5 |
| Differential Transformer | Oct 7, 2024 | Hallucination, In-Context Learning | Code Available | 2 | 5 |
| Graph-Aware Isomorphic Attention for Adaptive Dynamics in Transformers | Jan 4, 2025 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| GraphWiz: An Instruction-Following Language Model for Graph Problems | Feb 25, 2024 | Instruction Following, Language Modeling | Code Available | 2 | 5 |
| GPT-Driver: Learning to Drive with GPT | Oct 2, 2023 | Autonomous Driving, Autonomous Vehicles | Code Available | 2 | 5 |
| Android in the Zoo: Chain-of-Action-Thought for GUI Agents | Mar 5, 2024 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| GPT Can Solve Mathematical Problems Without a Calculator | Sep 6, 2023 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| Pix2Struct: Screenshot Parsing as Pretraining for Visual Language Understanding | Oct 7, 2022 | Chart Question Answering, Diversity | Code Available | 2 | 5 |
| GPT or BERT: why not both? | Oct 31, 2024 | Causal Language Modeling, Language Modeling | Code Available | 2 | 5 |
| GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest | Jul 7, 2023 | Attribute, Common Sense Reasoning | Code Available | 2 | 5 |