| ComplexFormer: Disruptively Advancing Transformer Inference Ability via Head-Specific Complex Vector Attention | May 15, 2025 | Code GenerationLanguage Modeling | CodeCode Available | 0 | 5 |
| FiSSA at SemEval-2020 Task 9: Fine-tuned For Feelings | Jul 24, 2020 | ClassificationGeneral Classification | CodeCode Available | 0 | 5 |
| ArthModel: Enhance Arithmetic Skills to Large Language Model | Nov 30, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Data Noising as Smoothing in Neural Network Language Models | Mar 7, 2017 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Letter-Based Speech Recognition with Gated ConvNets | Dec 22, 2017 | DecoderLanguage Modeling | CodeCode Available | 0 | 5 |
| HSI: Head-Specific Intervention Can Induce Misaligned AI Coordination in Large Language Models | Feb 9, 2025 | Answer GenerationLanguage Modeling | CodeCode Available | 0 | 5 |
| LLM vs. Lawyers: Identifying a Subset of Summary Judgments in a Large UK Case Law Dataset | Mar 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| A Common Pitfall of Margin-based Language Model Alignment: Gradient Entanglement | Oct 17, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Detection of depression on social networks using transformers and ensembles | May 9, 2023 | Depression DetectionLanguage Modeling | CodeCode Available | 0 | 5 |
| Let the Poem Hit the Rhythm: Using a Byte-Based Transformer for Beat-Aligned Poetry Generation | Jun 14, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |