| Gradient Ascent Post-training Enhances Language Model Generalization | Jun 12, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| QUERT: Continual Pre-training of Language Model for Query Understanding in Travel Domain Search | Jun 11, 2023 | Domain Adaptation, Language Modeling | Code Available | 1 |
| GKD: A General Knowledge Distillation Framework for Large-scale Pre-trained Language Model | Jun 11, 2023 | General Knowledge, Knowledge Distillation | Code Available | 1 |
| Are Intermediate Layers and Labels Really Necessary? A General Language Model Distillation Method | Jun 11, 2023 | Knowledge Distillation, Language Modeling | Code Available | 1 |
| Large Language Models Are Semi-Parametric Reinforcement Learning Agents | Jun 9, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| 14 Examples of How LLMs Can Transform Materials Science and Chemistry: A Reflection on a Large Language Model Hackathon | Jun 9, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| Aladdin: Zero-Shot Hallucination of Stylized 3D Assets from Abstract Scene Descriptions | Jun 9, 2023 | Hallucination, Language Modeling | Code Available | 1 |
| Hexatagging: Projective Dependency Parsing as Tagging | Jun 8, 2023 | Computational Efficiency, Dependency Parsing | Code Available | 1 |
| Privately generating tabular data using language models | Jun 7, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| Q: How to Specialize Large Vision-Language Models to Data-Scarce VQA Tasks? A: Self-Train on Unlabeled Images! | Jun 6, 2023 | counterfactual, Data Augmentation | Code Available | 1 |