| How Does Code Pretraining Affect Language Model Task Performance? | Sep 6, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Confidential Computing on NVIDIA Hopper GPUs: A Performance Benchmark Study | Sep 6, 2024 | CPUGPU | —Unverified | 0 |
| AnyMatch -- Efficient Zero-Shot Entity Matching with a Small Language Model | Sep 6, 2024 | AttributeAutoML | CodeCode Available | 1 |
| Using Large Language Models to Generate Authentic Multi-agent Knowledge Work Datasets | Sep 6, 2024 | DiversityLanguage Modeling | —Unverified | 0 |
| Multi-Programming Language Ensemble for Code Generation in Large Language Model | Sep 6, 2024 | Code GenerationHumanEval | CodeCode Available | 0 |
| A Fused Large Language Model for Predicting Startup Success | Sep 5, 2024 | Decision MakingLanguage Modeling | —Unverified | 0 |
| An Effective Deployment of Diffusion LM for Data Augmentation in Low-Resource Sentiment Classification | Sep 5, 2024 | Data AugmentationDiversity | CodeCode Available | 0 |
| LAST: Language Model Aware Speech Tokenization | Sep 5, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| The AdEMAMix Optimizer: Better, Faster, Older | Sep 5, 2024 | image-classificationImage Classification | CodeCode Available | 2 |
| N-gram Prediction and Word Difference Representations for Language Modeling | Sep 5, 2024 | Causal Language ModelingLanguage Modeling | —Unverified | 0 |