| LitCab: Lightweight Language Model Calibration over Short- and Long-form Responses | Oct 30, 2023 | Form, Language Modeling | Code Available | 1 |
| MiLe Loss: a New Loss for Mitigating the Bias of Learning Difficulties in Generative Language Models | Oct 30, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| Multimodal ChatGPT for Medical Applications: an Experimental Study of GPT-4V | Oct 29, 2023 | Diagnostic, Language Modeling | Code Available | 1 |
| NLP Evaluation in trouble: On the Need to Measure LLM Data Contamination for each Benchmark | Oct 27, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| Real-time Animation Generation and Control on Rigged Models via Large Language Models | Oct 27, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| LLMSTEP: LLM proofstep suggestions in Lean | Oct 27, 2023 | CPU, GPU | Code Available | 1 |
| Qilin-Med-VL: Towards Chinese Large Vision-Language Model for General Healthcare | Oct 27, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| An Open Source Data Contamination Report for Large Language Models | Oct 26, 2023 | HellaSwag, Language Modeling | Code Available | 1 |
| InstOptima: Evolutionary Multi-objective Instruction Optimization via Large Language Model-based Instruction Operators | Oct 26, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| PETA: Evaluating the Impact of Protein Transfer Learning with Sub-word Tokenization on Downstream Applications | Oct 26, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| CompeteAI: Understanding the Competition Dynamics in Large Language Model-based Agents | Oct 26, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| LightLM: A Lightweight Deep and Narrow Language Model for Generative Recommendation | Oct 26, 2023 | Hallucination, Language Modeling | Code Available | 1 |
| Content-based Controls For Music Large Language Modeling | Oct 26, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| RedCoast: A Lightweight Tool to Automate Distributed Training of LLMs on Any GPU/TPUs | Oct 25, 2023 | GPU, Language Modeling | Code Available | 1 |
| Clinfo.ai: An Open-Source Retrieval-Augmented Large Language Model System for Answering Medical Questions using Scientific Literature | Oct 24, 2023 | Abstractive Text Summarization, Information Retrieval | Code Available | 1 |
| DALE: Generative Data Augmentation for Low-Resource Legal NLP | Oct 24, 2023 | Data Augmentation, Decoder | Code Available | 1 |
| AutoDiff: combining Auto-encoder and Diffusion model for tabular data synthesizing | Oct 24, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| LoRAShear: Efficient Large Language Model Structured Pruning and Knowledge Recovery | Oct 24, 2023 | GPU, Language Modeling | Code Available | 1 |
| Vision-Language Pseudo-Labels for Single-Positive Multi-Label Learning | Oct 24, 2023 | image-classification, Image Classification | Code Available | 1 |
| TRAMS: Training-free Memory Selection for Long-range Language Modeling | Oct 24, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| Pre-Trained Language Models Augmented with Synthetic Scanpaths for Natural Language Understanding | Oct 23, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| GeoLM: Empowering Language Models for Geospatially Grounded Language Understanding | Oct 23, 2023 | Articles, Contrastive Learning | Code Available | 1 |
| GRENADE: Graph-Centric Language Model for Self-Supervised Representation Learning on Text-Attributed Graphs | Oct 23, 2023 | Contrastive Learning, Graph Neural Network | Code Available | 1 |
| LLM-in-the-loop: Leveraging Large Language Model for Thematic Analysis | Oct 23, 2023 | In-Context Learning, Language Modeling | Code Available | 1 |
| Improving Seq2Seq Grammatical Error Correction via Decoding Interventions | Oct 23, 2023 | Decoder, Grammatical Error Correction | Code Available | 1 |