| WalledEval: A Comprehensive Safety Evaluation Toolkit for Large Language Models | Aug 7, 2024 | AI and SafetyBenchmarking | CodeCode Available | 1 |
| GAIA -- A Large Language Model for Advanced Power Dispatch | Aug 7, 2024 | Decision MakingLanguage Modeling | —Unverified | 0 |
| How Well Can Vision Language Models See Image Details? | Aug 7, 2024 | Decision MakingImage Segmentation | —Unverified | 0 |
| Openstory++: A Large-scale Dataset and Benchmark for Instance-aware Open-domain Visual Storytelling | Aug 7, 2024 | Image GenerationLanguage Modelling | CodeCode Available | 1 |
| Improving Large Language Model (LLM) fidelity through context-aware grounding: A systematic approach to reliability and veracity | Aug 7, 2024 | FairnessLanguage Modeling | —Unverified | 0 |
| A Comparison of LLM Finetuning Methods & Evaluation Metrics with Travel Chatbot Use Case | Aug 7, 2024 | ChatbotLarge Language Model | —Unverified | 0 |
| MMSummary: Multimodal Summary Generation for Fetal Ultrasound Video | Aug 7, 2024 | AnatomyLanguage Modeling | —Unverified | 0 |
| Digital Avatars: Framework Development and Their Evaluation | Aug 7, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Automated Theorem Provers Help Improve Large Language Model Reasoning | Aug 7, 2024 | Formal LogicLanguage Modeling | —Unverified | 0 |
| EgyBERT: A Large Language Model Pretrained on Egyptian Dialect Corpora | Aug 7, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |