| Balancing Diversity and Risk in LLM Sampling: How to Select Your Method and Parameter for Open-Ended Text Generation | Aug 24, 2024 | Diversity, Sentence | Code Available | 1 |
| Multimodal Causal Reasoning Benchmark: Challenging Vision Large Language Models to Infer Causal Links Between Siamese Images | Aug 15, 2024 | Image Generation, Sentence | Code Available | 1 |
| FastFiD: Improve Inference Efficiency of Open Domain Question Answering via Sentence Selection | Aug 12, 2024 | Answer Generation, Decoder | Code Available | 1 |
| SentenceVAE: Enable Next-sentence Prediction for Large Language Models with Faster Speed, Higher Accuracy and Longer Context | Aug 1, 2024 | Decoder, Sentence | Code Available | 1 |
| Can Editing LLMs Inject Harm? | Jul 29, 2024 | Fairness, General Knowledge | Code Available | 1 |
| ClinicRealm: Re-evaluating Large Language Models with Conventional Machine Learning for Non-Generative Clinical Prediction Tasks | Jul 26, 2024 | Benchmarking, Model Selection | Code Available | 1 |
| AutoAD-Zero: A Training-Free Framework for Zero-Shot Audio Description | Jul 22, 2024 | Sentence | Code Available | 1 |
| Multi-Grained Query-Guided Set Prediction Network for Grounded Multimodal Named Entity Recognition | Jul 17, 2024 | Grounded Multimodal Named Entity Recognition, Machine Reading Comprehension | Code Available | 1 |
| AnyTaskTune: Advanced Domain-Specific Solutions through Task-Fine-Tuning | Jul 9, 2024 | Keyword Extraction, Sentence | Code Available | 1 |
| FineSurE: Fine-grained Summarization Evaluation using LLMs | Jul 1, 2024 | Benchmarking, Hallucination | Code Available | 1 |