| Scope Ambiguities in Large Language Models | Apr 5, 2024 | World Knowledge | CodeCode Available | 0 |
| BEAR: A Unified Framework for Evaluating Relational Knowledge in Causal and Masked Language Models | Apr 5, 2024 | Factual probeGeneral Knowledge | CodeCode Available | 1 |
| PRobELM: Plausibility Ranking Evaluation for Language Models | Apr 4, 2024 | Question AnsweringTruthfulQA | —Unverified | 0 |
| PARIS3D: Reasoning-based 3D Part Segmentation Using Large Multimodal Model | Apr 4, 2024 | 3D Part SegmentationBenchmarking | CodeCode Available | 1 |
| GS2Mesh: Surface Reconstruction from Gaussian Splatting via Novel Stereo Views | Apr 2, 2024 | 3DGSNovel View Synthesis | CodeCode Available | 3 |
| Towards Better Generalization in Open-Domain Question Answering by Mitigating Context Memorization | Apr 2, 2024 | MemorizationOpen-Domain Question Answering | —Unverified | 0 |
| LLMTreeRec: Unleashing the Power of Large Language Models for Cold-Start Recommendations | Mar 31, 2024 | Recommendation SystemsRe-Ranking | CodeCode Available | 0 |
| EventGround: Narrative Reasoning by Grounding to Eventuality-centric Knowledge Graphs | Mar 30, 2024 | Graph Neural NetworkKnowledge Graphs | CodeCode Available | 0 |
| Enhancing Content-based Recommendation via Large Language Model | Mar 30, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Are We on the Right Way for Evaluating Large Vision-Language Models? | Mar 29, 2024 | World Knowledge | CodeCode Available | 3 |