| RAGViz: Diagnose and Visualize Retrieval-Augmented Generation | Nov 4, 2024 | Answer Generation, GPU | Code Available | 2 | 5 |
| Can ChatGPT Assess Human Personalities? A General Evaluation Framework | Mar 1, 2023 | Answer Generation, Fairness | Code Available | 1 | 5 |
| FastFiD: Improve Inference Efficiency of Open Domain Question Answering via Sentence Selection | Aug 12, 2024 | Answer Generation, Decoder | Code Available | 1 | 5 |
| End-to-End Training of Multi-Document Reader and Retriever for Open-Domain Question Answering | Jun 9, 2021 | Answer Generation, Open-Domain Question Answering | Code Available | 1 | 5 |
| EgoNormia: Benchmarking Physical Social Norm Understanding | Feb 27, 2025 | Answer Generation, Benchmarking | Code Available | 1 | 5 |
| Enhancing Multi-modal and Multi-hop Question Answering via Structured Knowledge and Unified Retrieval-Generation | Dec 16, 2022 | Answer Generation, Decoder | Code Available | 1 | 5 |
| GUI-G1: Understanding R1-Zero-Like Training for Visual Grounding in GUI Agents | May 21, 2025 | Answer Generation, Reinforcement Learning (RL) | Code Available | 1 | 5 |
| Asking Questions the Human Way: Scalable Question-Answer Generation from Text Corpus | Jan 27, 2020 | Answer Generation, Chatbot | Code Available | 1 | 5 |
| AuditWen: An Open-Source Large Language Model for Audit | Oct 9, 2024 | Answer Generation, Language Modeling | Code Available | 1 | 5 |
| Do Vision & Language Decoders use Images and Text equally? How Self-consistent are their Explanations? | Apr 29, 2024 | Answer Generation, Benchmarking | Code Available | 1 | 5 |