SOTAVerified

Open-Ended Question Answering

Open-ended questions pose a question without constraining the format of the response. This distinguishes them from questions with a predetermined answer format.

Papers

Showing 101-125 of 796 papers

Title | Status | Hype
Investigating BERT's Knowledge of Language: Five Analysis Methods with NPIs | Code | 1
Coresets for Data-efficient Training of Machine Learning Models | Code | 1
Strategies for Pre-training Graph Neural Networks | Code | 1
Learning to Cluster Faces on an Affinity Graph | Code | 1
Hybrid Task Cascade for Instance Segmentation | Code | 1
ChestX-ray8: Hospital-scale Chest X-ray Database and Benchmarks on Weakly-Supervised Classification and Localization of Common Thorax Diseases | Code | 1
VersaVid-R1: A Versatile Video Understanding and Reasoning Model from Question Answering to Captioning Tasks | - | 0
WIP: Large Language Model-Enhanced Smart Tutor for Undergraduate Circuit Analysis | - | 0
anyECG-chat: A Generalist ECG-MLLM for Flexible ECG Input and Multi-Task Understanding | - | 0
CulFiT: A Fine-grained Cultural-aware LLM Training Paradigm via Multilingual Critique Data Synthesis | Code | 0
TinyRS-R1: Compact Multimodal Language Model for Remote Sensing | - | 0
VLM Q-Learning: Aligning Vision-Language Models for Interactive Decision-Making | - | 0
Accommodate Knowledge Conflicts in Retrieval-augmented LLMs: Towards Reliable Response Generation in the Wild | - | 0
AutoDrive-QA- Automated Generation of Multiple-Choice Questions for Autonomous Driving Datasets Using Large Vision-Language Models | - | 0
Time-MQA: Time Series Multi-Task Question Answering with Context Enhancement | - | 0
PRIV-QA: Privacy-Preserving Question Answering for Cloud Large Language Models | Code | 0
TVBench: Redesigning Video-Language Evaluation | - | 0
Can Knowledge Graphs Make Large Language Models More Trustworthy? An Empirical Study over Open-ended Question Answering | - | 0
Utilize the Flow before Stepping into the Same River Twice: Certainty Represented Knowledge Flow for Refusal-Aware Instruction Tuning | Code | 0
Video Instruction Tuning With Synthetic Data | - | 0
CamelEval: Advancing Culturally Aligned Arabic Language Models and Benchmarks | - | 0
Ranking Generated Answers: On the Agreement of Retrieval Models with Humans on Consumer Health Questions | Code | 0
Reference-Guided Verdict: LLMs-as-Judges in Automatic Evaluation of Free-Form Text | - | 0
TelecomGPT: A Framework to Build Telecom-Specfic Large Language Models | - | 0
Extrinsic Evaluation of Cultural Competence in Large Language Models | Code | 0
Page 5 of 32

No leaderboard results yet.