SOTAVerified

Open-Ended Question Answering

Open-ended questions are defined as those that simply pose the question, without imposing any constraints on the format of the response. This distinguishes them from questions with a predetermined answer format.

Papers

Showing 101150 of 796 papers

TitleStatusHype
Investigating BERT's Knowledge of Language: Five Analysis Methods with NPIsCode1
Coresets for Data-efficient Training of Machine Learning ModelsCode1
Strategies for Pre-training Graph Neural NetworksCode1
Learning to Cluster Faces on an Affinity GraphCode1
Hybrid Task Cascade for Instance SegmentationCode1
ChestX-ray8: Hospital-scale Chest X-ray Database and Benchmarks on Weakly-Supervised Classification and Localization of Common Thorax DiseasesCode1
VersaVid-R1: A Versatile Video Understanding and Reasoning Model from Question Answering to Captioning Tasks0
WIP: Large Language Model-Enhanced Smart Tutor for Undergraduate Circuit Analysis0
anyECG-chat: A Generalist ECG-MLLM for Flexible ECG Input and Multi-Task Understanding0
CulFiT: A Fine-grained Cultural-aware LLM Training Paradigm via Multilingual Critique Data SynthesisCode0
TinyRS-R1: Compact Multimodal Language Model for Remote Sensing0
VLM Q-Learning: Aligning Vision-Language Models for Interactive Decision-Making0
Accommodate Knowledge Conflicts in Retrieval-augmented LLMs: Towards Reliable Response Generation in the Wild0
AutoDrive-QA- Automated Generation of Multiple-Choice Questions for Autonomous Driving Datasets Using Large Vision-Language Models0
Time-MQA: Time Series Multi-Task Question Answering with Context Enhancement0
PRIV-QA: Privacy-Preserving Question Answering for Cloud Large Language ModelsCode0
TVBench: Redesigning Video-Language Evaluation0
Can Knowledge Graphs Make Large Language Models More Trustworthy? An Empirical Study over Open-ended Question Answering0
Utilize the Flow before Stepping into the Same River Twice: Certainty Represented Knowledge Flow for Refusal-Aware Instruction TuningCode0
Video Instruction Tuning With Synthetic Data0
CamelEval: Advancing Culturally Aligned Arabic Language Models and Benchmarks0
Ranking Generated Answers: On the Agreement of Retrieval Models with Humans on Consumer Health QuestionsCode0
Reference-Guided Verdict: LLMs-as-Judges in Automatic Evaluation of Free-Form Text0
TelecomGPT: A Framework to Build Telecom-Specfic Large Language Models0
Extrinsic Evaluation of Cultural Competence in Large Language ModelsCode0
Long Story Short: Story-level Video Understanding from 20K Short Films0
Perception of Knowledge Boundary for Large Language Models through Semi-open-ended Question Answering0
Evaluating the Elementary Multilingual Capabilities of Large Language Models with MultiQCode0
API Is Enough: Conformal Prediction for Large Language Models Without Logit-Access0
Enhancing Large Language Models with Pseudo- and Multisource- Knowledge Graphs for Open-ended Question Answering0
Shai: A large language model for asset management0
Universal Self-Consistency for Large Language Model Generation0
Downstream Trade-offs of a Family of Text WatermarksCode0
Monolingual or Multilingual Instruction Tuning: Which Makes a Better AlpacaCode0
Prompting Large Language Models with Speech Recognition Abilities0
On the Model-Misspecification in Reinforcement Learning0
2D-Shapley: A Framework for Fragmented Data ValuationCode0
Adversaries with Limited Information in the Friedkin--Johnsen ModelCode0
POP: Prompt Of Prompts for Continual Learning0
Provable Accelerated Convergence of Nesterov's Momentum for Deep ReLU Neural Networks0
Non-autoregressive Conditional Diffusion Models for Time Series Prediction0
Benchmarking Foundation Models with Language-Model-as-an-Examiner0
Differences in boundary behavior in the 3D vertex and Voronoi models0
Computation with Sequences in a Model of the Brain0
HireVAE: An Online and Adaptive Factor Model Based on Hierarchical and Regime-Switch VAE0
SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts0
Tight Regret Bounds for Single-pass Streaming Multi-armed BanditsCode0
Dynamic Algorithms for Matroid Submodular Maximization0
Speaking the Language of Your Listener: Audience-Aware Adaptation via Plug-and-Play Theory of MindCode0
Trustworthy Sensor Fusion against Inaudible Command Attacks in Advanced Driver-Assistance System0
Show:102550
← PrevPage 3 of 16Next →

No leaderboard results yet.