SOTAVerified

Multiple-choice

Papers

Showing 551600 of 1107 papers

TitleStatusHype
End-to-end Concept Word Detection for Video Captioning, Retrieval, and Question Answering0
Enhancing Distractor Generation for Multiple-Choice Questions with Retrieval Augmented Pretraining and Knowledge Graph Integration0
Enhancing Event Causality Identification with Rationale and Structure-Aware Causal Question Answering0
Towards Collective Superintelligence: Amplifying Group IQ using Conversational Swarms0
Towards combinatorial clustering: preliminary research survey0
Enhancing LLM Evaluations: The Garbling Trick0
Enhancing LLMs' Reasoning-Intensive Multimedia Search Capabilities through Fine-Tuning and Reinforcement Learning0
Enhancing Multiple-choice Machine Reading Comprehension by Punishing Illogical Interpretations0
Enhancing Multiple-Choice Question Answering with Causal Knowledge0
Enhancing Video-LLM Reasoning via Agent-of-Thoughts Distillation0
EQUATOR: A Deterministic Framework for Evaluating LLM Reasoning with Open-Ended Questions. # v1.0.0-beta0
Establishing Task Scaling Laws via Compute-Efficient Model Ladders0
Towards Conversational AI for Disease Management0
Evalita-LLM: Benchmarking Large Language Models on Italian0
Towards Decision Support Technology Platform for Modular Systems0
Evaluating LLM-corrupted Crowdsourcing Data Without Ground Truth0
Evaluating LLM -- Generated Multimodal Diagnosis from Medical Images and Symptom Analysis0
Evaluating LLMs on Document-Based QA: Exact Answer Selection and Numerical Extraction using Cogtale dataset0
Evaluating Machine Reading Systems through Comprehension Tests0
Evaluating multiple large language models in pediatric ophthalmology0
Evaluating Nuanced Bias in Large Language Model Free Response Answers0
Evaluating Question Answering Evaluation0
A Corpus of Text Data and Gaze Fixations from Autistic and Non-Autistic Adults0
Evaluating the Performance and Robustness of LLMs in Materials Science Q&A and Property Predictions0
Evaluating the Potential of Leading Large Language Models in Reasoning Biology Questions0
Evaluating the Rationale Understanding of Critical Reasoning in Logical Reading Comprehension0
Evaluating the Symbol Binding Ability of Large Language Models for Multiple-Choice Questions in Vietnamese General Education0
Evaluating Vision-Language and Large Language Models for Automated Student Assessment in Indonesian Classrooms0
Evaluating Visual and Cultural Interpretation: The K-Viscuit Benchmark with Human-VLM Collaboration0
Evaluation of Automatically Generated Pronoun Reference Questions0
Examining the Behavior of LLM Architectures Within the Framework of Standardized National Exams in Brazil0
Towards Geo-Culturally Grounded LLM Generations0
Towards Integrated Glance To Restructuring in Combinatorial Optimization0
ExplanationLP: Abductive Reasoning for Explainable Science Question Answering0
Towards Mixed-Precision Quantization of Neural Networks via Constrained Optimization0
Explore then Determine: A GNN-LLM Synergy Framework for Reasoning over Knowledge Graph0
Exploring syntactic information in sentence embeddings through multilingual subject-verb agreement0
Exploring the Capabilities of Prompted Large Language Models in Educational and Assessment Applications0
Exploring the Comprehension of ChatGPT in Traditional Chinese Medicine Knowledge0
How Additional Knowledge can Improve Natural Language Commonsense Question Answering?0
Exposing the Limits of Video-Text Models through Contrast Sets0
Towards Multilingual LLM Evaluation for Baltic and Nordic languages: A study on Lithuanian History0
FactTest: Factuality Testing in Large Language Models with Finite-Sample and Distribution-Free Guarantees0
Towards Multistage Design of Modular Systems0
FAMULUS: Interactive Annotation and Feedback Generation for Teaching Diagnostic Reasoning0
FarsEval-PKBETS: A new diverse benchmark for evaluating Persian large language models0
Town Hall Debate Prompting: Enhancing Logical Reasoning in LLMs through Multi-Persona Interaction0
FAVOR-Bench: A Comprehensive Benchmark for Fine-Grained Video Motion Understanding0
Few-Shot Image Classification and Segmentation as Visual Question Answering Using Vision-Language Models0
Field-testing items using artificial intelligence: Natural language processing with transformers0
Show:102550
← PrevPage 12 of 23Next →

No leaderboard results yet.