| Morph Call: Probing Morphosyntactic Content of Multilingual Transformers | Apr 26, 2021 | Common Sense ReasoningMORPH | CodeCode Available | 0 |
| Who Relies More on World Knowledge and Bias for Syntactic Ambiguity Resolution: Humans or LLMs? | Mar 13, 2025 | NavigateWorld Knowledge | CodeCode Available | 0 |
| BiasKG: Adversarial Knowledge Graphs to Induce Bias in Large Language Models | May 8, 2024 | Knowledge GraphsLanguage Modeling | CodeCode Available | 0 |
| Does Commonsense help in detecting Sarcasm? | Sep 17, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| AKEW: Assessing Knowledge Editing in the Wild | Feb 29, 2024 | Articlescounterfactual | CodeCode Available | 0 |
| Walk-and-Relate: A Random-Walk-based Algorithm for Representation Learning on Sparse Knowledge Graphs | Sep 19, 2022 | Knowledge GraphsRepresentation Learning | CodeCode Available | 0 |
| Arrows are the Verbs of Diagrams | Aug 1, 2018 | BIG-bench Machine LearningWorld Knowledge | CodeCode Available | 0 |
| Multi-Preference Lambda-weighted Listwise DPO for Dynamic Preference Alignment | Jun 24, 2025 | Informativenessreinforcement-learning | CodeCode Available | 0 |
| TimeCausality: Evaluating the Causal Ability in Time Dimension for Vision Language Models | May 21, 2025 | Human AgingQuestion Answering | CodeCode Available | 0 |
| LLaVA-VSD: Large Language-and-Vision Assistant for Visual Spatial Description | Aug 9, 2024 | DiversityInstruction Following | CodeCode Available | 0 |
| My Teacher Thinks The World Is Flat! Interpreting Automatic Essay Scoring Mechanism | Dec 27, 2020 | Common Sense ReasoningNatural Language Understanding | CodeCode Available | 0 |
| LLMTreeRec: Unleashing the Power of Large Language Models for Cold-Start Recommendations | Mar 31, 2024 | Recommendation SystemsRe-Ranking | CodeCode Available | 0 |
| Benchmarking Spatiotemporal Reasoning in LLMs and Reasoning Models: Capabilities and Challenges | May 16, 2025 | BenchmarkingState Estimation | CodeCode Available | 0 |
| NLITrans at SemEval-2018 Task 12: Transfer of Semantic Knowledge for Argument Comprehension | Apr 23, 2018 | PositionSentence | CodeCode Available | 0 |
| CoRTEx: Contrastive Learning for Representing Terms via Explanations with Applications on Constructing Biomedical Knowledge Graphs | Dec 13, 2023 | ClusteringContrastive Learning | CodeCode Available | 0 |
| FLAME: Self-Supervised Low-Resource Taxonomy Expansion using Large Language Models | Feb 21, 2024 | Recommendation SystemsTaxonomy Expansion | CodeCode Available | 0 |
| Log Probabilities Are a Reliable Estimate of Semantic Plausibility in Base and Instruction-Tuned Language Models | Mar 21, 2024 | SentenceWorld Knowledge | CodeCode Available | 0 |
| LitCQD: Multi-Hop Reasoning in Incomplete Knowledge Graphs with Numeric Literals | Apr 28, 2023 | Knowledge GraphsWorld Knowledge | CodeCode Available | 0 |
| ObjCAViT: Improving Monocular Depth Estimation Using Natural Language Models And Image-Object Cross-Attention | Nov 30, 2022 | Depth EstimationImage-to-Image Translation | CodeCode Available | 0 |
| Are Large Language Models True Healthcare Jacks-of-All-Trades? Benchmarking Across Health Professions Beyond Physician Exams | Jun 17, 2024 | AllBenchmarking | CodeCode Available | 0 |
| Large Language Models Need Consultants for Reasoning: Becoming an Expert in a Complex Human System Through Behavior Simulation | Mar 27, 2024 | Common Sense ReasoningWorld Knowledge | CodeCode Available | 0 |
| Scaling Autoregressive Models for Content-Rich Text-to-Image Generation | Jun 22, 2022 | DecoderImage Generation | CodeCode Available | 0 |
| Finding Motifs in Knowledge Graphs using Compression | Apr 16, 2021 | Knowledge GraphsWorld Knowledge | CodeCode Available | 0 |
| Advancing and Benchmarking Personalized Tool Invocation for LLMs | May 7, 2025 | BenchmarkingWorld Knowledge | CodeCode Available | 0 |
| Video Summarization: Towards Entity-Aware Captions | Dec 1, 2023 | Image CaptioningVideo Captioning | CodeCode Available | 0 |