| RaTEScore: A Metric for Radiology Report Generation | Jun 24, 2024 | Diagnostic, Entity Embeddings | Code Available | 4 | 5 |
| Discovering Latent Knowledge in Language Models Without Supervision | Dec 7, 2022 | Imitation Learning, Language Modelling | Code Available | 2 | 5 |
| Controlling Language and Diffusion Models by Transporting Activations | Oct 30, 2024 | Negation | Code Available | 2 | 5 |
| GreaseLM: Graph REASoning Enhanced Language Models for Question Answering | Jan 21, 2022 | Knowledge Graphs, Medical Question Answering | Code Available | 2 | 5 |
| Editing Models with Task Arithmetic | Dec 8, 2022 | Negation, Task Arithmetic | Code Available | 2 | 5 |
| FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models | May 5, 2025 | Benchmarking, Mathematical Reasoning | Code Available | 2 | 5 |
| Is CLIP ideal? No. Can we fix it? Yes! | Mar 10, 2025 | Attribute, Negation | Code Available | 2 | 5 |
| FactKG: Fact Verification via Reasoning on Knowledge Graphs | May 11, 2023 | Fact Verification, Knowledge Graphs | Code Available | 1 | 5 |
| Evaluating Scoped Meaning Representations | Feb 23, 2018 | Natural Language Understanding, Negation | Code Available | 1 | 5 |
| Exploiting Partial Knowledge in Declarative Domain-Specific Heuristics for ASP | Sep 18, 2019 | Negation | Code Available | 1 | 5 |
| Ask Again, Then Fail: Large Language Models' Vacillations in Judgment | Oct 3, 2023 | Negation | Code Available | 1 | 5 |
| Evaluating statistical language models as pragmatic reasoners | May 1, 2023 | Negation, Semantic Parsing | Code Available | 1 | 5 |
| Beta Embeddings for Multi-Hop Logical Reasoning in Knowledge Graphs | Oct 22, 2020 | Complex Query Answering, Knowledge Graphs | Code Available | 1 | 5 |
| Expressive Sign Equivariant Networks for Spectral Geometric Learning | Dec 4, 2023 | Link Prediction, Negation | Code Available | 1 | 5 |
| Approximate Attributions for Off-the-Shelf Siamese Transformers | Feb 5, 2024 | Negation, Sentence | Code Available | 1 | 5 |
| Are Natural Language Inference Models IMPPRESsive? Learning IMPlicature and PRESupposition | Apr 7, 2020 | Diagnostic, Implicatures | Code Available | 1 | 5 |
| Distributional Formal Semantics | Mar 2, 2021 | Negation, Semantic Similarity | Code Available | 1 | 5 |
| CLIPN for Zero-Shot OOD Detection: Teaching CLIP to Say No | Aug 23, 2023 | Negation, Out-of-Distribution Detection | Code Available | 1 | 5 |
| Accelerated and interpretable oblique random survival forests | Aug 1, 2022 | Benchmarking, Computational Efficiency | Code Available | 1 | 5 |
| ConE: Cone Embeddings for Multi-Hop Reasoning over Knowledge Graphs | Oct 26, 2021 | Knowledge Graphs, Negation | Code Available | 1 | 5 |
| Composing Parameter-Efficient Modules with Arithmetic Operations | Jun 26, 2023 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| CREPE: Can Vision-Language Foundation Models Reason Compositionally? | Dec 13, 2022 | Image Retrieval, Negation | Code Available | 1 | 5 |
| Can Linguistic Knowledge Improve Multimodal Alignment in Vision-Language Pretraining? | Aug 24, 2023 | Attribute, Negation | Code Available | 1 | 5 |
| Enhancing Phenotype Recognition in Clinical Notes Using Large Language Models: PhenoBCBERT and PhenoGPT | Aug 11, 2023 | Negation | Code Available | 1 | 5 |
| A Boolean Task Algebra for Reinforcement Learning | Jan 6, 2020 | Lifelong learning, Negation | Code Available | 1 | 5 |