| Large-vocabulary forensic pathological analyses via prototypical cross-modal contrastive learning | Jul 20, 2024 | Contrastive LearningDiagnostic | CodeCode Available | 1 | 5 |
| Evaluating Language Model Context Windows: A "Working Memory" Test and Inference-time Correction | Jul 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| NaturalProver: Grounded Mathematical Proof Generation with Language Models | May 25, 2022 | Automated Theorem ProvingLanguage Modeling | CodeCode Available | 1 | 5 |
| Causal Distillation for Language Models | Dec 5, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Causal Discovery with Language Models as Imperfect Experts | Jul 5, 2023 | Causal DiscoveryDecision Making | CodeCode Available | 1 | 5 |
| Catwalk: A Unified Language Model Evaluation Framework for Many Datasets | Dec 15, 2023 | In-Context LearningLanguage Model Evaluation | CodeCode Available | 1 | 5 |
| Mathfish: Evaluating Language Model Math Reasoning via Grounding in Educational Curricula | Aug 8, 2024 | GSM8KLanguage Modeling | CodeCode Available | 1 | 5 |
| CREAM: Consistency Regularized Self-Rewarding Language Models | Oct 16, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| LasUIE: Unifying Information Extraction with Latent Adaptive Structure-aware Generative Language Model | Apr 13, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Atla Selene Mini: A General Purpose Evaluation Model | Jan 27, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Evaluating Morphological Alignment of Tokenizers in 70 Languages | Jul 8, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Neural Implicit Vision-Language Feature Fields | Mar 20, 2023 | Image SegmentationLanguage Modeling | CodeCode Available | 1 | 5 |
| Evaluating Retrieval Quality in Retrieval-Augmented Generation | Apr 21, 2024 | GPULanguage Modeling | CodeCode Available | 1 | 5 |
| A Realistic Threat Model for Large Language Model Jailbreaks | Oct 21, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CAT-LM: Training Language Models on Aligned Code And Tests | Oct 2, 2023 | Code GenerationLanguage Modeling | CodeCode Available | 1 | 5 |
| Can ChatGPT Replace Traditional KBQA Models? An In-depth Analysis of the Question Answering Performance of the GPT LLM Family | Mar 14, 2023 | Knowledge Base Question AnsweringLanguage Modeling | CodeCode Available | 1 | 5 |
| Evaluation Benchmarks for Spanish Sentence Representations | Apr 15, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CPM: A Large-scale Generative Chinese Pre-trained Language Model | Dec 1, 2020 | Cloze TestLanguage Modeling | CodeCode Available | 1 | 5 |
| 4-bit Shampoo for Memory-Efficient Network Training | May 28, 2024 | image-classificationImage Classification | CodeCode Available | 1 | 5 |
| Event Causality Identification via Derivative Prompt Joint Learning | Oct 1, 2022 | Event Causality IdentificationLanguage Modeling | CodeCode Available | 1 | 5 |
| Newswire: A Large-Scale Structured Database of a Century of Historical News | Jun 13, 2024 | ArticlesEntity Disambiguation | CodeCode Available | 1 | 5 |
| CPLLM: Clinical Prediction with Large Language Models | Sep 20, 2023 | Disease PredictionLanguage Modeling | CodeCode Available | 1 | 5 |
| CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation | Sep 13, 2021 | DecoderDenoising | CodeCode Available | 1 | 5 |
| Coupling Large Language Models with Logic Programming for Robust and General Reasoning from Text | Jul 15, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Counterfactual Token Generation in Large Language Models | Sep 25, 2024 | Bias Detectioncounterfactual | CodeCode Available | 1 | 5 |