| Teaching LLMs to Abstain across Languages via Multilingual Feedback | Jun 22, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Teaching Smaller Language Models To Generalise To Unseen Compositional Questions | Aug 2, 2023 | ARCInformation Retrieval | CodeCode Available | 0 |
| Teaching Specific Scientific Knowledge into Large Language Models through Additional Training | Dec 6, 2023 | Hyperparameter OptimizationLanguage Modeling | CodeCode Available | 0 |
| mu-Forcing: Training Variational Recurrent Autoencoders for Text Generation | May 24, 2019 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| PoeLM: A Meter- and Rhyme-Controllable Language Model for Unsupervised Poetry Generation | May 24, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| MaxUp: A Simple Way to Improve Generalization of Neural Network Training | Feb 20, 2020 | Few-Shot Image ClassificationGeneral Classification | CodeCode Available | 0 |
| No Wrong Turns: The Simple Geometry Of Neural Networks Optimization Paths | Jun 20, 2023 | image-classificationImage Classification | CodeCode Available | 0 |
| Mind Scramble: Unveiling Large Language Model Psychology Via Typoglycemia | Oct 2, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Team Ohio State at CMCL 2021 Shared Task: Fine-Tuned RoBERTa for Eye-Tracking Data Prediction | Jun 1, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Team Papelo: Transformer Networks at FEVER | Jan 8, 2019 | ArticlesLanguage Modeling | CodeCode Available | 0 |
| Pneg: Prompt-based Negative Response Generation for Dialogue Response Selection Task | Oct 31, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Rethinking Channel Dimensions to Isolate Outliers for Low-bit Weight Quantization of Large Language Models | Sep 27, 2023 | HumanEvalLanguage Modeling | CodeCode Available | 0 |
| Resolving References in Visually-Grounded Dialogue via Text Generation | Sep 23, 2023 | Image RetrievalLanguage Modeling | CodeCode Available | 0 |
| Wanda++: Pruning Large Language Models via Regional Gradients | Mar 6, 2025 | DecoderGPU | CodeCode Available | 0 |
| MindOmni: Unleashing Reasoning Generation in Vision Language Models with RGPO | May 19, 2025 | DecoderImage Generation | CodeCode Available | 0 |
| MIMO: A Medical Vision Language Model with Visual Referring Multimodal Input and Pixel Grounding Multimodal Output | Jan 1, 2025 | Instruction FollowingLanguage Modeling | CodeCode Available | 0 |
| mTSBench: Benchmarking Multivariate Time Series Anomaly Detection and Model Selection at Scale | Jun 26, 2025 | Anomaly DetectionBenchmarking | CodeCode Available | 0 |
| MST5 -- Multilingual Question Answering over Knowledge Graphs | Jul 8, 2024 | DiversityGraph Question Answering | CodeCode Available | 0 |
| TeDA: Boosting Vision-Lanuage Models for Zero-Shot 3D Object Retrieval via Testing-time Distribution Alignment | May 5, 2025 | 3D Object RetrievalLanguage Modeling | CodeCode Available | 0 |
| MSDT: Masked Language Model Scoring Defense in Text Domain | Nov 10, 2022 | Backdoor Attackbackdoor defense | CodeCode Available | 0 |
| Transformer based neural networks for emotion recognition in conversations | May 18, 2024 | Causal Language ModelingEmotion Classification | CodeCode Available | 0 |
| TEII: Think, Explain, Interact and Iterate with Large Language Models to Solve Cross-lingual Emotion Detection | May 27, 2024 | Few-Shot LearningLanguage Modeling | CodeCode Available | 0 |
| Logit Separability-Driven Samples and Multiple Class-Related Words Selection for Advancing In-Context Learning | Jun 16, 2024 | In-Context LearningLanguage Modeling | CodeCode Available | 0 |
| Resolving Indirect Referring Expressions for Entity Selection | Dec 21, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Plug-and-Play Performance Estimation for LLM Services without Relying on Labeled Data | Oct 10, 2024 | In-Context LearningLanguage Modeling | CodeCode Available | 0 |
| Telling Stories for Common Sense Zero-Shot Action Recognition | Sep 29, 2023 | Action RecognitionArticles | CodeCode Available | 0 |
| Reproducing NevIR: Negation in Neural Information Retrieval | Feb 19, 2025 | Information RetrievalLanguage Modeling | CodeCode Available | 0 |
| Tell me what I need to know: Exploring LLM-based (Personalized) Abstractive Multi-Source Meeting Summarization | Oct 18, 2024 | InformativenessLanguage Modeling | CodeCode Available | 0 |
| Using Pre-Trained Language Models for Producing Counter Narratives Against Hate Speech: a Comparative Study | Apr 4, 2022 | Automatic Post-EditingLanguage Modeling | CodeCode Available | 0 |
| Reproducing and Regularizing the SCRN Model | Aug 1, 2018 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| MetaSC: Test-Time Safety Specification Optimization for Language Models | Feb 11, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| MpoxVLM: A Vision-Language Model for Diagnosing Skin Lesions from Mpox Virus Infection | Nov 16, 2024 | DiagnosticInstruction Following | CodeCode Available | 0 |
| Reproducibility study of "LICO: Explainable Models with Language-Image Consistency" | Oct 17, 2024 | Explainable Modelsimage-classification | CodeCode Available | 0 |
| Representation of linguistic form and function in recurrent neural networks | Feb 29, 2016 | FormLanguage Modeling | CodeCode Available | 0 |
| Representation Learning of Daily Movement Data Using Text Encoders | May 7, 2024 | ClusteringLanguage Modeling | CodeCode Available | 0 |
| TempoGPT: Enhancing Temporal Reasoning via Quantizing Embedding | Jan 13, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Temporal Action Detection Using a Statistical Language Model | Jun 1, 2016 | Action DetectionAction Recognition | CodeCode Available | 0 |
| Temporal Analysis of Language through Neural Language Models | May 14, 2014 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Transformer Meets Twicing: Harnessing Unattended Residual Information | Mar 2, 2025 | Adversarial Robustnessimage-classification | CodeCode Available | 0 |
| PLDR-LLM: Large Language Model from Power Law Decoder Representations | Oct 22, 2024 | DecoderGraph Attention | CodeCode Available | 0 |
| XRJL-HKUST at SemEval-2021 Task 4: WordNet-Enhanced Dual Multi-head Co-Attention for Reading Comprehension of Abstract Meaning | Mar 30, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Temporal-Oriented Recipe for Transferring Large Vision-Language Model to Video Understanding | May 19, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Watch What You Just Said: Image Captioning with Text-Conditional Attention | Jun 15, 2016 | Image CaptioningLanguage Modeling | CodeCode Available | 0 |
| Representation Degeneration Problem in Training Natural Language Generation Models | Jul 28, 2019 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Plausible-Parrots @ MSP2023: Enhancing Semantic Plausibility Modeling using Entity and Event Knowledge | Aug 29, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| MovSAM: A Single-image Moving Object Segmentation Framework Based on Deep Thinking | Apr 9, 2025 | Autonomous DrivingLanguage Modeling | CodeCode Available | 0 |
| PK-Chat: Pointer Network Guided Knowledge Driven Generative Dialogue Model | Apr 2, 2023 | Knowledge GraphsLanguage Modeling | CodeCode Available | 0 |
| Mapping and Cleaning Open Commonsense Knowledge Bases with Generative Translation | Jun 22, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Tensor Product Attention Is All You Need | Jan 11, 2025 | AllLanguage Modeling | CodeCode Available | 0 |
| RepLiQA: A Question-Answering Dataset for Benchmarking LLMs on Unseen Reference Content | Jun 17, 2024 | BenchmarkingGeneral Knowledge | CodeCode Available | 0 |