| Crafting Large Language Models for Enhanced Interpretability | Jul 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Does Your Vision-Language Model Get Lost in the Long Video Sampling Dilemma? | Mar 16, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CPT: Efficient Deep Neural Network Training via Cyclic Precision | Jan 25, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation | Sep 13, 2021 | DecoderDenoising | CodeCode Available | 1 | 5 |
| CPLLM: Clinical Prediction with Large Language Models | Sep 20, 2023 | Disease PredictionLanguage Modeling | CodeCode Available | 1 | 5 |
| Do Large Language Model Benchmarks Test Reliability? | Feb 5, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| ALYMPICS: LLM Agents Meet Game Theory -- Exploring Strategic Decision-Making with AI Agents | Nov 6, 2023 | Decision MakingLanguage Modeling | CodeCode Available | 1 | 5 |
| Language Generation with Strictly Proper Scoring Rules | May 29, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CPM: A Large-scale Generative Chinese Pre-trained Language Model | Dec 1, 2020 | Cloze TestLanguage Modeling | CodeCode Available | 1 | 5 |
| BECEL: Benchmark for Consistency Evaluation of Language Models | Oct 1, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| FLIP: Fine-grained Alignment between ID-based Models and Pretrained Language Models for CTR Prediction | Oct 30, 2023 | Click-Through Rate PredictionContrastive Learning | CodeCode Available | 1 | 5 |
| Safety Tax: Safety Alignment Makes Your Large Reasoning Models Less Reasonable | Mar 1, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| IFSeg: Image-free Semantic Segmentation via Vision-Language Model | Mar 25, 2023 | Image SegmentationLanguage Modeling | CodeCode Available | 1 | 5 |
| Implementing contextual biasing in GPU decoder for online ASR | Jun 23, 2023 | CPUDecoder | CodeCode Available | 1 | 5 |
| DomBERT: Domain-oriented Language Model for Aspect-based Sentiment Analysis | Apr 28, 2020 | Aspect-Based Sentiment AnalysisAspect-Based Sentiment Analysis (ABSA) | CodeCode Available | 1 | 5 |
| DOMINO: A Dual-System for Multi-step Visual Language Reasoning | Oct 4, 2023 | Arithmetic ReasoningLanguage Modeling | CodeCode Available | 1 | 5 |
| iBOT: Image BERT Pre-Training with Online Tokenizer | Nov 15, 2021 | image-classificationImage Classification | CodeCode Available | 1 | 5 |
| IDAS: Intent Discovery with Abstractive Summarization | May 31, 2023 | Abstractive Text SummarizationDescriptive | CodeCode Available | 1 | 5 |
| Counterfactual Token Generation in Large Language Models | Sep 25, 2024 | Bias Detectioncounterfactual | CodeCode Available | 1 | 5 |
| Dynamic Grained Encoder for Vision Transformers | Jan 10, 2023 | image-classificationImage Classification | CodeCode Available | 1 | 5 |
| Benchmarking and Explaining Large Language Model-based Code Generation: A Causality-Centric Approach | Oct 10, 2023 | BenchmarkingCode Generation | CodeCode Available | 1 | 5 |
| An Efficient Multilingual Language Model Compression through Vocabulary Trimming | May 24, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| An Efficient Self-Supervised Cross-View Training For Sentence Embedding | Nov 6, 2023 | Contrastive LearningLanguage Modeling | CodeCode Available | 1 | 5 |
| Downstream Model Design of Pre-trained Language Model for Relation Extraction Task | Apr 8, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| IAA: Inner-Adaptor Architecture Empowers Frozen Large Language Model with Multimodal Capabilities | Aug 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model | Jul 10, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Hypergraph Multi-modal Large Language Model: Exploiting EEG and Eye-tracking Modalities to Evaluate Heterogeneous Responses for Video Understanding | Jul 11, 2024 | EEGLanguage Modeling | CodeCode Available | 1 | 5 |
| Benchmarking Knowledge Boundary for Large Language Models: A Different Perspective on Model Evaluation | Feb 18, 2024 | BenchmarkingLanguage Modeling | CodeCode Available | 1 | 5 |
| Coupling Large Language Models with Logic Programming for Robust and General Reasoning from Text | Jul 15, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| An Embarrassingly Simple Method to Mitigate Undesirable Properties of Pretrained Language Model Tokenizers | May 1, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Counterfactual Data Augmentation for Neural Machine Translation | Jun 1, 2021 | counterfactualData Augmentation | CodeCode Available | 1 | 5 |
| HyperBERT: Mixing Hypergraph-Aware Layers with Language Models for Node Classification on Text-Attributed Hypergraphs | Feb 11, 2024 | Inductive BiasLanguage Modeling | CodeCode Available | 1 | 5 |
| HYTREL: Hypergraph-enhanced Tabular Data Representation Learning | Jul 14, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Identifying Interpretable Subspaces in Image Representations | Jul 20, 2023 | counterfactualLanguage Modeling | CodeCode Available | 1 | 5 |
| Hybrid Ranking Network for Text-to-SQL | Aug 11, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Language Models are Unsupervised Multitask Learners | Feb 14, 2019 | Common Sense ReasoningCoreference Resolution | CodeCode Available | 1 | 5 |
| AutoDIR: Automatic All-in-One Image Restoration with Latent Diffusion | Oct 16, 2023 | AllImage Quality Assessment | CodeCode Available | 1 | 5 |
| Hydra: A System for Large Multi-Model Deep Learning | Oct 16, 2021 | Deep LearningGPU | CodeCode Available | 1 | 5 |
| Language Model Unalignment: Parametric Red-Teaming to Expose Hidden Harms and Biases | Oct 22, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Language Model Uncertainty Quantification with Attention Chain | Mar 24, 2025 | Computational EfficiencyLanguage Modeling | CodeCode Available | 1 | 5 |
| Benchmarking Vision Language Model Unlearning via Fictitious Facial Identity Dataset | Nov 5, 2024 | BenchmarkingLanguage Modeling | CodeCode Available | 1 | 5 |
| Asleep at the Keyboard? Assessing the Security of GitHub Copilot's Code Contributions | Aug 20, 2021 | Code GenerationDiversity | CodeCode Available | 1 | 5 |
| AutoDiff: combining Auto-encoder and Diffusion model for tabular data synthesizing | Oct 24, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Advancing High Resolution Vision-Language Models in Biomedicine | Jun 12, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CoSafe: Evaluating Large Language Model Safety in Multi-Turn Dialogue Coreference | Jun 25, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Cross-Align: Modeling Deep Cross-lingual Interactions for Word Alignment | Oct 9, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Correcting Diverse Factual Errors in Abstractive Summarization via Post-Editing and Language Model Infilling | Oct 22, 2022 | Abstractive Text SummarizationLanguage Modeling | CodeCode Available | 1 | 5 |
| cosFormer: Rethinking Softmax in Attention | Feb 17, 2022 | D4RLLanguage Modeling | CodeCode Available | 1 | 5 |
| data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student training setup | Nov 2, 2022 | Automatic Speech Recognition (ASR)Language Modeling | CodeCode Available | 1 | 5 |
| HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation | Dec 17, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |