| Allocating Large Vocabulary Capacity for Cross-lingual Language Model Pre-training | Sep 15, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Evolving Deep Neural Networks | Mar 1, 2017 | Deep LearningImage Captioning | CodeCode Available | 1 |
| Excuse me, sir? Your language model is leaking (information) | Jan 18, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| An Engorgio Prompt Makes Large Language Model Babble on | Dec 27, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Event Causality Identification via Derivative Prompt Joint Learning | Oct 1, 2022 | Event Causality IdentificationLanguage Modeling | CodeCode Available | 1 |
| Can ChatGPT Replace Traditional KBQA Models? An In-depth Analysis of the Question Answering Performance of the GPT LLM Family | Mar 14, 2023 | Knowledge Base Question AnsweringLanguage Modeling | CodeCode Available | 1 |
| Evaluation Benchmarks for Spanish Sentence Representations | Apr 15, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Evolutionary Large Language Model for Automated Feature Transformation | May 25, 2024 | Efficient ExplorationEvolutionary Algorithms | CodeCode Available | 1 |
| Expert Knowledge-Aware Image Difference Graph Representation Learning for Difference-Aware Medical Visual Question Answering | Jul 22, 2023 | Graph Representation LearningLanguage Modeling | CodeCode Available | 1 |
| Evaluating Language Models as Synthetic Data Generators | Dec 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Evaluating Morphological Alignment of Tokenizers in 70 Languages | Jul 8, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Large Language Model Inference Acceleration: A Comprehensive Hardware Perspective | Oct 6, 2024 | CPUGPU | CodeCode Available | 1 |
| Large Language Model Is Not a Good Few-shot Information Extractor, but a Good Reranker for Hard Samples! | Mar 15, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Large Language Model (LLM) as a System of Multiple Expert Agents: An Approach to solve the Abstraction and Reasoning Corpus (ARC) Challenge | Oct 8, 2023 | ARCLanguage Modeling | CodeCode Available | 1 |
| AuditWen:An Open-Source Large Language Model for Audit | Oct 9, 2024 | Answer GenerationLanguage Modeling | CodeCode Available | 1 |
| Evaluating Language Model Finetuning Techniques for Low-resource Languages | Jun 30, 2019 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Mathfish: Evaluating Language Model Math Reasoning via Grounding in Educational Curricula | Aug 8, 2024 | GSM8KLanguage Modeling | CodeCode Available | 1 |
| Can Large Language Model Comprehend Ancient Chinese? A Preliminary Test on ACLUE | Oct 14, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Beyond Prompt Engineering: Robust Behavior Control in LLMs via Steering Target Atoms | May 23, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Can Large Language Model Agents Balance Energy Systems? | Feb 14, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| VLLaVO: Mitigating Visual Gap through LLMs | Jan 6, 2024 | Domain AdaptationDomain Generalization | CodeCode Available | 1 |
| Evaluating Retrieval Quality in Retrieval-Augmented Generation | Apr 21, 2024 | GPULanguage Modeling | CodeCode Available | 1 |
| Large Language Models Can Be Easily Distracted by Irrelevant Context | Jan 31, 2023 | Arithmetic ReasoningLanguage Modeling | CodeCode Available | 1 |
| Evaluating Human-Language Model Interaction | Dec 19, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| BreakGPT: A Large Language Model with Multi-stage Structure for Financial Breakout Detection | Feb 12, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Protein Structure Tokenization: Benchmarking and New Recipe | Feb 28, 2025 | BenchmarkingLanguage Modeling | CodeCode Available | 1 |
| Bot or Human? Detecting ChatGPT Imposters with A Single Question | May 10, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Large Language Model Unlearning via Embedding-Corrupted Prompts | Jun 12, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Evaluating Language Model Context Windows: A "Working Memory" Test and Inference-time Correction | Jul 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Beyond the Next Token: Towards Prompt-Robust Zero-Shot Classification via Efficient Multi-Token Prediction | Apr 4, 2025 | AttributeLanguage Modeling | CodeCode Available | 1 |
| Supervised Learning and Large Language Model Benchmarks on Mental Health Datasets: Cognitive Distortions and Suicidal Risks in Chinese Social Media | Sep 7, 2023 | ClassificationLanguage Modeling | CodeCode Available | 1 |
| CORAL: Expert-Curated medical Oncology Reports to Advance Language Model Inference | Aug 7, 2023 | Event DetectionLanguage Modeling | CodeCode Available | 1 |
| ESRL: Efficient Sampling-based Reinforcement Learning for Sequence Generation | Aug 4, 2023 | Abstractive Text SummarizationLanguage Modeling | CodeCode Available | 1 |
| Latxa: An Open Language Model and Evaluation Suite for Basque | Mar 29, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Adversarial Training for Aspect-Based Sentiment Analysis with BERT | Jan 30, 2020 | Aspect-Based Sentiment AnalysisAspect-Based Sentiment Analysis (ABSA) | CodeCode Available | 1 |
| Establishing baselines for generative discovery of inorganic crystals | Jan 4, 2025 | Band GapLanguage Modeling | CodeCode Available | 1 |
| ESCOXLM-R: Multilingual Taxonomy-driven Pre-training for the Job Market Domain | May 20, 2023 | De-identificationLanguage Modeling | CodeCode Available | 1 |
| Bias-Augmented Consistency Training Reduces Biased Reasoning in Chain-of-Thought | Mar 8, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Espresso: A Fast End-to-end Neural Speech Recognition Toolkit | Sep 18, 2019 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| BiasEdit: Debiasing Stereotyped Language Models via Model Editing | Mar 11, 2025 | counterfactualLanguage Modeling | CodeCode Available | 1 |
| Estimating Contamination via Perplexity: Quantifying Memorisation in Language Model Evaluation | Sep 19, 2023 | Language Model EvaluationLanguage Modeling | CodeCode Available | 1 |
| ERNIE 3.0 Titan: Exploring Larger-scale Knowledge Enhanced Pre-training for Language Understanding and Generation | Dec 23, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Entropy-Regularized Token-Level Policy Optimization for Language Agent Reinforcement | Feb 9, 2024 | Code GenerationDecision Making | CodeCode Available | 1 |
| Adaptive Contrastive Search: Uncertainty-Guided Decoding for Open-Ended Text Generation | Jul 26, 2024 | DiversityLanguage Modeling | CodeCode Available | 1 |
| Learning Approximate Inference Networks for Structured Prediction | Mar 9, 2018 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Chess as a Testbed for Language Model State Tracking | Feb 26, 2021 | Game of ChessLanguage Modeling | CodeCode Available | 1 |
| Epidemic Modeling with Generative Agents | Jul 11, 2023 | Decision MakingLanguage Modeling | CodeCode Available | 1 |
| Learning Contextual Representations for Semantic Parsing with Generation-Augmented Pre-Training | Dec 18, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Learning from Unlabeled 3D Environments for Vision-and-Language Navigation | Aug 24, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| EscapeBench: Pushing Language Models to Think Outside the Box | Dec 18, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |