OpenICL: An Open-Source Framework for In-context Learning Mar 6, 2023 In-Context Learning Language Modeling
Code Code Available 2Inseq: An Interpretability Toolkit for Sequence Generation Models Feb 27, 2023 Decoder Feature Importance
Code Code Available 2Binarized Neural Machine Translation Feb 9, 2023 Binarization Machine Translation
Code Code Available 2Is ChatGPT A Good Translator? Yes With GPT-4 As The Engine Jan 20, 2023 Machine Translation Sentence
Code Code Available 2Democratizing Neural Machine Translation with OPUS-MT Dec 4, 2022 Machine Translation Translation
Code Code Available 2Model and Data Transfer for Cross-Lingual Sequence Labelling in Zero-Resource Settings Oct 23, 2022 Cross-Lingual NER Cross-Lingual Transfer
Code Code Available 2Mega: Moving Average Equipped Gated Attention Sep 21, 2022 Image Classification Inductive Bias
Code Code Available 2AlexaTM 20B: Few-Shot Learning Using a Large-Scale Multilingual Seq2Seq Model Aug 2, 2022 Causal Language Modeling Common Sense Reasoning
Code Code Available 2No Language Left Behind: Scaling Human-Centered Machine Translation Jul 11, 2022 Machine Translation Mixture-of-Experts
Code Code Available 2Shifts 2.0: Extending The Dataset of Real Distributional Shifts Jun 30, 2022 Autonomous Driving image-classification
Code Code Available 2Cross-lingual and Multilingual CLIP Jun 1, 2022 Contrastive Learning Image-text Retrieval
Code Code Available 2CoNT: Contrastive Neural Text Generation May 29, 2022 Code Comment Generation Comment Generation
Code Code Available 2Overcoming Catastrophic Forgetting in Zero-Shot Cross-Lingual Generation May 25, 2022 Cross-Lingual Transfer Machine Translation
Code Code Available 2Automated Deep Learning: Neural Architecture Search Is Not the End Dec 16, 2021 Deep Learning Machine Translation
Code Code Available 2LightSeq2: Accelerated Training for Transformer-based Models on GPUs Oct 12, 2021 Decoder GPU
Code Code Available 2When Attention Meets Fast Recurrence: Training Language Models with Reduced Compute Feb 24, 2021 GPU Language Modeling
Code Code Available 2LightSeq: A High Performance Inference Library for Transformers Oct 23, 2020 GPU Machine Translation
Code Code Available 2TextAttack: A Framework for Adversarial Attacks, Data Augmentation, and Adversarial Training in NLP Apr 29, 2020 Adversarial Attack Adversarial Text
Code Code Available 2Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer Oct 23, 2019 Answer Generation Common Sense Reasoning
Code Code Available 2MASS: Masked Sequence to Sequence Pre-training for Language Generation May 7, 2019 Conversational Response Generation Decoder
Code Code Available 2GPipe: Efficient Training of Giant Neural Networks using Pipeline Parallelism Nov 16, 2018 Fine-Grained Image Classification image-classification
Code Code Available 2Neural Speech Synthesis with Transformer Network Sep 19, 2018 Decoder Machine Translation
Code Code Available 2Texar: A Modularized, Versatile, and Extensible Toolkit for Text Generation Sep 4, 2018 Machine Translation Text Generation
Code Code Available 2The Best of Both Worlds: Combining Recent Advances in Neural Machine Translation Apr 26, 2018 Machine Translation Translation
Code Code Available 2Simple Recurrent Units for Highly Parallelizable Recurrence Sep 8, 2017 General Classification Machine Translation
Code Code Available 2Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer Jan 23, 2017 Computational Efficiency GPU
Code Code Available 2TACTIC: Translation Agents with Cognitive-Theoretic Interactive Collaboration Jun 10, 2025 Machine Translation Translation
Code Code Available 1Universal Reasoner: A Single, Composable Plug-and-Play Reasoner for Frozen LLMs May 25, 2025 Machine Translation Mathematical Reasoning
Code Code Available 1MEDIBENG WHISPER TINY: A FINE-TUNED CODE-SWITCHED BENGALI-ENGLISH TRANSLATOR FOR CLINICAL APPLICATIONS Apr 25, 2025 Clinical Language Translation Machine Translation
Code Code Available 1Remedy: Learning Machine Translation Evaluation from Human Preferences with Reward Modeling Apr 18, 2025 Machine Translation Translation
Code Code Available 1Sun-Shine: A Large Language Model for Tibetan Culture Mar 24, 2025 Language Modeling Language Modelling
Code Code Available 1Distributed LLMs and Multimodal Large Language Models: A Survey on Advances, Challenges, and Future Directions Mar 20, 2025 2D Object Detection Distributed Computing
Code Code Available 1Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine Translation Mar 9, 2025 Decoder Machine Translation
Code Code Available 1Automatic Input Rewriting Improves Translation with Large Language Models Feb 23, 2025 Machine Translation Text Simplification
Code Code Available 1Middle-Layer Representation Alignment for Cross-Lingual Transfer in Fine-Tuned LLMs Feb 20, 2025 Cross-Lingual Transfer Machine Translation
Code Code Available 1Understanding In-Context Machine Translation for Low-Resource Languages: A Case Study on Manchu Feb 17, 2025 Data Augmentation In-Context Learning
Code Code Available 1TUMLU: A Unified and Native Language Understanding Benchmark for Turkic Languages Feb 16, 2025 Machine Translation MMLU
Code Code Available 1How to Select Datapoints for Efficient Human Evaluation of NLG Models? Jan 30, 2025 HumanEval Machine Translation
Code Code Available 1Large Language Models Share Representations of Latent Grammatical Concepts Across Typologically Diverse Languages Jan 10, 2025 Machine Translation
Code Code Available 1Merging Feed-Forward Sublayers for Compressed Transformers Jan 10, 2025 image-classification Image Classification
Code Code Available 1Registering Source Tokens to Target Language Spaces in Multilingual Neural Machine Translation Jan 6, 2025 Machine Translation Translation
Code Code Available 1M-MAD: Multidimensional Multi-Agent Debate Framework for Fine-grained Machine Translation Evaluation Dec 28, 2024 Machine Translation
Code Code Available 1Property Enhanced Instruction Tuning for Multi-task Molecule Generation with Large Language Models Dec 24, 2024 Machine Translation Molecular Property Prediction
Code Code Available 1MT-LENS: An all-in-one Toolkit for Better Machine Translation Evaluation Dec 16, 2024 All Benchmarking
Code Code Available 1Retrieval-Augmented Machine Translation with Unstructured Knowledge Dec 5, 2024 Knowledge Graphs Machine Translation
Code Code Available 1Context-Informed Machine Translation of Manga using Multimodal Large Language Models Nov 4, 2024 Machine Translation Translation
Code Code Available 1MetaMetrics-MT: Tuning Meta-Metrics for Machine Translation via Human Preference Calibration Nov 1, 2024 Bayesian Optimization Gaussian Processes
Code Code Available 1Fine-Grained and Multi-Dimensional Metrics for Document-Level Machine Translation Oct 28, 2024 Document Level Machine Translation Machine Translation
Code Code Available 1How Good Are LLMs for Literary Translation, Really? Literary Translation Evaluation with Humans and LLMs Oct 24, 2024 2k Machine Translation
Code Code Available 1MQM-APE: Toward High-Quality Error Annotation Predictors with Automatic Post-Editing in LLM Translation Evaluators Sep 22, 2024 Automatic Post-Editing Machine Translation
Code Code Available 1