Vashantor: A Large-scale Multilingual Benchmark Dataset for Automated Translation of Bangla Regional Dialects to Bangla Language Nov 18, 2023 Machine Translation Translation
SentAlign: Accurate and Scalable Sentence Alignment Nov 15, 2023 Machine Translation Sentence
Direct Preference Optimization for Neural Machine Translation with Minimum Bayes Risk Decoding Nov 14, 2023 Machine Translation NMT
Non-autoregressive Machine Translation with Probabilistic Context-free Grammar Nov 14, 2023 Machine Translation Translation
Bilingual Corpus Mining and Multistage Fine-Tuning for Improving Machine Translation of Lecture Transcripts Nov 7, 2023 Benchmarking Machine Translation
The Eval4NLP 2023 Shared Task on Prompting Large Language Models as Explainable Metrics Oct 30, 2023 Machine Translation Text Generation
CreoleVal: Multilingual Multitask Benchmarks for Creoles Oct 30, 2023 Machine Translation Reading Comprehension
Enhanced Simultaneous Machine Translation with Word-level Policies Oct 25, 2023 Machine Translation Translation
Non-autoregressive Streaming Transformer for Simultaneous Translation Oct 23, 2023 Decoder Machine Translation
Linguistically Motivated Sign Language Segmentation Oct 21, 2023 Machine Translation Optical Flow Estimation
On Bilingual Lexicon Induction with Large Language Models Oct 21, 2023 Bilingual Lexicon Induction Cross-Lingual Word Embeddings
CAPIVARA: Cost-Efficient Approach for Improving Multilingual CLIP Performance on Low-Resource Languages Oct 20, 2023 Diversity GPU
knn-seq: Efficient, Extensible kNN-MT Framework Oct 18, 2023 Machine Translation NMT
xCOMET: Transparent Machine Translation Evaluation through Fine-grained Error Detection Oct 16, 2023 Machine Translation Sentence
In-Context Explainers: Harnessing LLMs for Explaining Black Box Models Oct 9, 2023 Explainable artificial intelligence Explainable Artificial Intelligence (XAI)
CodeTransOcean: A Comprehensive Multilingual Benchmark for Code Translation Oct 8, 2023 Code Translation Machine Translation
Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns Oct 3, 2023 Language Modeling Language Modelling
Enhancing Sharpness-Aware Optimization Through Variance Suppression Sep 27, 2023 Data Augmentation image-classification
SignBank+: Preparing a Multilingual Sign Language Dataset for Machine Translation Using Large Language Models Sep 20, 2023 Machine Translation Sign Language Translation
GECTurk: Grammatical Error Correction and Detection Dataset for Turkish Sep 20, 2023 Articles Decoder
SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and Dialects Sep 14, 2023 Cross-Lingual Transfer Language Modelling
Mitigating Hallucinations and Off-target Machine Translation with Source-Contrastive and Language-Contrastive Decoding Sep 13, 2023 Machine Translation Translation
Document AI: A Comparative Study of Transformer-Based, Graph-Based Models, and Convolutional Neural Networks For Document Layout Analysis Aug 29, 2023 Document AI Document Layout Analysis
CLIPTrans: Transferring Visual Knowledge with Pre-trained Models for Multimodal Machine Translation Aug 29, 2023 Image Captioning Machine Translation
Translate Meanings, Not Just Words: IdiomKB's Role in Optimizing Idiomatic Translation with Language Models Aug 26, 2023 Machine Translation Translation
Improving Translation Faithfulness of Large Language Models via Augmenting Instructions Aug 24, 2023 Instruction Following Machine Translation
SOTASTREAM: A Streaming Approach to Machine Translation Training Aug 14, 2023 Machine Translation Management
Show Me the World in My Language: Establishing the First Baseline for Scene-Text to Scene-Text Translation Aug 6, 2023 Machine Translation Scene Text Editing
ESRL: Efficient Sampling-based Reinforcement Learning for Sequence Generation Aug 4, 2023 Abstractive Text Summarization Language Modeling
Do Multilingual Language Models Think Better in English? Aug 2, 2023 Common Sense Reasoning Cross-Lingual Natural Language Inference
mCLIP: Multilingual CLIP via Cross-lingual Transfer Jul 10, 2023 Contrastive Learning Cross-Lingual Transfer
X-RiSAWOZ: High-Quality End-to-End Multilingual Dialogue Datasets and Few-shot Agents Jun 30, 2023 Entity Alignment Machine Translation
Tokenization and the Noiseless Channel Jun 29, 2023 Machine Translation
VisText: A Benchmark for Semantically Rich Chart Captioning Jun 28, 2023 Machine Translation Text Generation
Training Transformers with 4-bit Integers Jun 21, 2023 image-classification Image Classification
GIO: Gradient Information Optimization for Training Dataset Selection Jun 20, 2023 Machine Translation Spelling Correction
Explicit Syntactic Guidance for Neural Text Generation Jun 20, 2023 Diversity Machine Translation
Babel-ImageNet: Massively Multilingual Evaluation of Vision-and-Language Representations Jun 14, 2023 image-classification Image Classification
INK: Injecting kNN Knowledge in Nearest Neighbor Machine Translation Jun 10, 2023 Machine Translation Translation
MCTS: A Multi-Reference Chinese Text Simplification Dataset Jun 5, 2023 Machine Translation Text Simplification
Binary and Ternary Natural Language Generation Jun 2, 2023 Machine Translation Quantization
BLEU Meets COMET: Combining Lexical and Neural Metrics Towards Robust Machine Translation Evaluation May 30, 2023 Machine Translation Sentence
A Systematic Study and Comprehensive Evaluation of ChatGPT on Benchmark Datasets May 29, 2023 Bias Detection Code Generation
An Open-Source Gloss-Based Baseline for Spoken to Signed Language Translation May 28, 2023 Machine Translation Sentence
Exploring Better Text Image Translation with Multimodal Codebook May 27, 2023 Machine Translation Optical Character Recognition
BIG-C: a Multimodal Multi-Purpose Dataset for Bemba May 26, 2023 Machine Translation speech-recognition
Bridging the Domain Gaps in Context Representations for k-Nearest Neighbor Neural Machine Translation May 26, 2023 Domain Adaptation Machine Translation
Songs Across Borders: Singable and Controllable Neural Lyric Translation May 26, 2023 Machine Translation NMT
Towards Higher Pareto Frontier in Multilingual Machine Translation May 25, 2023 Knowledge Distillation Machine Translation
CMOT: Cross-modal Mixup via Optimal Transport for Speech Translation May 24, 2023 Machine Translation Translation