| Diffusion Bridge Implicit Models | May 24, 2024 | Denoising, Diversity | Code Available | 2 |
| MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering | May 20, 2024 | Benchmarking, Question Answering | Code Available | 2 |
| Self-Consistent Recursive Diffusion Bridge for Medical Image Translation | May 10, 2024 | Denoising, Scheduling | Code Available | 2 |
| GenN2N: Generative NeRF2NeRF Translation | Apr 3, 2024 | Colorization, Contrastive Learning | Code Available | 2 |
| StegoGAN: Leveraging Steganography for Non-Bijective Image-to-Image Translation | Mar 29, 2024 | Image-to-Image Translation, Translation | Code Available | 2 |
| Enhancing Taiwanese Hokkien Dual Translation by Exploring and Standardizing of Four Writing Systems | Mar 18, 2024 | Machine Translation, Translation | Code Available | 2 |
| Large Language Models are In-Context Molecule Learners | Mar 7, 2024 | Cross-Modal Retrieval, In-Context Learning | Code Available | 2 |
| TEaR: Improving LLM-based Machine Translation with Systematic Self-Refinement | Feb 26, 2024 | Machine Translation, Translation | Code Available | 2 |
| Centroid-Based Efficient Minimum Bayes Risk Decoding | Feb 17, 2024 | de-en, Translation | Code Available | 2 |
| GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators | Feb 10, 2024 | Machine Translation, Speech-to-Speech Translation | Code Available | 2 |
| With Greater Text Comes Greater Necessity: Inference-Time Training Helps Long Text Generation | Jan 21, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Large Language Models Can Learn Temporal Reasoning | Jan 12, 2024 | Data Augmentation, Diversity | Code Available | 2 |
| LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPT | Oct 7, 2023 | Audio captioning, Automatic Speech Recognition | Code Available | 2 |
| Quantifying the Plausibility of Context Reliance in Neural Machine Translation | Oct 2, 2023 | Machine Translation, Translation | Code Available | 2 |
| A Paradigm Shift in Machine Translation: Boosting Translation Performance of Large Language Models | Sep 20, 2023 | Language Modelling, Machine Translation | Code Available | 2 |
| SeamlessM4T: Massively Multilingual & Multimodal Machine Translation | Aug 22, 2023 | Automatic Speech Recognition, Machine Translation | Code Available | 2 |
| SONAR: Sentence-Level Multimodal and Language-Agnostic Representations | Aug 22, 2023 | Decoder, Machine Translation | Code Available | 2 |
| BayLing: Bridging Cross-lingual Alignment and Instruction Following through Interactive Translation for Large Language Models | Jun 19, 2023 | Instruction Following, Text Generation | Code Available | 2 |
| QCNeXt: A Next-Generation Framework For Joint Multi-Agent Trajectory Prediction | Jun 18, 2023 | Autonomous Driving, Decoder | Code Available | 2 |
| BigTranslate: Augmenting Large Language Models with Multilingual Translation Capability over 100 Languages | May 29, 2023 | Machine Translation, Translation | Code Available | 2 |
| IndicTrans2: Towards High-Quality and Accessible Machine Translation Models for all 22 Scheduled Indian Languages | May 25, 2023 | All, Machine Translation | Code Available | 2 |
| Unpaired Image-to-Image Translation via Neural Schrödinger Bridge | May 24, 2023 | Image-to-Image Translation, Translation | Code Available | 2 |
| HalOmi: A Manually Annotated Benchmark for Multilingual Hallucination and Omission Detection in Machine Translation | May 19, 2023 | Hallucination, Machine Translation | Code Available | 2 |
| Exploring Human-Like Translation Strategy with Large Language Models | May 6, 2023 | Hallucination, Machine Translation | Code Available | 2 |
| StyleAvatar: Real-time Photo-realistic Portrait Avatar from a Single Video | May 1, 2023 | Face Reenactment, Translation | Code Available | 2 |
| Stabilizing Transformer Training by Preventing Attention Entropy Collapse | Mar 11, 2023 | Automatic Speech Recognition, image-classification | Code Available | 2 |
| MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text Translation | Mar 1, 2023 | Audio-Visual Speech Recognition, Robust Speech Recognition | Code Available | 2 |
| Inseq: An Interpretability Toolkit for Sequence Generation Models | Feb 27, 2023 | Decoder, Feature Importance | Code Available | 2 |
| Binarized Neural Machine Translation | Feb 9, 2023 | Binarization, Machine Translation | Code Available | 2 |
| Is ChatGPT A Good Translator? Yes With GPT-4 As The Engine | Jan 20, 2023 | Machine Translation, Sentence | Code Available | 2 |
| A Rotation-Translation-Decoupled Solution for Robust and Efficient Visual-Inertial Initialization | Jan 1, 2023 | Translation | Code Available | 2 |
| BLASER: A Text-Free Speech-to-Speech Translation Evaluation Metric | Dec 16, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 2 |
| Democratizing Neural Machine Translation with OPUS-MT | Dec 4, 2022 | Machine Translation, Translation | Code Available | 2 |
| Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation | Nov 22, 2022 | Image Generation, Image-to-Image Translation | Code Available | 2 |
| Model and Data Transfer for Cross-Lingual Sequence Labelling in Zero-Resource Settings | Oct 23, 2022 | Cross-Lingual NER, Cross-Lingual Transfer | Code Available | 2 |
| RING++: Roto-translation Invariant Gram for Global Localization on a Sparse Scan Map | Oct 12, 2022 | Translation | Code Available | 2 |
| Diffusion-based Image Translation using Disentangled Style and Content Representation | Sep 30, 2022 | Style Transfer, Translation | Code Available | 2 |
| Unsupervised Medical Image Translation with Adversarial Diffusion Models | Jul 17, 2022 | Diversity, Image Generation | Code Available | 2 |
| EGSDE: Unpaired Image-to-Image Translation via Energy-Guided Stochastic Differential Equations | Jul 14, 2022 | Image-to-Image Translation, Translation | Code Available | 2 |
| No Language Left Behind: Scaling Human-Centered Machine Translation | Jul 11, 2022 | Machine Translation, Mixture-of-Experts | Code Available | 2 |
| DCT-Net: Domain-Calibrated Translation for Portrait Stylization | Jul 6, 2022 | Few-Shot Learning, Style Transfer | Code Available | 2 |
| JGLUE: Japanese General Language Understanding Evaluation | Jun 1, 2022 | FLUE, Natural Language Understanding | Code Available | 2 |
| Kernel Neural Optimal Transport | May 30, 2022 | Image-to-Image Translation, Translation | Code Available | 2 |
| Pretraining is All You Need for Image-to-Image Translation | May 25, 2022 | All, Image-to-Image Translation | Code Available | 2 |
| BBDM: Image-to-image Translation with Brownian Bridge Diffusion Models | May 16, 2022 | Image Generation, Image-to-Image Translation | Code Available | 2 |
| READ: Large-Scale Neural Scene Rendering for Autonomous Driving | May 11, 2022 | 3D Scene Reconstruction, Autonomous Driving | Code Available | 2 |
| BCI: Breast Cancer Immunohistochemical Image Generation through Pyramid Pix2pix | Apr 25, 2022 | Breast Cancer Detection, Breast Cancer Histology Image Classification | Code Available | 2 |
| Learning to generate line drawings that convey geometry and semantics | Mar 23, 2022 | Image-to-Image Translation, Translation | Code Available | 2 |
| Dual Diffusion Implicit Bridges for Image-to-Image Translation | Mar 16, 2022 | Image-to-Image Translation, Translation | Code Available | 2 |
| FastFold: Reducing AlphaFold Training Time from 11 Days to 67 Hours | Mar 2, 2022 | Protein Structure Prediction, Translation | Code Available | 2 |