| Automatic music mixing with deep learning and out-of-domain data | Aug 24, 2022 | | CodeCode Available | 1 |
| SHIFT: A Synthetic Driving Dataset for Continuous Multi-Task Domain Adaptation | Jun 16, 2022 | Autonomous DrivingDomain Adaptation | CodeCode Available | 1 |
| Densely Connected Attention Propagation for Reading Comprehension | Nov 10, 2018 | AllOpen-Domain Question Answering | CodeCode Available | 1 |
| Neural Data-Dependent Transform for Learned Image Compression | Mar 9, 2022 | DecoderImage Compression | CodeCode Available | 1 |
| VEM: Environment-Free Exploration for Training GUI Agent with Value Environment Model | Feb 26, 2025 | Reinforcement Learning (RL) | CodeCode Available | 1 |
| A Structured Self-attentive Sentence Embedding | Mar 9, 2017 | Author ProfilingGeneral Classification | CodeCode Available | 1 |
| Remembering Normality: Memory-guided Knowledge Distillation for Unsupervised Anomaly Detection | Jan 1, 2023 | Anomaly DetectionKnowledge Distillation | CodeCode Available | 1 |
| SEEDS: Superpixels Extracted via Energy-Driven Sampling | Sep 16, 2013 | CPUSuperpixels | CodeCode Available | 1 |
| D^3: Scaling Up Deepfake Detection by Learning from Discrepancy | Apr 6, 2024 | DeepFake DetectionFace Swapping | CodeCode Available | 1 |
| Frame- and Segment-Level Features and Candidate Pool Evaluation for Video Caption Generation | Aug 17, 2016 | Caption GenerationDecoder | CodeCode Available | 1 |
| Mixer-TTS: non-autoregressive, fast and compact text-to-speech model conditioned on language model embeddings | Oct 7, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| FlexIT: Towards Flexible Semantic Image Translation | Mar 9, 2022 | Image GenerationTranslation | CodeCode Available | 1 |
| VectorNet: Encoding HD Maps and Agent Dynamics from Vectorized Representation | May 8, 2020 | Graph Neural NetworkSelf-Driving Cars | CodeCode Available | 1 |
| RoboCLIP: One Demonstration is Enough to Learn Robot Policies | Oct 11, 2023 | Imitation Learningreinforcement-learning | CodeCode Available | 1 |
| MMFace4D: A Large-Scale Multi-Modal 4D Face Dataset for Audio-Driven 3D Face Animation | Mar 17, 2023 | 3D Face Animation | CodeCode Available | 1 |
| Music Mood Detection Based On Audio And Lyrics With Deep Neural Net | Sep 19, 2018 | Multimodal Emotion RecognitionMusic Emotion Recognition | CodeCode Available | 1 |
| Paying Attention to Descriptions Generated by Image Captioning Models | Apr 24, 2017 | Image Captioning | CodeCode Available | 1 |
| Extending Logic Explained Networks to Text Classification | Nov 4, 2022 | ClassificationSensitivity | CodeCode Available | 1 |
| DocMAE: Document Image Rectification via Self-supervised Representation Learning | Apr 20, 2023 | Representation LearningSelf-Supervised Learning | CodeCode Available | 1 |
| ThirdEye: Triplet Based Iris Recognition without Normalization | Jul 13, 2019 | Iris RecognitionTriplet | CodeCode Available | 1 |
| Variational Distillation for Multi-View Learning | Jun 20, 2022 | MULTI-VIEW LEARNINGRepresentation Learning | CodeCode Available | 1 |
| Deep Image Homography Estimation | Jun 13, 2016 | Homography Estimation | CodeCode Available | 1 |
| Temporally Coherent Video Harmonization Using Adversarial Networks | Sep 5, 2018 | Video Harmonization | CodeCode Available | 1 |
| Reinforcement Recommendation Reasoning through Knowledge Graphs for Explanation Path Quality | Sep 11, 2022 | DiversityExplainable Recommendation | CodeCode Available | 1 |
| Pareto Dominance Archive and Coordinated Selection Strategy-Based Many-Objective Optimizer for Protein Structure Prediction | Feb 22, 2023 | Evolutionary AlgorithmsProtein Folding | CodeCode Available | 1 |
| Real World Large Scale Recommendation Systems Reproducibility and Smooth Activations | Feb 14, 2022 | Click-Through Rate PredictionRecommendation Systems | CodeCode Available | 1 |
| Multi Visual Modality Fall Detection Dataset | Jun 25, 2022 | Anomaly Detection | CodeCode Available | 1 |
| AIMS.au: A Dataset for the Analysis of Modern Slavery Countermeasures in Corporate Statements | Feb 10, 2025 | Sentence | CodeCode Available | 1 |
| Causality Compensated Attention for Contextual Biased Visual Recognition | Feb 25, 2023 | Multi-Label ClassificationMUlTI-LABEL-ClASSIFICATION | CodeCode Available | 1 |
| Learning to Group Auxiliary Datasets for Molecule | Jul 8, 2023 | | CodeCode Available | 1 |
| Safety-Critical Control with Bounded Inputs via Reduced Order Models | Mar 6, 2023 | | CodeCode Available | 1 |
| On the Transferability of Large-Scale Self-Supervision to Few-Shot Audio Classification | Feb 2, 2024 | Audio ClassificationFew-Shot Audio Classification | CodeCode Available | 1 |
| SLICER: Learning universal audio representations using low-resource self-supervised pre-training | Nov 2, 2022 | Audio ClassificationClustering | CodeCode Available | 1 |
| Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition | Jun 29, 2022 | speech-recognitionSpeech Recognition | CodeCode Available | 1 |
| Vision Transformer for NeRF-Based View Synthesis from a Single Input Image | Jul 12, 2022 | NeRFNovel View Synthesis | CodeCode Available | 1 |
| The Change You Want to See | Sep 28, 2022 | Change DetectionSemantic Segmentation | CodeCode Available | 1 |
| Encryption-Friendly LLM Architecture | Oct 3, 2024 | Privacy Preserving | CodeCode Available | 1 |
| A Comparative Study of Self-supervised Speech Representation Based Voice Conversion | Jul 10, 2022 | Voice Conversion | CodeCode Available | 1 |
| Lane2Seq: Towards Unified Lane Detection via Sequence Generation | Feb 27, 2024 | DecoderLane Detection | CodeCode Available | 1 |
| Object Detection with Deep Reinforcement Learning | Aug 9, 2022 | Active Object LocalizationDeep Reinforcement Learning | CodeCode Available | 1 |
| DHRL: A Graph-Based Approach for Long-Horizon and Sparse Hierarchical Reinforcement Learning | Oct 11, 2022 | Hierarchical Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Fixed Effects and the Generalized Mundlak Estimator | Jul 5, 2018 | regression | CodeCode Available | 1 |
| Detecting Language Model Attacks with Perplexity | Aug 27, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| NDD20: A large-scale few-shot dolphin dataset for coarse and fine-grained categorisation | May 27, 2020 | Instance SegmentationSegmentation | CodeCode Available | 1 |
| Renderable Neural Radiance Map for Visual Navigation | Mar 1, 2023 | DescriptiveVisual Localization | CodeCode Available | 1 |
| Latent Graph Representations for Critical View of Safety Assessment | Dec 8, 2022 | AnatomyGraph Neural Network | CodeCode Available | 1 |
| Graph-Based Stock Recommendation by Time-Aware Relational Attention Network | Feb 1, 2022 | RelationStock Prediction | CodeCode Available | 1 |
| Multi-modal Retrieval Augmented Multi-modal Generation: A Benchmark, Evaluate Metrics and Strong Baselines | Nov 25, 2024 | multimodal generationRAG | CodeCode Available | 1 |
| Vector Quantized Bayesian Neural Network Inference for Data Streams | Jul 12, 2019 | Semantic Segmentation | CodeCode Available | 1 |
| PROSE-FD: A Multimodal PDE Foundation Model for Learning Multiple Operators for Forecasting Fluid Dynamics | Sep 15, 2024 | Operator learningPrediction | CodeCode Available | 1 |