| Implicit Identity Representation Conditioned Memory Compensation Network for Talking Head video Generation | Jul 19, 2023 | Talking Head GenerationVideo Generation | CodeCode Available | 2 |
| DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection for Conversational AI | Jul 19, 2023 | Conversational RecommendationDiversity | CodeCode Available | 2 |
| FABRIC: Personalizing Diffusion Models with Iterative Feedback | Jul 19, 2023 | | CodeCode Available | 2 |
| DNA-Rendering: A Diverse Neural Actor Repository for High-Fidelity Human-centric Rendering | Jul 19, 2023 | Camera CalibrationNovel View Synthesis | CodeCode Available | 2 |
| Android in the Wild: A Large-Scale Dataset for Android Device Control | Jul 19, 2023 | | CodeCode Available | 2 |
| Benchmarking Potential Based Rewards for Learning Humanoid Locomotion | Jul 19, 2023 | BenchmarkingReinforcement Learning (RL) | CodeCode Available | 2 |
| Using the IBM Analog In-Memory Hardware Acceleration Kit for Neural Network Training and Inference | Jul 18, 2023 | | CodeCode Available | 2 |
| A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future | Jul 18, 2023 | Knowledge Distillationobject-detection | CodeCode Available | 2 |
| Conformal prediction under ambiguous ground truth | Jul 18, 2023 | Conformal PredictionPrediction | CodeCode Available | 2 |
| Flow Matching in Latent Space | Jul 17, 2023 | Computational EfficiencyImage Generation | CodeCode Available | 2 |
| Dynamic Snake Convolution based on Topological Geometric Constraints for Tubular Structure Segmentation | Jul 17, 2023 | SegmentationSpecificity | CodeCode Available | 2 |
| DOT: A Distillation-Oriented Trainer | Jul 17, 2023 | Knowledge Distillation | CodeCode Available | 2 |
| BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs | Jul 17, 2023 | Instruction FollowingSentence | CodeCode Available | 2 |
| EGE-UNet: an Efficient Group Enhanced UNet for skin lesion segmentation | Jul 17, 2023 | DecoderImage Segmentation | CodeCode Available | 2 |
| Revisiting Scene Text Recognition: A Data Perspective | Jul 17, 2023 | Scene Text Recognition | CodeCode Available | 2 |
| Scale-Aware Modulation Meet Transformer | Jul 17, 2023 | object-detectionObject Detection | CodeCode Available | 2 |
| Planting a SEED of Vision in Large Language Model | Jul 16, 2023 | Image GenerationImage to text | CodeCode Available | 2 |
| A Comprehensive Survey of Forgetting in Deep Learning Beyond Continual Learning | Jul 16, 2023 | Continual LearningFederated Learning | CodeCode Available | 2 |
| Disco-Bench: A Discourse-Aware Evaluation Benchmark for Language Modelling | Jul 16, 2023 | DiagnosticLanguage Modelling | CodeCode Available | 2 |
| EasyTPP: Towards Open Benchmarking Temporal Point Processes | Jul 16, 2023 | BenchmarkingPoint Processes | CodeCode Available | 2 |
| Think-on-Graph: Deep and Responsible Reasoning of Large Language Model on Knowledge Graph | Jul 15, 2023 | HallucinationKnowledge Graphs | CodeCode Available | 2 |
| Drive Like a Human: Rethinking Autonomous Driving with Large Language Models | Jul 14, 2023 | Autonomous DrivingCommon Sense Reasoning | CodeCode Available | 2 |
| A Dynamic Points Removal Benchmark in Point Cloud Maps | Jul 14, 2023 | BenchmarkingDynamic Point Removal | CodeCode Available | 2 |
| Self-regulating Prompts: Foundational Model Adaptation without Forgetting | Jul 13, 2023 | Diversitymodel | CodeCode Available | 2 |
| Machine-learned molecular mechanics force field for the simulation of protein-ligand systems and beyond | Jul 13, 2023 | Drug DesignDrug Discovery | CodeCode Available | 2 |
| LimSim: A Long-term Interactive Multi-scenario Traffic Simulator | Jul 13, 2023 | Autonomous Driving | CodeCode Available | 2 |
| Animate-A-Story: Storytelling with Retrieval-Augmented Video Generation | Jul 13, 2023 | RetrievalVideo Generation | CodeCode Available | 2 |
| Improving 2D Human Pose Estimation in Rare Camera Views with Synthetic Data | Jul 13, 2023 | 2D Human Pose EstimationPose Estimation | CodeCode Available | 2 |
| Generating Benchmarks for Factuality Evaluation of Language Models | Jul 13, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| VoxPoser: Composable 3D Value Maps for Robotic Manipulation with Language Models | Jul 12, 2023 | FormLanguage Modelling | CodeCode Available | 2 |
| T2I-CompBench: A Comprehensive Benchmark for Open-world Compositional Text-to-image Generation | Jul 12, 2023 | AttributeImage Generation | CodeCode Available | 2 |
| BiRP: Learning Robot Generalized Bimanual Coordination using Relative Parameterization Method on Human Demonstration | Jul 12, 2023 | Data Augmentation | CodeCode Available | 2 |
| Global birdsong embeddings enable superior transfer learning for bioacoustic classification | Jul 12, 2023 | Audio ClassificationDecision Making | CodeCode Available | 2 |
| balance -- a Python package for balancing biased data samples | Jul 12, 2023 | | CodeCode Available | 2 |
| An Open-Source Knowledge Graph Ecosystem for the Life Sciences | Jul 11, 2023 | Knowledge Graphs | CodeCode Available | 2 |
| Differentiable Forward Projector for X-ray Computed Tomography | Jul 11, 2023 | CT ReconstructionDeep Learning | CodeCode Available | 2 |
| PIGEON: Predicting Image Geolocations | Jul 11, 2023 | Photo geolocation estimation | CodeCode Available | 2 |
| AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning | Jul 10, 2023 | Image Animation | CodeCode Available | 2 |
| Recent Advancements in End-to-End Autonomous Driving using Deep Learning: A Survey | Jul 10, 2023 | Autonomous Driving | CodeCode Available | 2 |
| MiVOLO: Multi-input Transformer for Age and Gender Estimation | Jul 10, 2023 | Age And Gender ClassificationAge and Gender Estimation | CodeCode Available | 2 |
| AmadeusGPT: a natural language interface for interactive animal behavioral analysis | Jul 10, 2023 | Descriptive | CodeCode Available | 2 |
| InPars Toolkit: A Unified and Reproducible Synthetic Data Generation Pipeline for Neural Information Retrieval | Jul 10, 2023 | GPUInformation Retrieval | CodeCode Available | 2 |
| FreeDrag: Feature Dragging for Reliable Point-based Image Editing | Jul 10, 2023 | Point Tracking | CodeCode Available | 2 |
| RoCo: Dialectic Multi-Robot Collaboration with Large Language Models | Jul 10, 2023 | Trajectory Planning | CodeCode Available | 2 |
| VampNet: Music Generation via Masked Acoustic Token Modeling | Jul 10, 2023 | Music CompressionMusic Generation | CodeCode Available | 2 |
| Cross-modal Orthogonal High-rank Augmentation for RGB-Event Transformer-trackers | Jul 9, 2023 | Object Tracking | CodeCode Available | 2 |
| GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest | Jul 7, 2023 | AttributeCommon Sense Reasoning | CodeCode Available | 2 |
| When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment | Jul 7, 2023 | Reinforcement Learning (RL) | CodeCode Available | 2 |
| Exploring the Potential of Large Language Models (LLMs) in Learning on Graphs | Jul 7, 2023 | General KnowledgeNode Classification | CodeCode Available | 2 |
| A Survey on Graph Neural Networks for Time Series: Forecasting, Classification, Imputation, and Anomaly Detection | Jul 7, 2023 | Anomaly DetectionImputation | CodeCode Available | 2 |