| Decentralization and Acceleration Enables Large-Scale Bundle Adjustment | May 11, 2023 | | Code Available | 2 |
| CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model | May 11, 2023 | Denoising, GPU | Code Available | 2 |
| Active Retrieval Augmented Generation | May 11, 2023 | Retrieval, Retrieval-augmented Generation | Code Available | 2 |
| Autonomous GIS: the next-generation AI-powered GIS | May 10, 2023 | Code Generation, Information Retrieval | Code Available | 2 |
| Low-Light Image Enhancement via Structure Modeling and Guidance | May 10, 2023 | Edge Detection, Image Enhancement | Code Available | 2 |
| Think Twice before Driving: Towards Scalable Decoders for End-to-End Autonomous Driving | May 10, 2023 | Autonomous Driving, Bench2Drive | Code Available | 2 |
| Reconstructing Animatable Categories from Videos | May 10, 2023 | 3D Shape Reconstruction from Videos, Dynamic Reconstruction | Code Available | 2 |
| HumanRF: High-Fidelity Neural Radiance Fields for Humans in Motion | May 10, 2023 | Motion Synthesis, Novel View Synthesis | Code Available | 2 |
| DaGAN++: Depth-Aware Generative Adversarial Network for Talking Head Video Generation | May 10, 2023 | 3D geometry, Generative Adversarial Network | Code Available | 2 |
| Towards Building the Federated GPT: Federated Instruction Tuning | May 9, 2023 | Federated Learning | Code Available | 2 |
| Graph Neural Network-based surrogate model for granular flows | May 9, 2023 | Graph Neural Network | Code Available | 2 |
| PET-NeuS: Positional Encoding Tri-Planes for Neural Surfaces | May 9, 2023 | NeRF, Surface Reconstruction | Code Available | 2 |
| FrugalGPT: How to Use Large Language Models While Reducing Cost and Improving Performance | May 9, 2023 | | Code Available | 2 |
| TidyBot: Personalized Robot Assistance with Large Language Models | May 9, 2023 | | Code Available | 2 |
| Recommender Systems with Generative Retrieval | May 8, 2023 | Recommendation Systems, Retrieval | Code Available | 2 |
| Video Object Segmentation in Panoptic Wild Scenes | May 8, 2023 | Object, Semantic Segmentation | Code Available | 2 |
| PillarNeXt: Rethinking Network Designs for 3D Object Detection in LiDAR Point Clouds | May 8, 2023 | 2D Object Detection, 3D Object Detection | Code Available | 2 |
| RelPose++: Recovering 6D Poses from Sparse-view Observations | May 8, 2023 | 3D Reconstruction, Pose Estimation | Code Available | 2 |
| Exploring Human-Like Translation Strategy with Large Language Models | May 6, 2023 | Hallucination, Machine Translation | Code Available | 2 |
| DocDiff: Document Enhancement via Residual Diffusion Models | May 6, 2023 | Deblurring, Denoising | Code Available | 2 |
| ZipIt! Merging Models from Different Tasks without Training | May 4, 2023 | | Code Available | 2 |
| OctFormer: Octree-based Transformers for 3D Point Clouds | May 4, 2023 | 3D Object Detection, 3D Semantic Segmentation | Code Available | 2 |
| SuperNOVA: Design Strategies and Opportunities for Interactive Visualization in Computational Notebooks | May 4, 2023 | | Code Available | 2 |
| Diffusion Explainer: Visual Explanation for Text-to-image Stable Diffusion | May 4, 2023 | Image Generation | Code Available | 2 |
| RetroMAE-2: Duplex Masked Auto-Encoder For Pre-Training Retrieval-Oriented Language Models | May 4, 2023 | Information Retrieval, Open-Domain Question Answering | Code Available | 2 |
| DynamicStereo: Consistent Dynamic Depth from Stereo Videos | May 3, 2023 | | Code Available | 2 |
| SAMRS: Scaling-up Remote Sensing Segmentation Dataset with Segment Anything Model | May 3, 2023 | Instance Segmentation, Object | Code Available | 2 |
| Bicubic++: Slim, Slimmer, Slimmest -- Designing an Industry-Grade Super-Resolution Network | May 3, 2023 | 4k, Image Super-Resolution | Code Available | 2 |
| Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes | May 3, 2023 | | Code Available | 2 |
| pyPESTO: A modular and scalable tool for parameter estimation for dynamic models | May 2, 2023 | parameter estimation, Uncertainty Quantification | Code Available | 2 |
| On Uni-Modal Feature Learning in Supervised Multi-Modal Learning | May 2, 2023 | | Code Available | 2 |
| Pick-a-Pic: An Open Dataset of User Preferences for Text-to-Image Generation | May 2, 2023 | Image Generation, Preference Mapping | Code Available | 2 |
| VPGTrans: Transfer Visual Prompt Generator across LLMs | May 2, 2023 | GPU, Transfer Learning | Code Available | 2 |
| Interpretable Machine Learning for Science with PySR and SymbolicRegression.jl | May 2, 2023 | Interpretable Machine Learning, regression | Code Available | 2 |
| Geometric Latent Diffusion Models for 3D Molecule Generation | May 2, 2023 | 3D Molecule Generation, Unconditional Molecule Generation | Code Available | 2 |
| Huatuo-26M, a Large-scale Chinese Medical QA Dataset | May 2, 2023 | Language Modeling, Language Modelling | Code Available | 2 |
| TMR: Text-to-Motion Retrieval Using Contrastive 3D Human Motion Synthesis | May 2, 2023 | Moment Retrieval, Motion Generation | Code Available | 2 |
| TaskPrompter: Spatial-Channel Multi-Task Prompting for Dense Scene Understanding | May 1, 2023 | 3D Object Detection, Monocular Depth Estimation | Code Available | 2 |
| Bidirectional Copy-Paste for Semi-Supervised Medical Image Segmentation | May 1, 2023 | Image Segmentation, Medical Image Segmentation | Code Available | 2 |
| StyleAvatar: Real-time Photo-realistic Portrait Avatar from a Single Video | May 1, 2023 | Face Reenactment, Translation | Code Available | 2 |
| In-Context Learning Unlocked for Diffusion Models | May 1, 2023 | In-Context Learning, text-guided-image-editing | Code Available | 2 |
| SMILE: Single-turn to Multi-turn Inclusive Language Expansion via ChatGPT for Mental Health Support | Apr 30, 2023 | Chatbot | Code Available | 2 |
| TALLRec: An Effective and Efficient Tuning Framework to Align Large Language Model with Recommendation | Apr 30, 2023 | Domain Generalization, In-Context Learning | Code Available | 2 |
| Enhancing Video Super-Resolution via Implicit Resampling-based Alignment | Apr 29, 2023 | Super-Resolution, Video Super-Resolution | Code Available | 2 |
| NeuralKG-ind: A Python Library for Inductive Knowledge Graph Representation Learning | Apr 28, 2023 | Graph Representation Learning, Knowledge Graphs | Code Available | 2 |
| Causal Reasoning and Large Language Models: Opening a New Frontier for Causality | Apr 28, 2023 | Causal Discovery, Common Sense Reasoning | Code Available | 2 |
| LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions | Apr 27, 2023 | Common Sense Reasoning, Coreference Resolution | Code Available | 2 |
| Co-SLAM: Joint Coordinate and Sparse Parametric Encodings for Neural Real-Time SLAM | Apr 27, 2023 | Surface Reconstruction | Code Available | 2 |
| Occ3D: A Large-Scale 3D Occupancy Prediction Benchmark for Autonomous Driving | Apr 27, 2023 | 3D geometry, Autonomous Driving | Code Available | 2 |
| PMC-LLaMA: Towards Building Open-source Language Models for Medicine | Apr 27, 2023 | Language Modeling, Language Modelling | Code Available | 2 |