| SparseFusion: Fusing Multi-Modal Sparse Representations for Multi-Sensor 3D Object Detection | Apr 27, 2023 | 3D Object Detection, object-detection | Code Available | 2 |
| JaxPruner: A concise library for sparsity research | Apr 27, 2023 | | Code Available | 2 |
| DataComp: In search of the next generation of multimodal datasets | Apr 27, 2023 | | Code Available | 2 |
| Lightweight, Pre-trained Transformers for Remote Sensing Timeseries | Apr 27, 2023 | Crop Classification, Self-Supervised Learning | Code Available | 2 |
| string2string: A Modern Python Library for String-to-String Algorithms | Apr 27, 2023 | | Code Available | 2 |
| EasyPortrait -- Face Parsing and Portrait Segmentation Dataset | Apr 26, 2023 | Diversity, Domain Generalization | Code Available | 2 |
| OpenBox: A Python Toolkit for Generalized Black-box Optimization | Apr 26, 2023 | Experimental Design | Code Available | 2 |
| Customized Segment Anything Model for Medical Image Segmentation | Apr 26, 2023 | Decoder, Image Segmentation | Code Available | 2 |
| Source-Filter-Based Generative Adversarial Neural Vocoder for High Fidelity Speech Synthesis | Apr 26, 2023 | Speech Synthesis, text-to-speech | Code Available | 2 |
| Domain Adaptive and Generalizable Network Architectures and Training Strategies for Semantic Image Segmentation | Apr 26, 2023 | Domain Adaptation, Domain Generalization | Code Available | 2 |
| CompletionFormer: Depth Completion with Convolutions and Vision Transformers | Apr 25, 2023 | Depth Completion, Depth Estimation | Code Available | 2 |
| A Strong and Reproducible Object Detector with Only Public Datasets | Apr 25, 2023 | object-detection, Object Detection | Code Available | 2 |
| TensoIR: Tensorial Inverse Rendering | Apr 24, 2023 | Inverse Rendering, Novel View Synthesis | Code Available | 2 |
| Total-Recon: Deformable Scene Reconstruction for Embodied View Synthesis | Apr 24, 2023 | | Code Available | 2 |
| Unlocking Context Constraints of LLMs: Enhancing Context Efficiency of LLMs with Self-Information-Based Content Filtering | Apr 24, 2023 | Articles, Question Answering | Code Available | 2 |
| Renate: A Library for Real-World Continual Learning | Apr 24, 2023 | Continual Learning | Code Available | 2 |
| Towards Realistic Generative 3D Face Models | Apr 24, 2023 | 3D Face Reconstruction, Face Model | Code Available | 2 |
| Efficient Training of Deep Equilibrium Models | Apr 23, 2023 | | Code Available | 2 |
| LLM+P: Empowering Large Language Models with Optimal Planning Proficiency | Apr 22, 2023 | Zero-shot Generalization | Code Available | 2 |
| PyTorch FSDP: Experiences on Scaling Fully Sharded Data Parallel | Apr 21, 2023 | | Code Available | 2 |
| DIN-SQL: Decomposed In-Context Learning of Text-to-SQL with Self-Correction | Apr 21, 2023 | In-Context Learning, Text to SQL | Code Available | 2 |
| SILVR: Guided Diffusion for Molecule Generation | Apr 21, 2023 | Drug Design | Code Available | 2 |
| Radar-Camera Fusion for Object Detection and Semantic Segmentation in Autonomous Driving: A Comprehensive Review | Apr 20, 2023 | Autonomous Driving, Autonomous Vehicles | Code Available | 2 |
| Nerfbusters: Removing Ghostly Artifacts from Casually Captured NeRFs | Apr 20, 2023 | NeRF, Novel View Synthesis | Code Available | 2 |
| GPT-NER: Named Entity Recognition via Large Language Models | Apr 20, 2023 | Hallucination, named-entity-recognition | Code Available | 2 |
| Collaborative Diffusion for Multi-Modal Face Generation and Editing | Apr 20, 2023 | Denoising, Face Generation | Code Available | 2 |
| OpenLane-V2: A Topology Reasoning Benchmark for Unified 3D HD Mapping | Apr 20, 2023 | 3D Lane Detection, Autonomous Vehicles | Code Available | 2 |
| Scaling the leading accuracy of deep equivariant models to biomolecular simulations of realistic size | Apr 20, 2023 | GPU | Code Available | 2 |
| Architectures of Topological Deep Learning: A Survey of Message-Passing Topological Neural Networks | Apr 20, 2023 | | Code Available | 2 |
| Omni Aggregation Networks for Lightweight Image Super-Resolution | Apr 20, 2023 | Image Super-Resolution, Super-Resolution | Code Available | 2 |
| AMT: All-Pairs Multi-Field Transforms for Efficient Frame Interpolation | Apr 19, 2023 | All, Video Frame Interpolation | Code Available | 2 |
| Transformer-Based Visual Segmentation: A Survey | Apr 19, 2023 | Autonomous Driving, Point Cloud Segmentation | Code Available | 2 |
| GeneGPT: Augmenting Large Language Models with Domain Tools for Improved Access to Biomedical Information | Apr 19, 2023 | In-Context Learning, Retrieval | Code Available | 2 |
| Progressive-Hint Prompting Improves Reasoning in Large Language Models | Apr 19, 2023 | Arithmetic Reasoning, GSM8K | Code Available | 2 |
| Enhancing Multi-Camera People Tracking with Anchor-Guided Clustering and Spatio-Temporal Consistency ID Re-Assignment | Apr 19, 2023 | Multiple People Tracking | Code Available | 2 |
| Is ChatGPT Good at Search? Investigating Large Language Models as Re-Ranking Agents | Apr 19, 2023 | Information Retrieval, Passage Ranking | Code Available | 2 |
| Language Models Enable Simple Systems for Generating Structured Views of Heterogeneous Data Lakes | Apr 19, 2023 | | Code Available | 2 |
| Heterogeneous-Agent Reinforcement Learning | Apr 19, 2023 | LEMMA, Multi-agent Reinforcement Learning | Code Available | 2 |
| Tetra-NeRF: Representing Neural Radiance Fields Using Tetrahedra | Apr 19, 2023 | 3D geometry, 3D Reconstruction | Code Available | 2 |
| Scaling Transformer to 1M tokens and beyond with RMT | Apr 19, 2023 | Language Modeling, Language Modelling | Code Available | 2 |
| VMA: Divide-and-Conquer Vectorized Map Annotation System for Large-Scale Driving Scene | Apr 19, 2023 | Autonomous Driving | Code Available | 2 |
| MER 2023: Multi-label Learning, Modality Robustness, and Semi-Supervised Learning | Apr 18, 2023 | Emotion Recognition, Multi-Label Learning | Code Available | 2 |
| NaturalSpeech 2: Latent Diffusion Models are Natural and Zero-Shot Speech and Singing Synthesizers | Apr 18, 2023 | In-Context Learning, Speech Synthesis | Code Available | 2 |
| Deep Unrestricted Document Image Rectification | Apr 18, 2023 | Local Distortion | Code Available | 2 |
| Interactive and Explainable Region-guided Radiology Report Generation | Apr 17, 2023 | Medical Report Generation | Code Available | 2 |
| Refusion: Enabling Large-Size Realistic Image Restoration with Latent-Space Diffusion Models | Apr 17, 2023 | Bokeh Effect Rendering, Denoising | Code Available | 2 |
| Text2Performer: Text-Driven Human Video Generation | Apr 17, 2023 | Video Generation | Code Available | 2 |
| LongForm: Effective Instruction Tuning with Reverse Instructions | Apr 17, 2023 | Long Form Question Answering, News Generation | Code Available | 2 |
| Learning to Compress Prompts with Gist Tokens | Apr 17, 2023 | Decoder | Code Available | 2 |
| MasaCtrl: Tuning-Free Mutual Self-Attention Control for Consistent Image Synthesis and Editing | Apr 17, 2023 | Image Generation, Text-based Image Editing | Code Available | 2 |