| HARDVS: Revisiting Human Activity Recognition with Dynamic Vision Sensors | Nov 17, 2022 | Activity Prediction, Activity Recognition | Code Available | 3 |
| Unifying Flow, Stereo and Depth Estimation | Nov 10, 2022 | Depth Estimation, Optical Flow Estimation | Code Available | 3 |
| OneFormer: One Transformer to Rule Universal Image Segmentation | Nov 10, 2022 | Instance Segmentation, Panoptic Segmentation | Code Available | 3 |
| TAP-Vid: A Benchmark for Tracking Any Point in a Video | Nov 7, 2022 | Optical Flow Estimation, Point Tracking | Code Available | 3 |
| Large Language Models Are Human-Level Prompt Engineers | Nov 3, 2022 | Few-Shot Learning, In-Context Learning | Code Available | 3 |
| Pangu-Weather: A 3D High-Resolution Model for Fast and Accurate Global Weather Forecast | Nov 3, 2022 | | Code Available | 3 |
| MedSegDiff: Medical Image Segmentation with Diffusion Probabilistic Model | Nov 1, 2022 | Anomaly Detection, Brain Tumor Segmentation | Code Available | 3 |
| Delay-penalized transducer for low-latency streaming ASR | Oct 31, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 3 |
| QuEst: Graph Transformer for Quantum Circuit Reliability Estimation | Oct 30, 2022 | | Code Available | 3 |
| What Language Model to Train if You Have One Million GPU Hours? | Oct 27, 2022 | GPU, Language Modeling | Code Available | 3 |
| Deep Generative Models on 3D Representations: A Survey | Oct 27, 2022 | 3D-Aware Image Synthesis, 3D Shape Generation | Code Available | 3 |
| DiffusionDB: A Large-scale Prompt Gallery Dataset for Text-to-Image Generative Models | Oct 26, 2022 | Diversity, Misinformation | Code Available | 3 |
| MetaFormer Baselines for Vision | Oct 24, 2022 | Domain Generalization, Image Classification | Code Available | 3 |
| NeRF-SLAM: Real-Time Dense Monocular SLAM with Neural Radiance Fields | Oct 24, 2022 | NeRF | Code Available | 3 |
| An Improved RaftStereo Trained with A Mixed Dataset for the Robust Vision Challenge 2022 | Oct 23, 2022 | Stereo Matching | Code Available | 3 |
| Deep Learning in Single-Cell Analysis | Oct 22, 2022 | Cell Segmentation, Deep Learning | Code Available | 3 |
| Scaling Instruction-Finetuned Language Models | Oct 20, 2022 | Coreference Resolution, Cross-Lingual Question Answering | Code Available | 3 |
| Token Merging: Your ViT But Faster | Oct 17, 2022 | Efficient ViTs | Code Available | 3 |
| A GPU-specialized Inference Parameter Server for Large-Scale Deep Recommendation Models | Oct 17, 2022 | CPU, GPU | Code Available | 3 |
| Vision-Language Pre-training: Basics, Recent Advances, and Future Trends | Oct 17, 2022 | Few-Shot Learning, Image Captioning | Code Available | 3 |
| Augmentation-Free Graph Contrastive Learning of Invariant-Discriminative Representations | Oct 15, 2022 | Contrastive Learning, Data Augmentation | Code Available | 3 |
| PDEBENCH: An Extensive Benchmark for Scientific Machine Learning | Oct 13, 2022 | | Code Available | 3 |
| CORL: Research-oriented Deep Offline Reinforcement Learning Library | Oct 13, 2022 | Benchmarking, D4RL | Code Available | 3 |
| MotionBERT: A Unified Perspective on Learning Human Motion Representations | Oct 12, 2022 | 3D Human Pose Estimation, 3D Pose Estimation | Code Available | 3 |
| MARLlib: A Scalable and Efficient Multi-agent Reinforcement Learning Library | Oct 11, 2022 | Multi-agent Reinforcement Learning, reinforcement-learning | Code Available | 3 |
| Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning | Oct 11, 2022 | reinforcement-learning, Reinforcement Learning | Code Available | 3 |
| Discovered Policy Optimisation | Oct 11, 2022 | Ingenuity, Meta-Learning | Code Available | 3 |
| NerfAcc: A General NeRF Acceleration Toolbox | Oct 10, 2022 | NeRF | Code Available | 3 |
| Bird-Eye Transformers for Text Generation Models | Oct 8, 2022 | Attribute, Inductive Bias | Code Available | 3 |
| GNM: A General Navigation Model to Drive Any Robot | Oct 7, 2022 | | Code Available | 3 |
| On Distillation of Guided Diffusion Models | Oct 6, 2022 | Denoising, Image Generation | Code Available | 3 |
| Flow Matching for Generative Modeling | Oct 6, 2022 | Density Estimation, Image Generation | Code Available | 3 |
| DiffDock: Diffusion Steps, Twists, and Turns for Molecular Docking | Oct 4, 2022 | Blind Docking, Drug Design | Code Available | 3 |
| Language Models Are Greedy Reasoners: A Systematic Formal Analysis of Chain-of-Thought | Oct 3, 2022 | Mathematical Reasoning, Question Answering | Code Available | 3 |
| Probabilistic Volumetric Fusion for Dense Monocular SLAM | Oct 3, 2022 | | Code Available | 3 |
| Co-Writing Screenplays and Theatre Scripts with Language Models: An Evaluation by Industry Professionals | Sep 29, 2022 | Text Generation | Code Available | 3 |
| Rectified Flow: A Marginal Preserving Approach to Optimal Transport | Sep 29, 2022 | valid | Code Available | 3 |
| All are Worth Words: A ViT Backbone for Diffusion Models | Sep 25, 2022 | All, Conditional Image Generation | Code Available | 3 |
| Revisiting Image Pyramid Structure for High Resolution Salient Object Detection | Sep 20, 2022 | Dichotomous Image Segmentation, Object Detection | Code Available | 3 |
| OpenFHE: Open-Source Fully Homomorphic Encryption Library | Sep 15, 2022 | | Code Available | 3 |
| OmDet: Large-scale vision-language multi-dataset pre-training with multimodal detection network | Sep 10, 2022 | Continual Learning, Object | Code Available | 3 |
| Flow Straight and Fast: Learning to Generate and Transfer Data with Rectified Flow | Sep 7, 2022 | Domain Adaptation, Image Generation | Code Available | 3 |
| PyTorch Image Quality: Metrics for Image Quality Assessment | Aug 31, 2022 | GPU, Image Quality Assessment | Code Available | 3 |
| SimpleRecon: 3D Reconstruction Without 3D Convolutions | Aug 31, 2022 | 3D Reconstruction, Depth Estimation | Code Available | 3 |
| MapTR: Structured Modeling and Learning for Online Vectorized HD Map Construction | Aug 30, 2022 | 3D Lane Detection, Autonomous Driving | Code Available | 3 |
| Towards Accurate Reconstruction of 3D Scene Shape from A Single Monocular Image | Aug 28, 2022 | Depth Estimation, Depth Prediction | Code Available | 3 |
| Red Teaming Language Models to Reduce Harms: Methods, Scaling Behaviors, and Lessons Learned | Aug 23, 2022 | Language Modelling, Red Teaming | Code Available | 3 |
| Cold Diffusion: Inverting Arbitrary Image Transforms Without Noise | Aug 19, 2022 | Image Restoration, Variational Inference | Code Available | 3 |
| DPA-1: Pretraining of Attention-based Deep Potential Model for Molecular Simulation | Aug 17, 2022 | | Code Available | 3 |
| ROLAND: Graph Learning Framework for Dynamic Graphs | Aug 15, 2022 | Graph Learning, Graph Representation Learning | Code Available | 3 |