| DexGraspNet: A Large-Scale Robotic Dexterous Grasp Dataset for General Objects Based on Simulation | Oct 6, 2022 | Object | CodeCode Available | 2 |
| VIMA: General Robot Manipulation with Multimodal Prompts | Oct 6, 2022 | Imitation LearningLanguage Modelling | CodeCode Available | 2 |
| Leveraging Instance Features for Label Aggregation in Programmatic Weak Supervision | Oct 6, 2022 | Variational Inference | CodeCode Available | 2 |
| Language Models are Multilingual Chain-of-Thought Reasoners | Oct 6, 2022 | GSM8KMath | CodeCode Available | 2 |
| Binding Language Models in Symbolic Languages | Oct 6, 2022 | Language ModellingSemantic Parsing | CodeCode Available | 2 |
| ByteTransformer: A High-Performance Transformer Boosted for Variable-Length Inputs | Oct 6, 2022 | GPUVocal Bursts Intensity Prediction | CodeCode Available | 2 |
| Real-World Robot Learning with Masked Visual Pre-training | Oct 6, 2022 | | CodeCode Available | 2 |
| Adaptive Ranking-based Sample Selection for Weakly Supervised Class-imbalanced Text Classification | Oct 6, 2022 | text-classificationText Classification | CodeCode Available | 2 |
| Mask3D: Mask Transformer for 3D Semantic Instance Segmentation | Oct 6, 2022 | 3D Instance Segmentation3D Semantic Instance Segmentation | CodeCode Available | 2 |
| Phenaki: Variable Length Video Generation From Open Domain Textual Description | Oct 5, 2022 | DecoderVideo Generation | CodeCode Available | 2 |
| Time Will Tell: New Outlooks and A Baseline for Temporal Multi-View 3D Object Detection | Oct 5, 2022 | 3D Object Detectionobject-detection | CodeCode Available | 2 |
| Ask Me Anything: A simple strategy for prompting language models | Oct 5, 2022 | Coreference ResolutionNatural Language Inference | CodeCode Available | 2 |
| Temporally Consistent Transformers for Video Generation | Oct 5, 2022 | MinecraftVideo Generation | CodeCode Available | 2 |
| DigiFace-1M: 1 Million Digital Face Images for Face Recognition | Oct 5, 2022 | AttributeFace Recognition | CodeCode Available | 2 |
| Centralized Feature Pyramid for Object Detection | Oct 5, 2022 | Objectobject-detection | CodeCode Available | 2 |
| CW-ERM: Improving Autonomous Driving Planning with Closed-loop Weighted Empirical Risk Minimization | Oct 5, 2022 | Autonomous DrivingImitation Learning | CodeCode Available | 2 |
| SHINE-Mapping: Large-Scale 3D Mapping Using Sparse Hierarchical Implicit Neural Representations | Oct 5, 2022 | 3D ReconstructionContinual Learning | CodeCode Available | 2 |
| GMMSeg: Gaussian Mixture based Generative Semantic Segmentation Models | Oct 5, 2022 | Out-of-Distribution DetectionSegmentation | CodeCode Available | 2 |
| One Transformer Can Understand Both 2D & 3D Molecular Data | Oct 4, 2022 | Graph Regressionmolecular representation | CodeCode Available | 2 |
| VICRegL: Self-Supervised Learning of Local Visual Features | Oct 4, 2022 | SegmentationSelf-Supervised Learning | CodeCode Available | 2 |
| Capturing and Animation of Body and Clothing from Monocular Video | Oct 4, 2022 | Virtual Try-on | CodeCode Available | 2 |
| Rhythmic Gesticulator: Rhythm-Aware Co-Speech Gesture Synthesis with Hierarchical Neural Embeddings | Oct 4, 2022 | Gesture GenerationRhythm | CodeCode Available | 2 |
| When and why vision-language models behave like bags-of-words, and what to do about it? | Oct 4, 2022 | Contrastive LearningRetrieval | CodeCode Available | 2 |
| OpBoost: A Vertical Federated Tree Boosting Framework Based on Order-Preserving Desensitization | Oct 4, 2022 | Federated LearningPrivacy Preserving | CodeCode Available | 2 |
| The Long Tail of Context: Does it Exist and Matter? | Oct 3, 2022 | Recommendation Systems | CodeCode Available | 2 |
| rPPG-Toolbox: Deep Remote PPG Toolbox | Oct 3, 2022 | BenchmarkingData Augmentation | CodeCode Available | 2 |
| Omnigrok: Grokking Beyond Algorithmic Data | Oct 3, 2022 | AttributeRepresentation Learning | CodeCode Available | 2 |
| Contrastive Audio-Visual Masked Autoencoder | Oct 2, 2022 | Audio ClassificationAudio Tagging | CodeCode Available | 2 |
| CCTC: A Cross-Sentence Chinese Text Correction Dataset for Native Speakers | Oct 1, 2022 | Grammatical Error CorrectionSentence | CodeCode Available | 2 |
| LambdaKG: A Library for Pre-trained Language Model-Based Knowledge Graph Embeddings | Oct 1, 2022 | Graph Representation LearningKnowledge Graph Completion | CodeCode Available | 2 |
| Multimodal Analogical Reasoning over Knowledge Graphs | Oct 1, 2022 | Graph EmbeddingKnowledge Graph Embedding | CodeCode Available | 2 |
| TabDDPM: Modelling Tabular Data with Diffusion Models | Sep 30, 2022 | Denoising | CodeCode Available | 2 |
| An efficient encoder-decoder architecture with top-down attention for speech separation | Sep 30, 2022 | CPU | CodeCode Available | 2 |
| MobileViTv3: Mobile-Friendly Vision Transformer with Simple and Effective Fusion of Local, Global and Input Features | Sep 30, 2022 | Image Classification | CodeCode Available | 2 |
| Diffusion-based Image Translation using Disentangled Style and Content Representation | Sep 30, 2022 | Style TransferTranslation | CodeCode Available | 2 |
| Equivariant Energy-Guided SDE for Inverse Molecular Design | Sep 30, 2022 | 3D Molecule GenerationDrug Discovery | CodeCode Available | 2 |
| Protein structure generation via folding diffusion | Sep 30, 2022 | DenoisingProtein Structure Prediction | CodeCode Available | 2 |
| State-specific protein-ligand complex structure prediction with a multi-scale deep generative model | Sep 30, 2022 | BenchmarkingBlind Docking | CodeCode Available | 2 |
| Towards Multi-spatiotemporal-scale Generalized PDE Modeling | Sep 30, 2022 | PDE Surrogate Modeling | CodeCode Available | 2 |
| Building Normalizing Flows with Stochastic Interpolants | Sep 30, 2022 | BenchmarkingDensity Estimation | CodeCode Available | 2 |
| Dilated Neighborhood Attention Transformer | Sep 29, 2022 | Image ClassificationInstance Segmentation | CodeCode Available | 2 |
| Spikformer: When Spiking Neural Network Meets Transformer | Sep 29, 2022 | image-classificationImage Classification | CodeCode Available | 2 |
| 3D UX-Net: A Large Kernel Volumetric ConvNet Modernizing Hierarchical Transformer for Medical Image Segmentation | Sep 29, 2022 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 |
| Diffusion Posterior Sampling for General Noisy Inverse Problems | Sep 29, 2022 | DeblurringRetrieval | CodeCode Available | 2 |
| DiGress: Discrete Denoising diffusion for graph generation | Sep 29, 2022 | DenoisingEdge Classification | CodeCode Available | 2 |
| Recipro-CAM: Fast gradient-free visual explanations for convolutional neural networks | Sep 28, 2022 | Explainable Artificial Intelligence (XAI) | CodeCode Available | 2 |
| Multilingual Search with Subword TF-IDF | Sep 28, 2022 | Information RetrievalRetrieval | CodeCode Available | 2 |
| Pareto Actor-Critic for Equilibrium Selection in Multi-Agent Reinforcement Learning | Sep 28, 2022 | Graph Neural NetworkMulti-agent Reinforcement Learning | CodeCode Available | 2 |
| Orbeez-SLAM: A Real-time Monocular Visual SLAM with ORB Features and NeRF-realized Mapping | Sep 27, 2022 | NeRFVisual Odometry | CodeCode Available | 2 |
| SuperYOLO: Super Resolution Assisted Object Detection in Multimodal Remote Sensing Imagery | Sep 27, 2022 | Object DetectionReal-Time Object Detection | CodeCode Available | 2 |