| Koopman neural operator as a mesh-free solver of non-linear partial differential equations | Jan 24, 2023 | Precipitation Forecasting | CodeCode Available | 2 |
| Exploiting Optical Flow Guidance for Transformer-Based Video Inpainting | Jan 24, 2023 | Optical Flow EstimationVideo Inpainting | CodeCode Available | 2 |
| Revisiting Power Systems Time-domain Simulation Methods and Models | Jan 24, 2023 | | CodeCode Available | 2 |
| On the Expressive Power of Geometric Graph Neural Networks | Jan 23, 2023 | | CodeCode Available | 2 |
| HexPlane: A Fast Representation for Dynamic Scenes | Jan 23, 2023 | NeRFNovel View Synthesis | CodeCode Available | 2 |
| Prediction-Powered Inference | Jan 23, 2023 | AstronomyPrediction | CodeCode Available | 2 |
| DIFFormer: Scalable (Graph) Transformers Induced by Energy Constrained Diffusion | Jan 23, 2023 | Image-text ClassificationNode Classification | CodeCode Available | 2 |
| PrimeQA: The Prime Repository for State-of-the-Art Multilingual Question Answering Research and Development | Jan 23, 2023 | Question AnsweringReading Comprehension | CodeCode Available | 2 |
| Adapting a Language Model While Preserving its General Knowledge | Jan 21, 2023 | Continual LearningGeneral Knowledge | CodeCode Available | 2 |
| Is ChatGPT A Good Translator? Yes With GPT-4 As The Engine | Jan 20, 2023 | Machine TranslationSentence | CodeCode Available | 2 |
| A vision-based autonomous UAV inspection framework for unknown tunnel construction sites with dynamic obstacles | Jan 20, 2023 | Navigate | CodeCode Available | 2 |
| Source-free Subject Adaptation for EEG-based Visual Recognition | Jan 20, 2023 | EEGElectroencephalogram (EEG) | CodeCode Available | 2 |
| PDFormer: Propagation Delay-Aware Dynamic Long-Range Transformer for Traffic Flow Prediction | Jan 19, 2023 | Computational EfficiencyGraph Neural Network | CodeCode Available | 2 |
| Fast-BEV: Towards Real-time On-vehicle Bird's-Eye View Perception | Jan 19, 2023 | Autonomous DrivingData Augmentation | CodeCode Available | 2 |
| Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture | Jan 19, 2023 | Depth EstimationDepth Prediction | CodeCode Available | 2 |
| Multiview Compressive Coding for 3D Reconstruction | Jan 19, 2023 | 3D ReconstructionDecoder | CodeCode Available | 2 |
| Learning-Rate-Free Learning by D-Adaptation | Jan 18, 2023 | | CodeCode Available | 2 |
| Synthcity: facilitating innovative use cases of synthetic data in different data modalities | Jan 18, 2023 | FairnessIrregular Time Series | CodeCode Available | 2 |
| OmniObject3D: Large-Vocabulary 3D Object Dataset for Realistic Perception, Reconstruction and Generation | Jan 18, 2023 | Novel View SynthesisObject | CodeCode Available | 2 |
| Behind the Scenes: Density Fields for Single View Reconstruction | Jan 18, 2023 | Depth EstimationDepth Prediction | CodeCode Available | 2 |
| COVINS-G: A Generic Back-end for Collaborative Visual-Inertial SLAM | Jan 17, 2023 | Pose Estimation | CodeCode Available | 2 |
| Heterogeneous Multi-Robot Reinforcement Learning | Jan 17, 2023 | Graph Neural NetworkMulti-agent Reinforcement Learning | CodeCode Available | 2 |
| DPE: Disentanglement of Pose and Expression for General Video Portrait Editing | Jan 16, 2023 | DisentanglementFace Generation | CodeCode Available | 2 |
| Swarm-SLAM : Sparse Decentralized Collaborative Simultaneous Localization and Mapping Framework for Multi-Robot Systems | Jan 16, 2023 | Simultaneous Localization and Mapping | CodeCode Available | 2 |
| Multimodality Helps Unimodality: Cross-Modal Few-Shot Learning with Multimodal Models | Jan 16, 2023 | Audio ClassificationFew-Shot Learning | CodeCode Available | 2 |
| T2M-GPT: Generating Human Motion from Textual Descriptions with Discrete Representations | Jan 15, 2023 | Motion GenerationMotion Synthesis | CodeCode Available | 2 |
| DSVT: Dynamic Sparse Voxel Transformer with Rotated Sets | Jan 15, 2023 | 3D Object Detectionobject-detection | CodeCode Available | 2 |
| Diffusion-based Generation, Optimization, and Planning in 3D Scenes | Jan 15, 2023 | DenoisingGrasp Generation | CodeCode Available | 2 |
| Discovery of 2D materials using Transformer Network based Generative Design | Jan 14, 2023 | Formation EnergySelf-Learning | CodeCode Available | 2 |
| Desbordante: from benchmarking suite to high-performance science-intensive data profiler (preprint) | Jan 14, 2023 | Benchmarking | CodeCode Available | 2 |
| Mephisto: A Framework for Portable, Reproducible, and Iterative Crowdsourcing | Jan 12, 2023 | | CodeCode Available | 2 |
| DEA-Net: Single image dehazing based on detail-enhanced convolution and content-guided attention | Jan 12, 2023 | Image Dehazing | CodeCode Available | 2 |
| ViTs for SITS: Vision Transformers for Satellite Image Time Series | Jan 12, 2023 | Semantic SegmentationTime Series | CodeCode Available | 2 |
| Wildfire Smoke Detection with Computer Vision | Jan 12, 2023 | Object Detection | CodeCode Available | 2 |
| ImMesh: An Immediate LiDAR Localization and Meshing Framework | Jan 12, 2023 | CPUDimensionality Reduction | CodeCode Available | 2 |
| Tracr: Compiled Transformers as a Laboratory for Interpretability | Jan 12, 2023 | Decoder | CodeCode Available | 2 |
| AdaPoinTr: Diverse Point Cloud Completion with Adaptive Geometry-Aware Transformers | Jan 11, 2023 | DenoisingInductive Bias | CodeCode Available | 2 |
| DiffTalk: Crafting Diffusion Models for Generalized Audio-Driven Portraits Animation | Jan 10, 2023 | DenoisingTalking Head Generation | CodeCode Available | 2 |
| Transformers as Policies for Variable Action Environments | Jan 9, 2023 | | CodeCode Available | 2 |
| Advances in Medical Image Analysis with Vision Transformers: A Comprehensive Review | Jan 9, 2023 | Medical Image Analysis | CodeCode Available | 2 |
| Generative Time Series Forecasting with Diffusion, Denoise, and Disentanglement | Jan 8, 2023 | DenoisingDisentanglement | CodeCode Available | 2 |
| CGI-Stereo: Accurate and Real-Time Stereo Matching via Context and Geometry Interaction | Jan 7, 2023 | Stereo Matching | CodeCode Available | 2 |
| Text2Poster: Laying out Stylized Texts on Retrieved Images | Jan 6, 2023 | Image RetrievalLayout Design | CodeCode Available | 2 |
| CodeTalker: Speech-Driven 3D Facial Animation with Discrete Motion Prior | Jan 6, 2023 | 3D Face Animationregression | CodeCode Available | 2 |
| IMKGA-SM: Interpretable Multimodal Knowledge Graph Answer Prediction via Sequence Modeling | Jan 6, 2023 | Link PredictionOptical Character Recognition | CodeCode Available | 2 |
| Robust Dynamic Radiance Fields | Jan 5, 2023 | | CodeCode Available | 2 |
| HyperReel: High-Fidelity 6-DoF Video with Ray-Conditioned Sampling | Jan 5, 2023 | Novel View SynthesisVocal Bursts Intensity Prediction | CodeCode Available | 2 |
| Towards Long-Term Time-Series Forecasting: Feature, Pattern, and Distribution | Jan 5, 2023 | DecoderTime Series | CodeCode Available | 2 |
| TextDescriptives: A Python package for calculating a large variety of metrics from text | Jan 5, 2023 | | CodeCode Available | 2 |
| Iterated Decomposition: Improving Science Q&A by Supervising Reasoning Processes | Jan 4, 2023 | | CodeCode Available | 2 |