| SceneDreamer: Unbounded 3D Scene Generation from 2D Image Collections | Feb 2, 2023 | Scene Generation | CodeCode Available | 2 |
| EdgeYOLO: An Edge-Real-Time Object Detector | Feb 15, 2023 | Data AugmentationEdge-computing | CodeCode Available | 2 |
| DIRE for Diffusion-Generated Image Detection | Mar 16, 2023 | | CodeCode Available | 2 |
| A Dynamic Multi-Scale Voxel Flow Network for Video Prediction | Mar 17, 2023 | Video Prediction | CodeCode Available | 2 |
| Leapfrog Diffusion Model for Stochastic Trajectory Prediction | Mar 20, 2023 | Denoisingmodel | CodeCode Available | 2 |
| Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection | Mar 21, 2023 | 3D Multi-Object Tracking3D Object Detection | CodeCode Available | 2 |
| DATR: Unsupervised Domain Adaptive Detection Transformer with Dataset-Level Adaptation and Prototypical Alignment | May 20, 2024 | Contrastive LearningDomain Adaptation | CodeCode Available | 2 |
| Visualizing Linguistic Diversity of Text Datasets Synthesized by Large Language Models | May 19, 2023 | BenchmarkingDiversity | CodeCode Available | 2 |
| TSGM: A Flexible Framework for Generative Modeling of Synthetic Time Series | May 19, 2023 | DiversitySynthetic Data Generation | CodeCode Available | 2 |
| MAGE: Machine-generated Text Detection in the Wild | May 22, 2023 | Binary text classificationFace Swapping | CodeCode Available | 2 |
| Efficient Multi-Scale Attention Module with Cross-Spatial Learning | May 23, 2023 | Dimensionality Reductionimage-classification | CodeCode Available | 2 |
| Dink-Net: Neural Clustering on Large Graphs | May 28, 2023 | ClusteringGraph Clustering | CodeCode Available | 2 |
| LibFewShot: A Comprehensive Library for Few-shot Learning | Sep 10, 2021 | Data AugmentationFew-Shot Image Classification | CodeCode Available | 2 |
| From Word Models to World Models: Translating from Natural Language to the Probabilistic Language of Thought | Jun 22, 2023 | Bayesian InferenceProbabilistic Programming | CodeCode Available | 2 |
| MedLSAM: Localize and Segment Anything Model for 3D CT Images | Jun 26, 2023 | Image SegmentationMedical Image Analysis | CodeCode Available | 2 |
| BiRP: Learning Robot Generalized Bimanual Coordination using Relative Parameterization Method on Human Demonstration | Jul 12, 2023 | Data Augmentation | CodeCode Available | 2 |
| Beyond Adapting SAM: Towards End-to-End Ultrasound Image Segmentation via Auto Prompting | Sep 13, 2023 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 |
| RaTrack: Moving Object Detection and Tracking with 4D Radar Point Cloud | Sep 18, 2023 | Motion EstimationMotion Segmentation | CodeCode Available | 2 |
| Breaking of brightness consistency in optical flow with a lightweight CNN network | Oct 24, 2023 | CPUOptical Flow Estimation | CodeCode Available | 2 |
| Never Lost in the Middle: Mastering Long-Context Question Answering with Position-Agnostic Decompositional Training | Nov 15, 2023 | Passage RetrievalPosition | CodeCode Available | 2 |
| Acceleration Algorithms in GNNs: A Survey | May 7, 2024 | Graph LearningSurvey | CodeCode Available | 2 |
| War and Peace (WarAgent): Large Language Model-based Multi-Agent Simulation of World Wars | Nov 28, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| HOLD: Category-agnostic 3D Reconstruction of Interacting Hands and Objects from Video | Nov 30, 2023 | 3D ReconstructionObject | CodeCode Available | 2 |
| MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing | Jun 16, 2023 | Image Editing | CodeCode Available | 2 |
| 3D Face Reconstruction with the Geometric Guidance of Facial Part Segmentation | Dec 1, 2023 | 3D Face ReconstructionFace Reconstruction | CodeCode Available | 2 |
| Spatial-Temporal-Decoupled Masked Pre-training for Spatiotemporal Forecasting | Dec 1, 2023 | Time SeriesTraffic Prediction | CodeCode Available | 2 |
| LLMLight: Large Language Models as Traffic Signal Control Agents | Dec 26, 2023 | Decision MakingManagement | CodeCode Available | 2 |
| SVGDreamer: Text Guided SVG Generation with Diffusion Model | Dec 27, 2023 | DiversityVector Graphics | CodeCode Available | 2 |
| ODTrack: Online Dense Temporal Token Learning for Visual Tracking | Jan 3, 2024 | Semi-Supervised Video Object SegmentationVideo Object Tracking | CodeCode Available | 2 |
| SCITUNE: Aligning Large Language Models with Scientific Multimodal Instructions | Jul 3, 2023 | | CodeCode Available | 2 |
| Credence: Augmenting Datacenter Switch Buffer Sharing with ML Predictions | Jan 5, 2024 | | CodeCode Available | 2 |
| SELFIES and the future of molecular string representations | Mar 31, 2022 | valid | CodeCode Available | 2 |
| CascadedGaze: Efficiency in Global Context Extraction for Image Restoration | Jan 26, 2024 | DeblurringDecoder | CodeCode Available | 2 |
| MouSi: Poly-Visual-Expert Vision-Language Models | Jan 30, 2024 | Image SegmentationImage-text matching | CodeCode Available | 2 |
| A Survey on Domain Generalization for Medical Image Analysis | Feb 7, 2024 | Domain GeneralizationMedical Image Analysis | CodeCode Available | 2 |
| Sandwiched Compression: Repurposing Standard Codecs with Neural Network Wrappers | Feb 8, 2024 | Video Compression | CodeCode Available | 2 |
| Get What You Want, Not What You Don't: Image Content Suppression for Text-to-Image Diffusion Models | Feb 8, 2024 | | CodeCode Available | 2 |
| Neural SPH: Improved Neural Modeling of Lagrangian Fluid Dynamics | Feb 9, 2024 | | CodeCode Available | 2 |
| Any-Precision LLM: Low-Cost Deployment of Multiple, Different-Sized LLMs | Feb 16, 2024 | Quantization | CodeCode Available | 2 |
| Ouroboros: Generating Longer Drafts Phrase by Phrase for Faster Speculative Decoding | Feb 21, 2024 | Text Generation | CodeCode Available | 2 |
| Functional Benchmarks for Robust Evaluation of Reasoning Performance, and the Reasoning Gap | Feb 29, 2024 | Math | CodeCode Available | 2 |
| DiffAssemble: A Unified Graph-Diffusion Model for 2D and 3D Reassembly | Feb 29, 2024 | DenoisingGraph Neural Network | CodeCode Available | 2 |
| KG-Rank: Enhancing Large Language Models for Medical QA with Knowledge Graphs and Ranking Techniques | Mar 9, 2024 | Knowledge GraphsLong Form Question Answering | CodeCode Available | 2 |
| Scalable Spatiotemporal Prediction with Bayesian Neural Fields | Mar 12, 2024 | Bayesian InferenceDemand Forecasting | CodeCode Available | 2 |
| BirdSet: A Large-Scale Dataset for Audio Classification in Avian Bioacoustics | Mar 15, 2024 | Audio ClassificationClassification | CodeCode Available | 2 |
| View-decoupled Transformer for Person Re-identification under Aerial-ground Camera Network | Mar 21, 2024 | Person Re-Identification | CodeCode Available | 2 |
| Volumetric Environment Representation for Vision-Language Navigation | Mar 21, 2024 | 3D geometryMulti-Task Learning | CodeCode Available | 2 |
| CoverUp: Effective High Coverage Test Generation for Python | Mar 24, 2024 | software testing | CodeCode Available | 2 |
| MambaDFuse: A Mamba-based Dual-phase Model for Multi-modality Image Fusion | Apr 12, 2024 | Image ReconstructionMamba | CodeCode Available | 2 |
| FreqMamba: Viewing Mamba from a Frequency Perspective for Image Deraining | Apr 15, 2024 | MambaRain Removal | CodeCode Available | 2 |