| To Spike or Not To Spike: A Digital Hardware Perspective on Deep Learning Acceleration | Jun 27, 2023 | | Code Available | 2 |
| CLIPA-v2: Scaling CLIP Training with 81.1% Zero-shot ImageNet Accuracy within a \$10,000 Budget; An Extra \$4,000 Unlocks 81.8% Accuracy | Jun 27, 2023 | | Code Available | 2 |
| CellViT: Vision Transformers for Precise Cell Segmentation and Classification | Jun 27, 2023 | Cell Detection, Cell Segmentation | Code Available | 2 |
| PMaF: Deep Declarative Layers for Principal Matrix Features | Jun 26, 2023 | | Code Available | 2 |
| DNABERT-2: Efficient Foundation Model and Benchmark For Multi-Species Genome | Jun 26, 2023 | Computational Efficiency, Core Promoter Detection | Code Available | 2 |
| DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion Models | Jun 26, 2023 | | Code Available | 2 |
| Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning | Jun 26, 2023 | Hallucination, Visual Question Answering | Code Available | 2 |
| RVT: Robotic View Transformer for 3D Object Manipulation | Jun 26, 2023 | Object, Robot Manipulation | Code Available | 2 |
| MedLSAM: Localize and Segment Anything Model for 3D CT Images | Jun 26, 2023 | Image Segmentation, Medical Image Analysis | Code Available | 2 |
| InterCode: Standardizing and Benchmarking Interactive Coding with Execution Feedback | Jun 26, 2023 | Benchmarking, Code Generation | Code Available | 2 |
| H$_2$O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models | Jun 24, 2023 | GPU | Code Available | 2 |
| ToolQA: A Dataset for LLM Question Answering with External Tools | Jun 23, 2023 | Hallucination, Question Answering | Code Available | 2 |
| OpenMask3D: Open-Vocabulary 3D Instance Segmentation | Jun 23, 2023 | 3D Instance Segmentation, 3D Open-Vocabulary Instance Segmentation | Code Available | 2 |
| MME: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models | Jun 23, 2023 | Benchmarking, Language Modeling | Code Available | 2 |
| 3DSAM-adapter: Holistic adaptation of SAM from 2D to 3D for promptable tumor segmentation | Jun 23, 2023 | Image Segmentation, Medical Image Segmentation | Code Available | 2 |
| Maintaining Plasticity in Deep Continual Learning | Jun 23, 2023 | Binary Classification, Continual Learning | Code Available | 2 |
| 3D Reconstruction of Spherical Images based on Incremental Structure from Motion | Jun 22, 2023 | 3D Reconstruction | Code Available | 2 |
| From Word Models to World Models: Translating from Natural Language to the Probabilistic Language of Thought | Jun 22, 2023 | Bayesian Inference, Probabilistic Programming | Code Available | 2 |
| Wind Noise Reduction with a Diffusion-based Stochastic Regeneration Model | Jun 22, 2023 | | Code Available | 2 |
| SoftGPT: Learn Goal-oriented Soft Object Manipulation Skills by Generative Pre-trained Heterogeneous Graph Transformer | Jun 22, 2023 | Object | Code Available | 2 |
| PyKoopman: A Python Package for Data-Driven Approximation of the Koopman Operator | Jun 22, 2023 | | Code Available | 2 |
| PromptIR: Prompting for All-in-One Blind Image Restoration | Jun 22, 2023 | All, Blind All-in-One Image Restoration | Code Available | 2 |
| Visual Adversarial Examples Jailbreak Aligned Large Language Models | Jun 22, 2023 | | Code Available | 2 |
| OBELICS: An Open Web-Scale Filtered Dataset of Interleaved Image-Text Documents | Jun 21, 2023 | MMR total | Code Available | 2 |
| EquiformerV2: Improved Equivariant Transformer for Scaling to Higher-Degree Representations | Jun 21, 2023 | Graph Property Prediction | Code Available | 2 |
| SituatedGen: Incorporating Geographical and Temporal Contexts into Generative Commonsense Reasoning | Jun 21, 2023 | Sentence, Text Generation | Code Available | 2 |
| SkyGPT: Probabilistic Short-term Solar Forecasting Using Synthetic Sky Videos from Physics-constrained VideoGPT | Jun 20, 2023 | Prediction, Video Prediction | Code Available | 2 |
| RoMe: Towards Large Scale Road Surface Reconstruction via Mesh Representation | Jun 20, 2023 | Autonomous Driving, Computational Efficiency | Code Available | 2 |
| PyRCA: A Library for Metric-based Root Cause Analysis | Jun 20, 2023 | Causal Discovery, graph construction | Code Available | 2 |
| RS5M and GeoRSCLIP: A Large Scale Vision-Language Dataset and A Large Vision-Language Model for Remote Sensing | Jun 20, 2023 | Cross-Modal Retrieval, Image Retrieval | Code Available | 2 |
| Multi-Fidelity Active Learning with GFlowNets | Jun 20, 2023 | Active Learning, Bayesian Optimization | Code Available | 2 |
| A Simple and Effective Pruning Approach for Large Language Models | Jun 20, 2023 | Network Pruning | Code Available | 2 |
| LVM-Med: Learning Large-Scale Self-Supervised Vision Models for Medical Imaging via Second-order Graph Matching | Jun 20, 2023 | Brain Tumor Classification, Contrastive Learning | Code Available | 2 |
| Maximum Entropy Heterogeneous-Agent Reinforcement Learning | Jun 19, 2023 | MuJoCo, Multi-agent Reinforcement Learning | Code Available | 2 |
| SGFormer: Simplifying and Empowering Transformers for Large-Graph Representations | Jun 19, 2023 | Node Property Prediction, Philosophy | Code Available | 2 |
| BayLing: Bridging Cross-lingual Alignment and Instruction Following through Interactive Translation for Large Language Models | Jun 19, 2023 | Instruction Following, Text Generation | Code Available | 2 |
| RemoteCLIP: A Vision Language Foundation Model for Remote Sensing | Jun 19, 2023 | Classification, Cross-Modal Retrieval | Code Available | 2 |
| OpenP5: An Open-Source Platform for Developing, Training, and Evaluating LLM-based Recommender Systems | Jun 19, 2023 | Benchmarking, Decoder | Code Available | 2 |
| Guiding Language Models of Code with Global Context using Monitors | Jun 19, 2023 | Code Completion, Code Generation | Code Available | 2 |
| QCNeXt: A Next-Generation Framework For Joint Multi-Agent Trajectory Prediction | Jun 18, 2023 | Autonomous Driving, Decoder | Code Available | 2 |
| MachMap: End-to-End Vectorized Solution for Compact HD-Map Construction | Jun 17, 2023 | Autonomous Driving, Decoder | Code Available | 2 |
| DCdetector: Dual Attention Contrastive Representation Learning for Time Series Anomaly Detection | Jun 17, 2023 | Anomaly Detection, Contrastive Learning | Code Available | 2 |
| MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing | Jun 16, 2023 | Image Editing | Code Available | 2 |
| MedFMC: A Real-world Dataset and Benchmark For Foundation Model Adaptation in Medical Image Classification | Jun 16, 2023 | Diabetic Retinopathy Grading, image-classification | Code Available | 2 |
| End-to-End Vectorized HD-map Construction with Piecewise Bezier Curve | Jun 16, 2023 | 3D geometry, Autonomous Driving | Code Available | 2 |
| Jumanji: a Diverse Suite of Scalable Reinforcement Learning Environments in JAX | Jun 16, 2023 | Decision Making, reinforcement-learning | Code Available | 2 |
| Full Parameter Fine-tuning for Large Language Models with Limited Resources | Jun 16, 2023 | GPU, parameter-efficient fine-tuning | Code Available | 2 |
| Self-Supervised Learning for Time Series Analysis: Taxonomy, Progress, and Prospects | Jun 16, 2023 | Anomaly Detection, Self-Supervised Learning | Code Available | 2 |
| The 1st-place Solution for CVPR 2023 OpenLane Topology in Autonomous Driving Challenge | Jun 16, 2023 | Autonomous Driving | Code Available | 2 |
| RED^FM: a Filtered and Multilingual Relation Extraction Dataset | Jun 16, 2023 | Relation, Relation Extraction | Code Available | 2 |