| Vulnerability Detection with Code Language Models: How Far Are We? | Mar 27, 2024 | Vulnerability Detection | CodeCode Available | 3 |
| ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation | Mar 27, 2024 | Depth EstimationDepth Prediction | CodeCode Available | 3 |
| A Python library for efficient computation of molecular fingerprints | Mar 27, 2024 | Drug DiscoveryMolecular Property Prediction | CodeCode Available | 3 |
| skscope: Fast Sparsity-Constrained Optimization in Python | Mar 27, 2024 | | CodeCode Available | 3 |
| Learning Inclusion Matching for Animation Paint Bucket Colorization | Mar 27, 2024 | Colorization | CodeCode Available | 3 |
| PhoWhisper: Automatic Speech Recognition for Vietnamese | Mar 27, 2024 | Automatic Speech Recognitionspeech-recognition | CodeCode Available | 3 |
| PlainMamba: Improving Non-Hierarchical Mamba in Visual Recognition | Mar 26, 2024 | Image ClassificationInstance Segmentation | CodeCode Available | 3 |
| SLEDGE: Synthesizing Driving Environments with Generative Models and Rule-Based Traffic | Mar 26, 2024 | Motion Planning | CodeCode Available | 3 |
| PCToolkit: A Unified Plug-and-Play Prompt Compression Toolkit of Large Language Models | Mar 26, 2024 | Code CompletionFew-Shot Learning | CodeCode Available | 3 |
| Self-Rectifying Diffusion Sampling with Perturbed-Attention Guidance | Mar 26, 2024 | DeblurringDenoising | CodeCode Available | 3 |
| Developing Generalist Foundation Models from a Multimodal Dataset for 3D Computed Tomography | Mar 26, 2024 | Anomaly DetectionLarge Language Model | CodeCode Available | 3 |
| The Unreasonable Ineffectiveness of the Deeper Layers | Mar 26, 2024 | GPUQuantization | CodeCode Available | 3 |
| AgentStudio: A Toolkit for Building General Virtual Agents | Mar 26, 2024 | Visual Grounding | CodeCode Available | 3 |
| Segment Any Medical Model Extended | Mar 26, 2024 | Data AugmentationImage Segmentation | CodeCode Available | 3 |
| AiOS: All-in-One-Stage Expressive Human Pose and Shape Estimation | Mar 26, 2024 | 3D Multi-Person Mesh RecoveryAll | CodeCode Available | 3 |
| PathoTune: Adapting Visual Foundation Model to Pathological Specialists | Mar 25, 2024 | model | CodeCode Available | 3 |
| Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought Reasoning | Mar 25, 2024 | Visual Question Answering (VQA) | CodeCode Available | 3 |
| Benchmarks and Challenges in Pose Estimation for Egocentric Hand Interactions with Objects | Mar 25, 2024 | Action RecognitionMotion Generation | CodeCode Available | 3 |
| FlashFace: Human Image Personalization with High-fidelity Identity Preservation | Mar 25, 2024 | Face SwappingImage Generation | CodeCode Available | 3 |
| Producing and Leveraging Online Map Uncertainty in Trajectory Prediction | Mar 25, 2024 | Autonomous DrivingPrediction | CodeCode Available | 3 |
| Multiple Object Tracking as ID Prediction | Mar 25, 2024 | Multi-Object TrackingMultiple Object Tracking | CodeCode Available | 3 |
| Make-Your-Anchor: A Diffusion-based 2D Avatar Generation Framework | Mar 25, 2024 | Denoising | CodeCode Available | 3 |
| RCBEVDet: Radar-camera Fusion in Bird's Eye View for 3D Object Detection | Mar 25, 2024 | 3D Object Detection3D Object Detection (RoI) | CodeCode Available | 3 |
| Salience DETR: Enhancing Detection Transformer with Hierarchical Salience Filtering Refinement | Mar 24, 2024 | 2D Object DetectionComputational Efficiency | CodeCode Available | 3 |
| Segment Anything Model for Road Network Graph Extraction | Mar 24, 2024 | Graph LearningGraph Neural Network | CodeCode Available | 3 |
| UniTraj: A Unified Framework for Scalable Vehicle Trajectory Prediction | Mar 22, 2024 | DiversityPrediction | CodeCode Available | 3 |
| SiMBA: Simplified Mamba-Based Architecture for Vision and Multivariate Time series | Mar 22, 2024 | Inductive BiasMamba | CodeCode Available | 3 |
| Fundus: A Simple-to-Use News Scraper Optimized for High Quality Extractions | Mar 22, 2024 | Articles | CodeCode Available | 3 |
| ThemeStation: Generating Theme-Aware 3D Assets from Few Exemplars | Mar 22, 2024 | 3D GenerationDiversity | CodeCode Available | 3 |
| IS-Fusion: Instance-Scene Collaborative Fusion for Multimodal 3D Object Detection | Mar 22, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 3 |
| AdaIR: Adaptive All-in-One Image Restoration via Frequency Mining and Modulation | Mar 21, 2024 | AllBlind All-in-One Image Restoration | CodeCode Available | 3 |
| Physics-Informed Diffusion Models | Mar 21, 2024 | Denoising | CodeCode Available | 3 |
| A Survey of Neural Code Intelligence: Paradigms, Advances and Beyond | Mar 21, 2024 | Survey | CodeCode Available | 3 |
| Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference | Mar 21, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| The Elements of Differentiable Programming | Mar 21, 2024 | | CodeCode Available | 3 |
| Implicit Style-Content Separation using B-LoRA | Mar 21, 2024 | Image StylizationStyle Transfer | CodeCode Available | 3 |
| PSALM: Pixelwise SegmentAtion with Large Multi-Modal Model | Mar 21, 2024 | DecoderGeneralized Referring Expression Segmentation | CodeCode Available | 3 |
| Adaptive-RAG: Learning to Adapt Retrieval-Augmented Large Language Models through Question Complexity | Mar 21, 2024 | Question AnsweringRAG | CodeCode Available | 3 |
| Multimodal-Conditioned Latent Diffusion Models for Fashion Image Editing | Mar 21, 2024 | DenoisingVirtual Try-on | CodeCode Available | 3 |
| Deep Learning for Trajectory Data Management and Mining: A Survey and Beyond | Mar 21, 2024 | Anomaly DetectionDeep Learning | CodeCode Available | 3 |
| HAC: Hash-grid Assisted Context for 3D Gaussian Splatting Compression | Mar 21, 2024 | 3DGSAttribute | CodeCode Available | 3 |
| Mini-Splatting: Representing Scenes with a Constrained Number of Gaussians | Mar 21, 2024 | Binarization | CodeCode Available | 3 |
| ReNoise: Real Image Inversion Through Iterative Noising | Mar 21, 2024 | DenoisingImage Manipulation | CodeCode Available | 3 |
| DesignEdit: Multi-Layered Latent Decomposition and Fusion for Unified & Accurate Image Editing | Mar 21, 2024 | Image Generationspatial-aware image editing | CodeCode Available | 3 |
| Emergent World Models and Latent Variable Estimation in Chess-Playing Language Models | Mar 21, 2024 | | CodeCode Available | 3 |
| Scaling Diffusion Models to Real-World 3D LiDAR Scene Completion | Mar 20, 2024 | Autonomous VehiclesDenoising | CodeCode Available | 3 |
| Gaussian Splatting on the Move: Blur and Rolling Shutter Compensation for Natural Camera Motion | Mar 20, 2024 | 3DGSNovel View Synthesis | CodeCode Available | 3 |
| DBA-Fusion: Tightly Integrating Deep Dense Visual Bundle Adjustment with Multiple Sensors for Large-Scale Localization and Mapping | Mar 20, 2024 | Optical Flow EstimationSensor Fusion | CodeCode Available | 3 |
| Rotary Position Embedding for Vision Transformer | Mar 20, 2024 | Position | CodeCode Available | 3 |
| Declarative generation of RDF-star graphs from heterogeneous data | Mar 20, 2024 | Data IntegrationKnowledge Graphs | CodeCode Available | 3 |