| Graph Neural Networks for Learning Equivariant Representations of Neural Networks | Mar 18, 2024 | | Code Available | 2 | 5 |
| Diversified and Personalized Multi-rater Medical Image Segmentation | Mar 20, 2024 | Image Segmentation, Medical Image Segmentation | Code Available | 2 | 5 |
| A Multimodal Vision Foundation Model for Clinical Dermatology | Oct 19, 2024 | Diagnostic, Lesion Segmentation | Code Available | 2 | 5 |
| Text-IF: Leveraging Semantic Text Guidance for Degradation-Aware and Interactive Image Fusion | Mar 25, 2024 | Decoder | Code Available | 2 | 5 |
| AID: Attention Interpolation of Text-to-Image Diffusion | Mar 26, 2024 | Spatial Interpolation | Code Available | 2 | 5 |
| Efficient Image Pre-Training with Siamese Cropped Masked Autoencoders | Mar 26, 2024 | Object, Self-Supervised Learning | Code Available | 2 | 5 |
| Building Bridges across Spatial and Temporal Resolutions: Reference-Based Super-Resolution via Change Priors and Conditional Diffusion Model | Mar 26, 2024 | Denoising, Reference-based Super-Resolution | Code Available | 2 | 5 |
| Move as You Say, Interact as You Can: Language-guided Human Motion Generation with Scene Affordance | Mar 26, 2024 | Motion Generation, Motion Synthesis | Code Available | 2 | 5 |
| DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs | Mar 28, 2024 | Fine-Grained Image Classification, Image Classification | Code Available | 2 | 5 |
| Infrared Small Target Detection with Scale and Location Sensitivity | Mar 28, 2024 | Sensitivity | Code Available | 2 | 5 |
| Pre-trained Vision and Language Transformers Are Few-Shot Incremental Learners | Apr 2, 2024 | class-incremental learning, Class Incremental Learning | Code Available | 2 | 5 |
| Linear Attention Sequence Parallelism | Apr 3, 2024 | 2k | Code Available | 2 | 5 |
| LeGrad: An Explainability Method for Vision Transformers via Feature Formation Sensitivity | Apr 4, 2024 | Sensitivity | Code Available | 2 | 5 |
| OmniGS: Fast Radiance Field Reconstruction using Omnidirectional Gaussian Splatting | Apr 4, 2024 | GPU | Code Available | 2 | 5 |
| DPFT: Dual Perspective Fusion Transformer for Camera-Radar-based Object Detection | Apr 3, 2024 | Autonomous Vehicles, object-detection | Code Available | 2 | 5 |
| Skeleton Recall Loss for Connectivity Conserving and Resource Efficient Segmentation of Thin Tubular Structures | Apr 3, 2024 | CPU, GPU | Code Available | 2 | 5 |
| RhythmMamba: Fast Remote Physiological Measurement with Arbitrary Length Videos | Apr 9, 2024 | Mamba | Code Available | 2 | 5 |
| SmartControl: Enhancing ControlNet for Handling Rough Visual Conditions | Apr 9, 2024 | | Code Available | 2 | 5 |
| The CAST package for training and assessment of spatial prediction models in R | Apr 10, 2024 | feature selection, Model Selection | Code Available | 2 | 5 |
| Manipulating Large Language Models to Increase Product Visibility | Apr 11, 2024 | STS | Code Available | 2 | 5 |
| DesignQA: A Multimodal Benchmark for Evaluating Large Language Models' Understanding of Engineering Documentation | Apr 11, 2024 | | Code Available | 2 | 5 |
| From Words to Numbers: Your Large Language Model Is Secretly A Capable Regressor When Given In-Context Examples | Apr 11, 2024 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| GoMVS: Geometrically Consistent Cost Aggregation for Multi-View Stereo | Apr 11, 2024 | 3D Reconstruction | Code Available | 2 | 5 |
| SFSORT: Scene Features-based Simple Online Real-Time Tracker | Apr 11, 2024 | CPU, Multi-Object Tracking | Code Available | 2 | 5 |
| Constructing and Exploring Intermediate Domains in Mixed Domain Semi-supervised Medical Image Segmentation | Apr 13, 2024 | Domain Adaptation, Image Segmentation | Code Available | 2 | 5 |
| LoongServe: Efficiently Serving Long-Context Large Language Models with Elastic Sequence Parallelism | Apr 15, 2024 | GPU | Code Available | 2 | 5 |
| NTIRE 2024 Challenge on Short-form UGC Video Quality Assessment: Methods and Results | Apr 17, 2024 | Form, valid | Code Available | 2 | 5 |
| Training-and-Prompt-Free General Painterly Harmonization via Zero-Shot Disentenglement on Style and Content References | Apr 19, 2024 | Image Harmonization | Code Available | 2 | 5 |
| LLM-R2: A Large Language Model Enhanced Rule-based Rewrite System for Boosting Query Efficiency | Apr 19, 2024 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| An empirical study of LLaMA3 quantization: from LLMs to MLLMs | Apr 22, 2024 | Language Modelling, Large Language Model | Code Available | 2 | 5 |
| JGLUE: Japanese General Language Understanding Evaluation | Jun 1, 2022 | FLUE, Natural Language Understanding | Code Available | 2 | 5 |
| Two Tales of Persona in LLMs: A Survey of Role-Playing and Personalization | Jun 3, 2024 | Survey | Code Available | 2 | 5 |
| Gradformer: Graph Transformer with Exponential Decay | Apr 24, 2024 | Graph Classification, Graph Neural Network | Code Available | 2 | 5 |
| Large Language Models for Next Point-of-Interest Recommendation | Apr 19, 2024 | | Code Available | 2 | 5 |
| S^2Mamba: A Spatial-spectral State Space Model for Hyperspectral Image Classification | Apr 28, 2024 | Hyperspectral Image Classification, image-classification | Code Available | 2 | 5 |
| Paint by Inpaint: Learning to Add Image Objects by Removing Them First | Apr 28, 2024 | Image Inpainting, Language Modeling | Code Available | 2 | 5 |
| WorldGPT: Empowering LLM as Multimodal World Model | Apr 28, 2024 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| GraCo: Granularity-Controllable Interactive Segmentation | May 1, 2024 | Interactive Segmentation, Segmentation | Code Available | 2 | 5 |
| FeNNol: an Efficient and Flexible Library for Building Force-field-enhanced Neural Network Potentials | May 2, 2024 | GPU | Code Available | 2 | 5 |
| Time Evidence Fusion Network: Multi-source View in Long-Term Time Series Forecasting | May 10, 2024 | Time Series, Time Series Forecasting | Code Available | 2 | 5 |
| Modality-agnostic Domain Generalizable Medical Image Segmentation by Multi-Frequency in Multi-Scale Attention | May 10, 2024 | Image Segmentation, Medical Image Segmentation | Code Available | 2 | 5 |
| OverlapMamba: Novel Shift State Space Model for LiDAR-based Place Recognition | May 13, 2024 | Decision Making, Loop Closure Detection | Code Available | 2 | 5 |
| Evaluation of Retrieval-Augmented Generation: A Survey | May 13, 2024 | Information Retrieval, RAG | Code Available | 2 | 5 |
| From NeRFs to Gaussian Splats, and Back | May 15, 2024 | SSIM | Code Available | 2 | 5 |
| Improving Point-based Crowd Counting and Localization Based on Auxiliary Point Guidance | May 17, 2024 | Crowd Counting | Code Available | 2 | 5 |
| xFinder: Robust and Pinpoint Answer Extraction for Large Language Models | May 20, 2024 | | Code Available | 2 | 5 |
| Mammo-CLIP: A Vision Language Foundation Model to Enhance Data Efficiency and Robustness in Mammography | May 20, 2024 | Breast Cancer Detection, Diversity | Code Available | 2 | 5 |
| ProtT3: Protein-to-Text Generation for Text-based Protein Understanding | May 21, 2024 | Property Prediction, Question Answering | Code Available | 2 | 5 |
| ChatScene: Knowledge-Enabled Safety-Critical Scenario Generation for Autonomous Vehicles | May 22, 2024 | Autonomous Driving, Autonomous Vehicles | Code Available | 2 | 5 |
| Efficient Visual State Space Model for Image Deblurring | May 23, 2024 | Deblurring, Image Deblurring | Code Available | 2 | 5 |