| Learning Transferable Negative Prompts for Out-of-Distribution Detection | Apr 4, 2024 | Out-of-Distribution Detection, Out of Distribution (OOD) Detection | Code Available | 2 |
| ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editing | Apr 5, 2024 | Image Manipulation | Code Available | 2 |
| Joint Reconstruction of 3D Human and Object via Contact-Based Refinement Transformer | Apr 7, 2024 | 3D Human Reconstruction, 3D Object Reconstruction | Code Available | 2 |
| LAKE-RED: Camouflaged Images Generation by Latent Background Knowledge Retrieval-Augmented Diffusion | Mar 30, 2024 | Diversity, Image Generation | Code Available | 2 |
| Learning Instance-Aware Correspondences for Robust Multi-Instance Point Cloud Registration in Cluttered Scenes | Apr 6, 2024 | Point Cloud Registration | Code Available | 2 |
| Test-Time Adaptation with SaLIP: A Cascade of SAM and CLIP for Zero shot Medical Image Segmentation | Apr 9, 2024 | Image Segmentation, Medical Image Segmentation | Code Available | 2 |
| SafeGen: Mitigating Sexually Explicit Content Generation in Text-to-Image Models | Apr 10, 2024 | | Code Available | 2 |
| MindBridge: A Cross-Subject Brain Decoding Framework | Apr 11, 2024 | Brain Decoding, Data Augmentation | Code Available | 2 |
| Content-Adaptive Non-Local Convolution for Remote Sensing Pansharpening | Apr 11, 2024 | Pansharpening | Code Available | 2 |
| Inheritune: Training Smaller Yet More Attentive Language Models | Apr 12, 2024 | Decoder, Language Modelling | Code Available | 2 |
| Latent Guard: a Safety Framework for Text-to-image Generation | Apr 11, 2024 | Contrastive Learning, Image Generation | Code Available | 2 |
| TextHawk: Exploring Efficient Fine-Grained Perception of Multimodal Large Language Models | Apr 14, 2024 | | Code Available | 2 |
| LLM-Seg: Bridging Image Segmentation and Large Language Model Reasoning | Apr 12, 2024 | Image Segmentation, Language Modeling | Code Available | 2 |
| Confidential Federated Computations | Apr 16, 2024 | Federated Learning | Code Available | 2 |
| PuzzleFusion++: Auto-agglomerative 3D Fracture Assembly by Denoise and Verify | Jun 1, 2024 | | Code Available | 2 |
| Point-In-Context: Understanding Point Cloud via In-Context Learning | Apr 18, 2024 | In-Context Learning | Code Available | 2 |
| MambaMOS: LiDAR-based 3D Moving Object Segmentation with Motion-aware State Space Model | Apr 19, 2024 | Object, Semantic Segmentation | Code Available | 2 |
| MAexp: A Generic Platform for RL-based Multi-Agent Exploration | Apr 19, 2024 | Diversity, Multi-agent Reinforcement Learning | Code Available | 2 |
| Improving Sequential Recommendations with LLMs | Feb 2, 2024 | Sequential Recommendation | Code Available | 2 |
| Retrieval-Augmented Generation-based Relation Extraction | Apr 20, 2024 | Relation, Relation Extraction | Code Available | 2 |
| TI2V-Zero: Zero-Shot Image Conditioning for Text-to-Video Diffusion Models | Apr 25, 2024 | Denoising, Image to Video Generation | Code Available | 2 |
| Classifier-guided neural blind deconvolution: a physics-informed denoising module for bearing fault diagnosis under heavy noise | Apr 11, 2024 | Deep Learning, Denoising | Code Available | 2 |
| Kangaroo: Lossless Self-Speculative Decoding via Double Early Exiting | Apr 29, 2024 | | Code Available | 2 |
| 3DHumanGAN: 3D-Aware Human Image Generation with 3D Pose Mapping | Dec 14, 2022 | Generative Adversarial Network, Image Generation | Code Available | 2 |
| Causal Evaluation of Language Models | May 1, 2024 | Causal Discovery, Causal Inference | Code Available | 2 |
| Joint Signal Detection and Automatic Modulation Classification via Deep Learning | Apr 29, 2024 | Deep Learning | Code Available | 2 |
| PLeak: Prompt Leaking Attacks against Large Language Model Applications | May 10, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Transcriptomics-guided Slide Representation Learning in Computational Pathology | May 19, 2024 | Contrastive Learning, Representation Learning | Code Available | 2 |
| End-to-End Full-Page Optical Music Recognition for Pianoform Sheet Music | May 20, 2024 | Synthetic Data Generation | Code Available | 2 |
| Identifying Functionally Important Features with End-to-End Sparse Dictionary Learning | May 17, 2024 | Dictionary Learning | Code Available | 2 |
| RoGs: Large Scale Road Surface Reconstruction with Meshgrid Gaussian | May 23, 2024 | Autonomous Driving, Surface Reconstruction | Code Available | 2 |
| Advancing Spiking Neural Networks for Sequential Modeling with Central Pattern Generators | May 23, 2024 | image-classification, Image Classification | Code Available | 2 |
| LM4LV: A Frozen Large Language Model for Low-level Vision Tasks | May 24, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Sparse maximal update parameterization: A holistic approach to sparse training dynamics | May 24, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Frustratingly Easy Test-Time Adaptation of Vision-Language Models | May 28, 2024 | Test-time Adaptation | Code Available | 2 |
| Instruct-ReID++: Towards Universal Purpose Instruction-Guided Person Re-identification | May 28, 2024 | Person Re-Identification, Triplet | Code Available | 2 |
| Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations | May 28, 2024 | GPU | Code Available | 2 |
| Benchmarking and Improving Detail Image Caption | May 29, 2024 | Benchmarking, Image Captioning | Code Available | 2 |
| WorldGUI: An Interactive Benchmark for Desktop GUI Automation from Any Starting Point | Feb 12, 2025 | | Code Available | 2 |
| Medformer: A Multi-Granularity Patching Transformer for Medical Time-Series Classification | May 24, 2024 | EEG, Electrocardiography (ECG) | Code Available | 2 |
| TabPedia: Towards Comprehensive Visual Table Understanding with Concept Synergy | Jun 3, 2024 | Language Modelling, Question Answering | Code Available | 2 |
| DroneVis: Versatile Computer Vision Library for Drones | Jun 1, 2024 | | Code Available | 2 |
| Neural Optimal Transport with Lagrangian Costs | Jun 1, 2024 | | Code Available | 2 |
| Generative Pre-trained Speech Language Model with Efficient Hierarchical Transformer | Jun 3, 2024 | Audio Generation, In-Context Learning | Code Available | 2 |
| Parameter-Inverted Image Pyramid Networks | Jun 6, 2024 | Computational Efficiency, image-classification | Code Available | 2 |
| MoE Jetpack: From Dense Checkpoints to Adaptive Mixture of Experts for Vision Tasks | Jun 7, 2024 | Computational Efficiency, Mixture-of-Experts | Code Available | 2 |
| FRAG: Frequency Adapting Group for Diffusion Video Editing | Jun 10, 2024 | Denoising, Video Editing | Code Available | 2 |
| Towards Lifelong Learning of Large Language Models: A Survey | Jun 10, 2024 | Continual Pretraining, Incremental Learning | Code Available | 2 |
| Needle In A Multimodal Haystack | Jun 11, 2024 | Retrieval | Code Available | 2 |
| DafnyBench: A Benchmark for Formal Software Verification | Jun 12, 2024 | | Code Available | 2 |