| Polyp SAM 2: Advancing Zero shot Polyp Segmentation in Colorectal Cancer Detection | Aug 12, 2024 | Segmentation | Code Available | 1 |
| Operator Learning Using Random Features: A Tool for Scientific Computing | Aug 12, 2024 | Operator learning, regression | Code Available | 1 |
| FuxiTranyu: A Multilingual Large Language Model Trained with Balanced Data | Aug 12, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| S-SAM: SVD-based Fine-Tuning of Segment Anything Model for Medical Image Segmentation | Aug 12, 2024 | Image Segmentation, Medical Image Segmentation | Code Available | 1 |
| Review-driven Personalized Preference Reasoning with Large Language Models for Recommendation | Aug 12, 2024 | Prediction, Recommendation Systems | Code Available | 1 |
| Probabilistic Vision-Language Representation for Weakly Supervised Temporal Action Localization | Aug 12, 2024 | Action Classification, Action Localization | Code Available | 1 |
| Multimodal Large Language Models for Phishing Webpage Detection and Identification | Aug 12, 2024 | | Code Available | 1 |
| RISurConv: Rotation Invariant Surface Attention-Augmented Convolutions for 3D Point Cloud Classification and Segmentation | Aug 12, 2024 | 3D Point Cloud Classification, Point Cloud Classification | Code Available | 1 |
| FastFiD: Improve Inference Efficiency of Open Domain Question Answering via Sentence Selection | Aug 12, 2024 | Answer Generation, Decoder | Code Available | 1 |
| MovieSum: An Abstractive Summarization Dataset for Movie Screenplays | Aug 12, 2024 | Abstractive Text Summarization, Document Summarization | Code Available | 1 |
| Benchmarking tree species classification from proximally-sensed laser scanning data: introducing the FOR-species20K dataset | Aug 12, 2024 | Benchmarking | Code Available | 1 |
| Unseen No More: Unlocking the Potential of CLIP for Generative Zero-shot HOI Detection | Aug 12, 2024 | Human-Object Interaction Detection, Zero-Shot Human-Object Interaction Detection | Code Available | 1 |
| ClickAttention: Click Region Similarity Guided Interactive Segmentation | Aug 12, 2024 | Interactive Segmentation | Code Available | 1 |
| BI-MDRG: Bridging Image History in Multimodal Dialogue Response Generation | Aug 12, 2024 | Response Generation | Code Available | 1 |
| HAT: History-Augmented Anchor Transformer for Online Temporal Action Localization | Aug 12, 2024 | Action Localization, Temporal Action Localization | Code Available | 1 |
| Pattern-Matching Dynamic Memory Network for Dual-Mode Traffic Prediction | Aug 12, 2024 | Prediction, Traffic Prediction | Code Available | 1 |
| Mipmap-GS: Let Gaussians Deform with Scale-specific Mipmap for Anti-aliasing Rendering | Aug 12, 2024 | 3DGS, NeRF | Code Available | 1 |
| Prompto: An open source library for asynchronous querying of LLM endpoints | Aug 12, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models | Aug 12, 2024 | Image Generation | Code Available | 1 |
| Freehand Sketch Generation from Mechanical Components | Aug 12, 2024 | | Code Available | 1 |
| What Ails Generative Structure-based Drug Design: Expressivity is Too Little or Too Much? | Aug 12, 2024 | Attribute, Drug Design | Code Available | 1 |
| Image Denoising Using Green Channel Prior | Aug 12, 2024 | Denoising, Image Denoising | Code Available | 1 |
| Prototype Learning Guided Hybrid Network for Breast Tumor Segmentation in DCE-MRI | Aug 11, 2024 | Decoder, Segmentation | Code Available | 1 |
| MTSCI: A Conditional Diffusion Model for Multivariate Time Series Consistent Imputation | Aug 11, 2024 | Denoising, Imputation | Code Available | 1 |
| LaWa: Using Latent Space for In-Generation Image Watermarking | Aug 11, 2024 | | Code Available | 1 |
| Language-Informed Beam Search Decoding for Multilingual Machine Translation | Aug 11, 2024 | Language Identification, Machine Translation | Code Available | 1 |
| Iterative Improvement of an Additively Regularized Topic Model | Aug 11, 2024 | model, Topic Models | Code Available | 1 |
| Divide-and-Conquer Predictive Coding: a structured Bayesian inference algorithm | Aug 11, 2024 | Bayesian Inference, Variational Inference | Code Available | 1 |
| PhishLang: A Real-Time, Fully Client-Side Phishing Detection Framework Using MobileBERT | Aug 11, 2024 | Language Modelling, Large Language Model | Code Available | 1 |
| LI-TTA: Language Informed Test-Time Adaptation for Automatic Speech Recognition | Aug 11, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| PS-TTL: Prototype-based Soft-labels and Test-Time Learning for Few-shot Object Detection | Aug 11, 2024 | Few-Shot Object Detection, object-detection | Code Available | 1 |
| Open Role-Playing with Delta-Engines | Aug 11, 2024 | | Code Available | 1 |
| FADE: A Dataset for Detecting Falling Objects around Buildings in Video | Aug 11, 2024 | Moving Object Detection, Object | Code Available | 1 |
| TC-KANRecon: High-Quality and Accelerated MRI Reconstruction via Adaptive KAN Mechanisms and Intelligent Feature Scaling | Aug 11, 2024 | Denoising, Image Denoising | Code Available | 1 |
| StealthDiffusion: Towards Evading Diffusion Forensic Detection through Diffusion Model | Aug 11, 2024 | | Code Available | 1 |
| HySparK: Hybrid Sparse Masking for Large Scale Medical Image Pre-Training | Aug 11, 2024 | Decoder, Self-Supervised Learning | Code Available | 1 |
| Deep Learning in Medical Image Registration: Magic or Mirage? | Aug 11, 2024 | Deep Learning, Image Registration | Code Available | 1 |
| Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators | Aug 11, 2024 | Denoising | Code Available | 1 |
| EPAM-Net: An Efficient Pose-driven Attention-guided Multimodal Network for Video Action Recognition | Aug 10, 2024 | Action Classification, Action Recognition | Code Available | 1 |
| PersonViT: Large-scale Self-supervised Vision Transformer for Person Re-Identification | Aug 10, 2024 | Contrastive Learning, Person Re-Identification | Code Available | 1 |
| PRTGaussian: Efficient Relighting Using 3D Gaussians with Precomputed Radiance Transfer | Aug 10, 2024 | Computational Efficiency, Novel View Synthesis | Code Available | 1 |
| SAM-FNet: SAM-Guided Fusion Network for Laryngo-Pharyngeal Tumor Detection | Aug 10, 2024 | Diagnostic, Semantic Segmentation | Code Available | 1 |
| Pretrained-Guided Conditional Diffusion Models for Microbiome Data Analysis | Aug 10, 2024 | Denoising, Imputation | Code Available | 1 |
| ViC: Virtual Compiler Is All You Need For Assembly Code Search | Aug 10, 2024 | All, Code Search | Code Available | 1 |
| Preserving Privacy in Large Language Models: A Survey on Current Threats and Solutions | Aug 10, 2024 | Machine Unlearning | Code Available | 1 |
| Investigating Instruction Tuning Large Language Models on Graphs | Aug 10, 2024 | Instruction Following | Code Available | 1 |
| ZePo: Zero-Shot Portrait Stylization with Faster Sampling | Aug 10, 2024 | Image Generation, Text to Image Generation | Code Available | 1 |
| UrFound: Towards Universal Retinal Foundation Models via Knowledge-Guided Masked Modeling | Aug 10, 2024 | Representation Learning | Code Available | 1 |
| CryoBench: Diverse and challenging datasets for the heterogeneity problem in cryo-EM | Aug 10, 2024 | 3D Reconstruction | Code Available | 1 |
| Eigen Attention: Attention in Low-Rank Space for KV Cache Compression | Aug 10, 2024 | | Code Available | 1 |