| Grams: Gradient Descent with Adaptive Momentum Scaling | Dec 22, 2024 | | Code Available | 1 |
| Flash3D: Super-scaling Point Transformers through Joint Hardware-Geometry Locality | Dec 21, 2024 | GPU | Code Available | 1 |
| MOL-Mamba: Enhancing Molecular Representation with Structural & Electronic Insights | Dec 21, 2024 | Drug Design, Mamba | Code Available | 1 |
| Trusted Mamba Contrastive Network for Multi-View Clustering | Dec 21, 2024 | Clustering, Contrastive Learning | Code Available | 1 |
| DOFEN: Deep Oblivious Forest ENsemble | Dec 21, 2024 | | Code Available | 1 |
| Solving Inverse Problems via Diffusion Optimal Control | Dec 21, 2024 | Deblurring, Image Reconstruction | Code Available | 1 |
| An explainable operator approximation framework under the guideline of Green's function | Dec 21, 2024 | | Code Available | 1 |
| L3TC: Leveraging RWKV for Learned Lossless Low-Complexity Text Compression | Dec 21, 2024 | Data Compression, Text Compression | Code Available | 1 |
| Diffusion Prior Interpolation for Flexibility Real-World Face Super-Resolution | Dec 21, 2024 | Face Recognition, Super-Resolution | Code Available | 1 |
| Query Quantized Neural SLAM | Dec 21, 2024 | Simultaneous Localization and Mapping | Code Available | 1 |
| Topology-Aware 3D Gaussian Splatting: Leveraging Persistent Homology for Optimized Structural Integrity | Dec 21, 2024 | Novel View Synthesis, SSIM | Code Available | 1 |
| Beyond End-to-End VLMs: Leveraging Intermediate Text Representations for Superior Flowchart Understanding | Dec 21, 2024 | Attribute, Question Answering | Code Available | 1 |
| Toward Highly Efficient Semantic-Guided Machine Vision for Low-Light Object Detection | Dec 20, 2024 | 2D Object Detection, Image Enhancement | Code Available | 1 |
| DefFiller: Mask-Conditioned Diffusion for Salient Steel Surface Defect Generation | Dec 20, 2024 | Data Augmentation, Defect Detection | Code Available | 1 |
| Learning Group Interactions and Semantic Intentions for Multi-Object Trajectory Prediction | Dec 20, 2024 | Prediction, Trajectory Prediction | Code Available | 1 |
| Measuring Cross-Modal Interactions in Multimodal Models | Dec 20, 2024 | | Code Available | 1 |
| Legommenders: A Comprehensive Content-Based Recommendation Library with LLM Support | Dec 20, 2024 | | Code Available | 1 |
| Multi-dimensional Visual Prompt Enhanced Image Restoration via Mamba-Transformer Aggregation | Dec 20, 2024 | Denoising, Image Denoising | Code Available | 1 |
| S^2DN: Learning to Denoise Unconvincing Knowledge for Inductive Knowledge Graph Completion | Dec 20, 2024 | Denoising, Inductive knowledge graph completion | Code Available | 1 |
| MiniGPT-Pancreas: Multimodal Large Language Model for Pancreas Cancer Classification and Detection | Dec 20, 2024 | Cancer Classification, Chatbot | Code Available | 1 |
| ASPIRE: Assistive System for Performance Evaluation in IR | Dec 20, 2024 | Information Retrieval, Retrieval | Code Available | 1 |
| Using matrix-product states for time-series machine learning | Dec 20, 2024 | Astronomy, Imputation | Code Available | 1 |
| Fine-tuning Whisper on Low-Resource Languages for Real-World Applications | Dec 20, 2024 | Form, Sentence | Code Available | 1 |
| Score-based Generative Diffusion Models for Social Recommendations | Dec 20, 2024 | Self-Supervised Learning | Code Available | 1 |
| LiRCDepth: Lightweight Radar-Camera Depth Estimation via Knowledge Distillation and Uncertainty Guidance | Dec 20, 2024 | Computational Efficiency, Depth Estimation | Code Available | 1 |
| Learned Compression of Nonlinear Time Series With Random Access | Dec 20, 2024 | Time Series | Code Available | 1 |
| EGSRAL: An Enhanced 3D Gaussian Splatting based Renderer with Automated Labeling for Large-Scale Driving Scene | Dec 20, 2024 | Novel View Synthesis | Code Available | 1 |
| Continual Learning with Strategic Selection and Forgetting for Network Intrusion Detection | Dec 20, 2024 | Continual Learning, Intrusion Detection | Code Available | 1 |
| BS-LDM: Effective Bone Suppression in High-Resolution Chest X-Ray Images with Conditional Latent Diffusion Models | Dec 20, 2024 | Bone Suppression From Dual Energy Chest X-Rays, Diagnostic | Code Available | 1 |
| Mamba2D: A Natively Multi-Dimensional State-Space Model for Vision Tasks | Dec 20, 2024 | image-classification, Image Classification | Code Available | 1 |
| Pre-training Graph Neural Networks on Molecules by Using Subgraph-Conditioned Graph Information Bottleneck | Dec 20, 2024 | Graph Neural Network, Self-Supervised Learning | Code Available | 1 |
| RiTTA: Modeling Event Relations in Text-to-Audio Generation | Dec 20, 2024 | Audio Generation, Relation | Code Available | 1 |
| Continual Learning Using a Kernel-Based Method Over Foundation Models | Dec 20, 2024 | class-incremental learning, Class Incremental Learning | Code Available | 1 |
| PersonaMagic: Stage-Regulated High-Fidelity Face Customization with Tandem Equilibrium | Dec 20, 2024 | Image Generation, Novel Concepts | Code Available | 1 |
| Towards Interpretable Radiology Report Generation via Concept Bottlenecks using a Multi-Agentic RAG | Dec 20, 2024 | Classification, image-classification | Code Available | 1 |
| Multi Agent Reinforcement Learning for Sequential Satellite Assignment Problems | Dec 20, 2024 | Combinatorial Optimization, Multi-agent Reinforcement Learning | Code Available | 1 |
| MathSpeech: Leveraging Small LMs for Accurate Conversion in Mathematical Speech-to-Formula | Dec 20, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| DiffSim: Taming Diffusion Models for Evaluating Visual Similarity | Dec 19, 2024 | Contrastive Learning, Denoising | Code Available | 1 |
| Unified Image Restoration and Enhancement: Degradation Calibrated Cycle Reconstruction Diffusion Model | Dec 19, 2024 | Denoising, Image Deblurring | Code Available | 1 |
| A Survey of RWKV | Dec 19, 2024 | Natural Language Understanding, Survey | Code Available | 1 |
| WiFi CSI Based Temporal Activity Detection via Dual Pyramid Network | Dec 19, 2024 | Action Detection, Action Recognition | Code Available | 1 |
| PA-RAG: RAG Alignment via Multi-Perspective Preference Optimization | Dec 19, 2024 | Informativeness, RAG | Code Available | 1 |
| A Generative Framework for Probabilistic, Spatiotemporally Coherent Downscaling of Climate Simulation | Dec 19, 2024 | Decision Making | Code Available | 1 |
| Each Fake News is Fake in its Own Way: An Attribution Multi-Granularity Benchmark for Multimodal Fake News Detection | Dec 19, 2024 | Fake News Detection | Code Available | 1 |
| Reasoning Through Execution: Unifying Process and Outcome Rewards for Code Generation | Dec 19, 2024 | Code Generation | Code Available | 1 |
| Efficient Fine-Tuning and Concept Suppression for Pruned Diffusion Models | Dec 19, 2024 | Bilevel Optimization, Knowledge Distillation | Code Available | 1 |
| Generative CKM Construction using Partially Observed Data with Diffusion Model | Dec 19, 2024 | Benchmarking | Code Available | 1 |
| HiCM^2: Hierarchical Compact Memory Modeling for Dense Video Captioning | Dec 19, 2024 | Dense Video Captioning, Video Captioning | Code Available | 1 |
| CLDG: Contrastive Learning on Dynamic Graphs | Dec 19, 2024 | Contrastive Learning, Translation | Code Available | 1 |
| Multi-Level Optimal Transport for Universal Cross-Tokenizer Knowledge Distillation on Language Models | Dec 19, 2024 | Knowledge Distillation | Code Available | 1 |