| Vision Transformer Neural Architecture Search for Out-of-Distribution Generalization: Benchmark and Insights | Jan 7, 2025 | Neural Architecture Search, Out-of-Distribution Generalization | Code Available | 1 |
| Chirpy3D: Creative Fine-grained 3D Object Fabrication via Part Sampling | Jan 7, 2025 | 3D Generation | Code Available | 1 |
| Materialist: Physically Based Editing Using Single-Image Inverse Rendering | Jan 7, 2025 | Inverse Rendering | Code Available | 1 |
| ConcealGS: Concealing Invisible Copyright Information in 3D Gaussian Splatting | Jan 7, 2025 | 3D Reconstruction, Knowledge Distillation | Code Available | 1 |
| BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning | Jan 6, 2025 | In-Context Learning, Math | Code Available | 1 |
| DoubleDiffusion: Combining Heat Diffusion with Denoising Diffusion for Texture Generation on 3D Meshes | Jan 6, 2025 | 3D Surface Generation, Geometry-based operator learning | Code Available | 1 |
| CAMP: Collaborative Attention Model with Profiles for Vehicle Routing Problems | Jan 6, 2025 | Computational Efficiency, Multi-agent Reinforcement Learning | Code Available | 1 |
| Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation | Jan 6, 2025 | Language Model Evaluation, Language Modeling | Code Available | 1 |
| ScaleMAI: Accelerating the Development of Trusted Datasets and AI Models | Jan 6, 2025 | | Code Available | 1 |
| MSA-CNN: A Lightweight Multi-Scale CNN with Attention for Sleep Stage Classification | Jan 6, 2025 | Automatic Sleep Stage Classification | Code Available | 1 |
| Underwater Image Restoration Through a Prior Guided Hybrid Sense Approach and Extensive Benchmark Analysis | Jan 6, 2025 | Benchmarking, Image Enhancement | Code Available | 1 |
| AIF-SFDA: Autonomous Information Filter-driven Source-Free Domain Adaptation for Medical Image Segmentation | Jan 6, 2025 | Domain Adaptation, Image Segmentation | Code Available | 1 |
| Normalizing Batch Normalization for Long-Tailed Recognition | Jan 6, 2025 | | Code Available | 1 |
| ICFNet: Integrated Cross-modal Fusion Network for Survival Prediction | Jan 6, 2025 | Decision Making, Survival Prediction | Code Available | 1 |
| Key-value memory in the brain | Jan 6, 2025 | Retrieval | Code Available | 1 |
| Fuzzy Granule Density-Based Outlier Detection with Multi-Scale Granular Balls | Jan 6, 2025 | Outlier Detection | Code Available | 1 |
| RadDet: A Wideband Dataset for Real-Time Radar Spectrum Detection | Jan 6, 2025 | | Code Available | 1 |
| Segmenting Text and Learning Their Rewards for Improved RLHF in Language Model | Jan 6, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| OpenLKA: an open dataset of lane keeping assist from market autonomous vehicles | Jan 6, 2025 | Autonomous Vehicles, Trajectory Planning | Code Available | 1 |
| Co-Activation Graph Analysis of Safety-Verified and Explainable Deep Reinforcement Learning Policies | Jan 6, 2025 | Decision Making, Deep Reinforcement Learning | Code Available | 1 |
| Dr. Tongue: Sign-Oriented Multi-label Detection for Remote Tongue Diagnosis | Jan 6, 2025 | Attribute, Diagnostic | Code Available | 1 |
| Analyzing Fine-tuning Representation Shift for Multimodal LLMs Steering alignment | Jan 6, 2025 | | Code Available | 1 |
| Holistic Semantic Representation for Navigational Trajectory Generation | Jan 6, 2025 | Few-Shot Learning, Zero-Shot Learning | Code Available | 1 |
| Registering Source Tokens to Target Language Spaces in Multilingual Neural Machine Translation | Jan 6, 2025 | Machine Translation, Translation | Code Available | 1 |
| Geometry Restoration and Dewarping of Camera-Captured Document Images | Jan 6, 2025 | Optical Character Recognition, Optical Character Recognition (OCR) | Code Available | 1 |
| FoundPAD: Foundation Models Reloaded for Face Presentation Attack Detection | Jan 6, 2025 | Face Anti-Spoofing, Face Presentation Attack Detection | Code Available | 1 |
| SALT: Sales Autocompletion Linked Business Tables Dataset | Jan 6, 2025 | ERP, Representation Learning | Code Available | 1 |
| Face-MakeUp: Multimodal Facial Prompts for Text-to-Image Generation | Jan 5, 2025 | Image Generation, Text to Image Generation | Code Available | 1 |
| Generalization-Enhanced Few-Shot Object Detection in Remote Sensing | Jan 5, 2025 | Few-Shot Learning, Few-Shot Object Detection | Code Available | 1 |
| HALO: Hadamard-Assisted Lower-Precision Optimization for LLMs | Jan 5, 2025 | Efficient Neural Network, parameter-efficient fine-tuning | Code Available | 1 |
| LeetDecoding: A PyTorch Library for Exponentially Decaying Causal Linear Attention with CUDA Implementations | Jan 5, 2025 | GPU | Code Available | 1 |
| Watch Video, Catch Keyword: Context-aware Keyword Attention for Moment Retrieval and Highlight Detection | Jan 5, 2025 | Contrastive Learning, Highlight Detection | Code Available | 1 |
| FOLDER: Accelerating Multi-modal Large Language Models with Enhanced Performance | Jan 5, 2025 | Token Reduction | Code Available | 1 |
| Unsupervised Search for Ethnic Minorities' Medical Segmentation Training Set | Jan 5, 2025 | | Code Available | 1 |
| Multispectral Pedestrian Detection with Sparsely Annotated Label | Jan 5, 2025 | object-detection, Object Detection | Code Available | 1 |
| DenseGNN: universal and scalable deeper graph neural networks for high-performance property prediction in crystals and molecules | Jan 5, 2025 | Domain Adaptation, Property Prediction | Code Available | 1 |
| KM-UNet KAN Mamba UNet for medical image segmentation | Jan 5, 2025 | Computational Efficiency, Image Segmentation | Code Available | 1 |
| Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs? | Jan 5, 2025 | Image Captioning, Image to text | Code Available | 1 |
| Establishing baselines for generative discovery of inorganic crystals | Jan 4, 2025 | Band Gap, Language Modeling | Code Available | 1 |
| AdaSkip: Adaptive Sublayer Skipping for Accelerating Long-Context LLM Inference | Jan 4, 2025 | | Code Available | 1 |
| Easing Optimization Paths: a Circuit Perspective | Jan 4, 2025 | | Code Available | 1 |
| Personalized Graph-Based Retrieval for Large Language Models | Jan 4, 2025 | Knowledge Graphs, Retrieval | Code Available | 1 |
| RadarNeXt: Real-Time and Reliable 3D Object Detector Based On 4D mmWave Imaging Radar | Jan 4, 2025 | 3D Object Detection, 3D Object Detection (RoI) | Code Available | 1 |
| V2X-DGPE: Addressing Domain Gaps and Pose Errors for Robust Collaborative 3D Object Detection | Jan 4, 2025 | 3D Object Detection, Knowledge Distillation | Code Available | 1 |
| On LLM-Enhanced Mixed-Type Data Imputation with High-Order Message Passing | Jan 4, 2025 | Chunking, Imputation | Code Available | 1 |
| Multimodal Contrastive Representation Learning in Augmented Biomedical Knowledge Graphs | Jan 3, 2025 | Contrastive Learning, Graph Embedding | Code Available | 1 |
| Mitigating Hallucination for Large Vision Language Model by Inter-Modality Correlation Calibration Decoding | Jan 3, 2025 | Hallucination, Language Modeling | Code Available | 1 |
| Detecting Music Performance Errors with Transformers | Jan 3, 2025 | | Code Available | 1 |
| Robust Self-Paced Hashing for Cross-Modal Retrieval with Noisy Labels | Jan 3, 2025 | Computational Efficiency, Cross-Modal Retrieval | Code Available | 1 |
| QuantumBind-RBFE: Accurate Relative Binding Free Energy Calculations Using Neural Network Potentials | Jan 3, 2025 | Drug Discovery | Code Available | 1 |