| Video Anomaly Detection with Motion and Appearance Guided Patch Diffusion Model | Dec 12, 2024 | Anomaly DetectionVideo Anomaly Detection | CodeCode Available | 1 |
| A physics-informed transformer neural operator for learning generalized solutions of initial boundary value problems | Dec 12, 2024 | Operator learning | CodeCode Available | 1 |
| Augmenting Sequential Recommendation with Balanced Relevance and Diversity | Dec 11, 2024 | Data AugmentationDiversity | CodeCode Available | 1 |
| HARP: A challenging human-annotated math reasoning benchmark | Dec 11, 2024 | Math | CodeCode Available | 1 |
| Repository-Level Graph Representation Learning for Enhanced Security Patch Detection | Dec 11, 2024 | graph constructionGraph Representation Learning | CodeCode Available | 1 |
| PointCFormer: a Relation-based Progressive Feature Extraction Network for Point Cloud Completion | Dec 11, 2024 | 3D Object DetectionPoint Cloud Completion | CodeCode Available | 1 |
| Revisiting Weight Averaging for Model Merging | Dec 11, 2024 | image-classificationImage Classification | CodeCode Available | 1 |
| Fast Prompt Alignment for Text-to-Image Generation | Dec 11, 2024 | Image GenerationIn-Context Learning | CodeCode Available | 1 |
| Concept Bottleneck Large Language Models | Dec 11, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Can a MISL Fly? Analysis and Ingredients for Mutual Information Skill Learning | Dec 11, 2024 | Representation LearningSelf-Supervised Learning | CodeCode Available | 1 |
| EmoVerse: Exploring Multimodal Large Language Models for Sentiment and Emotion Understanding | Dec 11, 2024 | Depression DetectionEmotion-Cause Pair Extraction | CodeCode Available | 1 |
| TextRefiner: Internal Visual Feature as Efficient Refiner for Vision-Language Models Prompt Tuning | Dec 11, 2024 | Prompt Learning | CodeCode Available | 1 |
| PepMNet: a hybrid deep learning model for predicting peptide properties using hierarchical graph representations | Dec 11, 2024 | Drug DesignProperty Prediction | CodeCode Available | 1 |
| SenCLIP: Enhancing zero-shot land-use mapping for Sentinel-2 with ground-level prompting | Dec 11, 2024 | zero-shot-classificationZero-Shot Learning | CodeCode Available | 1 |
| Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning | Dec 11, 2024 | AttributeBenchmarking | CodeCode Available | 1 |
| Magneto: Combining Small and Large Language Models for Schema Matching | Dec 11, 2024 | Reranking | CodeCode Available | 1 |
| Exploiting the Index Gradients for Optimization-Based Jailbreaking on Large Language Models | Dec 11, 2024 | Adversarial Attack | CodeCode Available | 1 |
| ProtoOcc: Accurate, Efficient 3D Occupancy Prediction Using Dual Branch Encoder-Prototype Query Decoder | Dec 11, 2024 | Computational EfficiencyDecoder | CodeCode Available | 1 |
| Adversarial Vulnerabilities in Large Language Models for Time Series Forecasting | Dec 11, 2024 | Adversarial AttackTime Series | CodeCode Available | 1 |
| GDSG: Graph Diffusion-based Solution Generator for Optimization Problems in MEC Networks | Dec 11, 2024 | Graph Neural Network | CodeCode Available | 1 |
| NyayaAnumana & INLegalLlama: The Largest Indian Legal Judgment Prediction Dataset and Specialized Language Model for Enhanced Decision Analysis | Dec 11, 2024 | Continual PretrainingLanguage Modeling | CodeCode Available | 1 |
| Template Matters: Understanding the Role of Instruction Templates in Multimodal Language Model Evaluation and Training | Dec 11, 2024 | Language Model EvaluationLanguage Modeling | CodeCode Available | 1 |
| EOV-Seg: Efficient Open-Vocabulary Panoptic Segmentation | Dec 11, 2024 | DecoderGPU | CodeCode Available | 1 |
| Bootstrapping Language-Guided Navigation Learning with Self-Refining Data Flywheel | Dec 11, 2024 | | CodeCode Available | 1 |
| Digging into Intrinsic Contextual Information for High-fidelity 3D Point Cloud Completion | Dec 11, 2024 | Point Cloud Completion | CodeCode Available | 1 |
| SAM-Mamba: Mamba Guided SAM Architecture for Generalized Zero-Shot Polyp Segmentation | Dec 11, 2024 | MambaSegmentation | CodeCode Available | 1 |
| Boundary Exploration of Next Best View Policy in 3D Robotic Scanning | Dec 11, 2024 | | CodeCode Available | 1 |
| SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World | Dec 10, 2024 | | CodeCode Available | 1 |
| T-TIME: Test-Time Information Maximization Ensemble for Plug-and-Play BCIs | Dec 10, 2024 | Brain Computer InterfaceEEG | CodeCode Available | 1 |
| A New Federated Learning Framework Against Gradient Inversion Attacks | Dec 10, 2024 | Federated LearningPrivacy Preserving | CodeCode Available | 1 |
| Motion Artifact Removal in Pixel-Frequency Domain via Alternate Masks and Diffusion Model | Dec 10, 2024 | | CodeCode Available | 1 |
| FIRE: Robust Detection of Diffusion-Generated Images via Frequency-Guided Reconstruction Error | Dec 10, 2024 | Image Forensics | CodeCode Available | 1 |
| Mask prior-guided denoising diffusion improves inverse protein folding | Dec 10, 2024 | DenoisingProtein Folding | CodeCode Available | 1 |
| Optimizing Personalized Federated Learning through Adaptive Layer-Wise Learning | Dec 10, 2024 | Federated LearningPersonalized Federated Learning | CodeCode Available | 1 |
| Bayesian Optimization of Antibodies Informed by a Generative Model of Evolving Sequences | Dec 10, 2024 | Bayesian OptimizationLanguage Modeling | CodeCode Available | 1 |
| Unlocking the Potential of Reverse Distillation for Anomaly Detection | Dec 10, 2024 | Anomaly DetectionDecoder | CodeCode Available | 1 |
| EvRepSL: Event-Stream Representation via Self-Supervised Learning for Event-Based Vision | Dec 10, 2024 | Event-based visionOptical Flow Estimation | CodeCode Available | 1 |
| On Evaluating the Durability of Safeguards for Open-Weight LLMs | Dec 10, 2024 | | CodeCode Available | 1 |
| Scaling Sequential Recommendation Models with Transformers | Dec 10, 2024 | Recommendation SystemsSequential Recommendation | CodeCode Available | 1 |
| RFL: Simplifying Chemical Structure Recognition with Ring-Free Language | Dec 10, 2024 | Decoder | CodeCode Available | 1 |
| Efficient 3D Recognition with Event-driven Spike Sparse Convolution | Dec 10, 2024 | Attribute | CodeCode Available | 1 |
| Cloud Object Detector Adaptation by Integrating Different Source Knowledge | Dec 10, 2024 | Domain AdaptationKnowledge Distillation | CodeCode Available | 1 |
| Monte Carlo Tree Search based Space Transfer for Black-box Optimization | Dec 10, 2024 | Bayesian OptimizationTransfer Learning | CodeCode Available | 1 |
| Towards Automated Cross-domain Exploratory Data Analysis through Large Language Models | Dec 10, 2024 | Data VisualizationDomain Generalization | CodeCode Available | 1 |
| ReCap: Better Gaussian Relighting with Cross-Environment Captures | Dec 10, 2024 | | CodeCode Available | 1 |
| Temporal Linear Item-Item Model for Sequential Recommendation | Dec 10, 2024 | modelSequential Recommendation | CodeCode Available | 1 |
| IMPACT: A Large-scale Integrated Multimodal Patent Analysis and Creation Dataset for Design Patents | Dec 10, 2024 | Cross-Modal RetrievalImage Classification | CodeCode Available | 1 |
| Adapting to Non-Stationary Environments: Multi-Armed Bandit Enhanced Retrieval-Augmented Generation on Knowledge Graphs | Dec 10, 2024 | Knowledge GraphsRAG | CodeCode Available | 1 |
| Modeling Dual-Exposure Quad-Bayer Patterns for Joint Denoising and Deblurring | Dec 10, 2024 | DeblurringDenoising | CodeCode Available | 1 |
| PTSBench: A Comprehensive Post-Training Sparsity Benchmark Towards Algorithms and Models | Dec 10, 2024 | | CodeCode Available | 1 |