| GOLLuM: Gaussian Process Optimized LLMs -- Reframing LLM Finetuning through Bayesian Optimization | Apr 8, 2025 | Bayesian Optimization, Contrastive Learning | Code Available | 1 |
| An Empirical Study of GPT-4o Image Generation Capabilities | Apr 8, 2025 | Benchmarking, Image Generation | Code Available | 1 |
| Mind the Trojan Horse: Image Prompt Adapter Enabling Scalable and Deceptive Jailbreaking | Apr 8, 2025 | Image Generation | Code Available | 1 |
| Retrieval Augmented Generation with Collaborative Filtering for Personalized Text Generation | Apr 8, 2025 | Collaborative Filtering, Contrastive Learning | Code Available | 1 |
| A Control-Oriented Simplified Single Particle Model with Grouped Parameter and Sensitivity Analysis for Lithium-Ion Batteries | Apr 8, 2025 | Computational Efficiency, parameter estimation | Code Available | 1 |
| Temporal Alignment-Free Video Matching for Few-shot Action Recognition | Apr 8, 2025 | Action Recognition, Few-Shot action recognition | Code Available | 1 |
| FEABench: Evaluating Language Models on Multiphysics Reasoning Ability | Apr 8, 2025 | | Code Available | 1 |
| Knowledge Graph Completion with Relation-Aware Anchor Enhancement | Apr 8, 2025 | Knowledge Graph Completion, Link Prediction | Code Available | 1 |
| A Multi-Modal AI System for Screening Mammography: Integrating 2D and 3D Imaging to Improve Breast Cancer Detection in a Prospective Clinical Study | Apr 8, 2025 | Breast Cancer Detection, Diagnostic | Code Available | 1 |
| V-MAGE: A Game Evaluation Framework for Assessing Vision-Centric Capabilities in Multimodal Large Language Models | Apr 8, 2025 | Benchmarking, Visual Reasoning | Code Available | 1 |
| CamContextI2V: Context-aware Controllable Video Generation | Apr 8, 2025 | Diversity, Scene Understanding | Code Available | 1 |
| Robo-taxi Fleet Coordination at Scale via Reinforcement Learning | Apr 8, 2025 | Computational Efficiency, Graph Representation Learning | Code Available | 1 |
| Leanabell-Prover: Posttraining Scaling in Formal Reasoning | Apr 8, 2025 | Automated Theorem Proving, reinforcement-learning | Code Available | 1 |
| Why is Normalization Necessary for Linear Recommenders? | Apr 8, 2025 | Collaborative Filtering, Recommendation Systems | Code Available | 1 |
| kNN-SVC: Robust Zero-Shot Singing Voice Conversion with Additive Synthesis and Concatenation Smoothness Optimization | Apr 8, 2025 | Voice Conversion | Code Available | 1 |
| Reconstruction-Free Anomaly Detection with Diffusion Models via Direct Latent Likelihood Evaluation | Apr 8, 2025 | Anomaly Detection | Code Available | 1 |
| HRMedSeg: Unlocking High-resolution Medical Image segmentation via Memory-efficient Attention Modeling | Apr 8, 2025 | Decoder, GPU | Code Available | 1 |
| To Match or Not to Match: Revisiting Image Matching for Reliable Visual Place Recognition | Apr 8, 2025 | Re-Ranking, Retrieval | Code Available | 1 |
| Learning Affine Correspondences by Integrating Geometric Constraints | Apr 7, 2025 | Pose Estimation | Code Available | 1 |
| ChartQAPro: A More Diverse and Challenging Benchmark for Chart Question Answering | Apr 7, 2025 | Chart Question Answering, Chart Understanding | Code Available | 1 |
| Predicting Survivability of Cancer Patients with Metastatic Patterns Using Explainable AI | Apr 7, 2025 | Prognosis, Survival Analysis | Code Available | 1 |
| mixEEG: Enhancing EEG Federated Learning for Cross-subject EEG Classification with Tailored mixup | Apr 7, 2025 | Domain Adaptation, EEG | Code Available | 1 |
| Advanced Codebook Design for SCMA-aided NTNs With Randomly Distributed Users | Apr 7, 2025 | Diversity, Fairness | Code Available | 1 |
| Climplicit: Climatic Implicit Embeddings for Global Ecological Tasks | Apr 7, 2025 | Deep Learning | Code Available | 1 |
| Continuous Locomotive Crowd Behavior Generation | Apr 7, 2025 | | Code Available | 1 |
| Dynamic Vision Mamba | Apr 7, 2025 | Mamba | Code Available | 1 |
| R2Vul: Learning to Reason about Software Vulnerabilities with Reinforcement Learning and Structured Reasoning Distillation | Apr 7, 2025 | valid, Vulnerability Detection | Code Available | 1 |
| Data Augmentation as Free Lunch: Exploring the Test-Time Augmentation for Sequential Recommendation | Apr 7, 2025 | Data Augmentation, Sequential Recommendation | Code Available | 1 |
| ELT-Bench: An End-to-End Benchmark for Evaluating AI Agents on ELT Pipelines | Apr 7, 2025 | AI Agent, Text to SQL | Code Available | 1 |
| On the Robustness of GUI Grounding Models Against Image Attacks | Apr 7, 2025 | | Code Available | 1 |
| Concise Reasoning via Reinforcement Learning | Apr 7, 2025 | reinforcement-learning, Reinforcement Learning | Code Available | 1 |
| 3DM-WeConvene: Learned Image Compression with 3D Multi-Level Wavelet-Domain Convolution and Entropy Model | Apr 7, 2025 | Image Compression | Code Available | 1 |
| Lightweight and Direct Document Relevance Optimization for Generative Information Retrieval | Apr 7, 2025 | Information Retrieval, Natural Questions | Code Available | 1 |
| Collab-RAG: Boosting Retrieval-Augmented Generation for Complex Question Answering via White-Box and Black-Box LLM Collaboration | Apr 7, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| System Log Parsing with Large Language Models: A Review | Apr 7, 2025 | Anomaly Detection, Log Parsing | Code Available | 1 |
| Embracing Dynamics: Dynamics-aware 4D Gaussian Splatting SLAM | Apr 7, 2025 | 3DGS, Pose Tracking | Code Available | 1 |
| Scaling Graph Neural Networks for Particle Track Reconstruction | Apr 7, 2025 | Edge Classification, GPU | Code Available | 1 |
| A Desideratum for Conversational Agents: Capabilities, Challenges, and Future Directions | Apr 7, 2025 | | Code Available | 1 |
| EquiCPI: SE(3)-Equivariant Geometric Deep Learning for Structure-Aware Prediction of Compound-Protein Interactions | Apr 7, 2025 | Drug Discovery, Re-Ranking | Code Available | 1 |
| Can LLM-Driven Hard Negative Sampling Empower Collaborative Filtering? Findings and Potentials | Apr 7, 2025 | Collaborative Filtering, Profile Generation | Code Available | 1 |
| Joint Pedestrian and Vehicle Traffic Optimization in Urban Environments using Reinforcement Learning | Apr 7, 2025 | Reinforcement Learning (RL), Traffic Signal Control | Code Available | 1 |
| LoopGen: Training-Free Loopable Music Generation | Apr 6, 2025 | Music Generation | Code Available | 1 |
| CO-Bench: Benchmarking Language Model Agents in Algorithm Search for Combinatorial Optimization | Apr 6, 2025 | Benchmarking, Combinatorial Optimization | Code Available | 1 |
| COHESION: Composite Graph Convolutional Network with Dual-Stage Fusion for Multimodal Recommendation | Apr 6, 2025 | Multimodal Recommendation, Representation Learning | Code Available | 1 |
| WaveNet-Volterra Neural Networks for Active Noise Control: A Fully Causal Approach | Apr 6, 2025 | | Code Available | 1 |
| Hessian of Perplexity for Large Language Models by PyTorch autograd (Open Source) | Apr 6, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| Window Token Concatenation for Efficient Visual Large Language Models | Apr 5, 2025 | Token Reduction | Code Available | 1 |
| MSL: Not All Tokens Are What You Need for Tuning LLM as a Recommender | Apr 5, 2025 | All, Language Modeling | Code Available | 1 |
| Collaboration and Controversy Among Experts: Rumor Early Detection by Tuning a Comment Generator | Apr 5, 2025 | | Code Available | 1 |
| A Survey of Pathology Foundation Model: Progress and Future Directions | Apr 5, 2025 | Benchmarking, Multiple Instance Learning | Code Available | 1 |