| Federated Learning with New Knowledge: Fundamentals, Advances, and Futures | Feb 3, 2024 | Federated LearningPrivacy Preserving | CodeCode Available | 2 |
| Cross-view Masked Diffusion Transformers for Person Image Synthesis | Feb 2, 2024 | DenoisingImage Generation | CodeCode Available | 2 |
| DeepAAT: Deep Automated Aerial Triangulation for Fast UAV-based Mapping | Feb 2, 2024 | 3D ReconstructionEarth Observation | CodeCode Available | 2 |
| Guiding Masked Representation Learning to Capture Spatio-Temporal Relationship of Electrocardiogram | Feb 2, 2024 | DiagnosticECG Classification | CodeCode Available | 2 |
| A Single Simple Patch is All You Need for AI-generated Image Detection | Feb 2, 2024 | All | CodeCode Available | 2 |
| SynthCLIP: Are We Ready for a Fully Synthetic CLIP Training? | Feb 2, 2024 | | CodeCode Available | 2 |
| Improving Sequential Recommendations with LLMs | Feb 2, 2024 | Sequential Recommendation | CodeCode Available | 2 |
| LitLLM: A Toolkit for Scientific Literature Review | Feb 2, 2024 | RAGRetrieval | CodeCode Available | 2 |
| TrustAgent: Towards Safe and Trustworthy LLM-based Agents | Feb 2, 2024 | Task Planning | CodeCode Available | 2 |
| StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback | Feb 2, 2024 | Code CompletionCode Generation | CodeCode Available | 2 |
| Efficient and Effective Time-Series Forecasting with Spiking Neural Networks | Feb 2, 2024 | Model SelectionTime Series | CodeCode Available | 2 |
| InfMAE: A Foundation Model in the Infrared Modality | Feb 1, 2024 | DecoderSelf-Supervised Learning | CodeCode Available | 2 |
| EE-Tuning: An Economical yet Scalable Solution for Tuning Early-Exit Large Language Models | Feb 1, 2024 | | CodeCode Available | 2 |
| Towards Efficient Exact Optimization of Language Model Alignment | Feb 1, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| A Survey on Hallucination in Large Vision-Language Models | Feb 1, 2024 | HallucinationSurvey | CodeCode Available | 2 |
| Graph Domain Adaptation: Challenges, Progress and Prospects | Feb 1, 2024 | Domain AdaptationGRAPH DOMAIN ADAPTATION | CodeCode Available | 2 |
| Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents | Feb 1, 2024 | | CodeCode Available | 2 |
| Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning | Feb 1, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| On the Challenges of Fuzzing Techniques via Large Language Models | Feb 1, 2024 | software testingSurvey | CodeCode Available | 2 |
| CapHuman: Capture Your Moments in Parallel Universes | Feb 1, 2024 | Image Generation | CodeCode Available | 2 |
| Developing A Multi-Agent and Self-Adaptive Framework with Deep Reinforcement Learning for Dynamic Portfolio Risk Management | Feb 1, 2024 | Deep Reinforcement LearningManagement | CodeCode Available | 2 |
| PAM: Prompting Audio-Language Models for Audio Quality Assessment | Feb 1, 2024 | Audio Quality AssessmentMusic Generation | CodeCode Available | 2 |
| CF4J: Collaborative Filtering for Java | Feb 1, 2024 | Collaborative FilteringRecommendation Systems | CodeCode Available | 2 |
| Improved Scene Landmark Detection for Camera Localization | Jan 31, 2024 | Camera LocalizationPose Estimation | CodeCode Available | 2 |
| EnCLAP: Combining Neural Audio Codec and Audio-Text Joint Embedding for Automated Audio Captioning | Jan 31, 2024 | AudioCapsAudio captioning | CodeCode Available | 2 |
| On Prompt-Driven Safeguarding for Large Language Models | Jan 31, 2024 | | CodeCode Available | 2 |
| SNP-S3: Shared Network Pre-training and Significant Semantic Strengthening for Various Video-Text Tasks | Jan 31, 2024 | Sentence | CodeCode Available | 2 |
| Fin-GAN: forecasting and classifying financial time series via generative adversarial networks | Jan 31, 2024 | Generative Adversarial NetworkProbabilistic Time Series Forecasting | CodeCode Available | 2 |
| AEROBLADE: Training-Free Detection of Latent Diffusion Images Using Autoencoder Reconstruction Error | Jan 31, 2024 | Denoising | CodeCode Available | 2 |
| ControlCap: Controllable Region-level Captioning | Jan 31, 2024 | Dense Captioning | CodeCode Available | 2 |
| Local Feature Matching Using Deep Learning: A Survey | Jan 31, 2024 | 3D ReconstructionDeep Learning | CodeCode Available | 2 |
| LaneGraph2Seq: Lane Topology Extraction with Language Model via Vertex-Edge Encoding and Connectivity Enhancement | Jan 31, 2024 | Autonomous DrivingLanguage Modeling | CodeCode Available | 2 |
| SAGD: Boundary-Enhanced Segment Anything in 3D Gaussian via Gaussian Decomposition | Jan 31, 2024 | Novel View SynthesisSegmentation | CodeCode Available | 2 |
| M2-RAAP: A Multi-Modal Recipe for Advancing Adaptation-based Pre-training towards Effective and Efficient Zero-shot Video-text Retrieval | Jan 31, 2024 | RetrievalText Retrieval | CodeCode Available | 2 |
| EVA-GAN: Enhanced Various Audio Generation via Scalable Generative Adversarial Networks | Jan 31, 2024 | Audio GenerationSpeech Synthesis | CodeCode Available | 2 |
| Rethinking Channel Dependence for Multivariate Time Series Forecasting: Learning from Leading Indicators | Jan 31, 2024 | Multivariate Time Series ForecastingTime Series | CodeCode Available | 2 |
| EarthGPT: A Universal Multi-modal Large Language Model for Multi-sensor Image Comprehension in Remote Sensing Domain | Jan 30, 2024 | Image ComprehensionInstruction Following | CodeCode Available | 2 |
| Infini-gram: Scaling Unbounded n-gram Language Models to a Trillion Tokens | Jan 30, 2024 | Language Modelling | CodeCode Available | 2 |
| TeenyTinyLlama: open-source tiny language models trained in Brazilian Portuguese | Jan 30, 2024 | Text Generation | CodeCode Available | 2 |
| Weak-to-Strong Jailbreaking on Large Language Models | Jan 30, 2024 | | CodeCode Available | 2 |
| Finetuning Large Language Models for Vulnerability Detection | Jan 30, 2024 | Transfer LearningVulnerability Detection | CodeCode Available | 2 |
| Robust Prompt Optimization for Defending Language Models Against Jailbreaking Attacks | Jan 30, 2024 | | CodeCode Available | 2 |
| Diff-eRank: A Novel Rank-Based Metric for Evaluating Large Language Models | Jan 30, 2024 | Data CompressionLanguage Modelling | CodeCode Available | 2 |
| Multi-granularity Correspondence Learning from Long-term Noisy Videos | Jan 30, 2024 | Action SegmentationLong Video Retrieval (Background Removed) | CodeCode Available | 2 |
| LLaMP: Large Language Model Made Powerful for High-fidelity Materials Knowledge Retrieval and Distillation | Jan 30, 2024 | HallucinationKnowledge Distillation | CodeCode Available | 2 |
| An Open Software Suite for Event-Based Video | Jan 30, 2024 | | CodeCode Available | 2 |
| MF-MOS: A Motion-Focused Model for Moving Object Segmentation | Jan 30, 2024 | Autonomous DrivingObject | CodeCode Available | 2 |
| MouSi: Poly-Visual-Expert Vision-Language Models | Jan 30, 2024 | Image SegmentationImage-text matching | CodeCode Available | 2 |
| Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios | Jan 30, 2024 | Benchmarking | CodeCode Available | 2 |
| MT-Eval: A Multi-Turn Capabilities Evaluation Benchmark for Large Language Models | Jan 30, 2024 | | CodeCode Available | 2 |