| MTL-LoRA: Low-Rank Adaptation for Multi-Task Learning | Oct 12, 2024 | Domain Adaptation, Multi-Task Learning | Code Available | 1 |
| Towards Efficient Visual-Language Alignment of the Q-Former for Visual Reasoning Tasks | Oct 12, 2024 | parameter-efficient fine-tuning, Visual Reasoning | Code Available | 1 |
| Rethinking Data Selection at Scale: Random Selection is Almost All You Need | Oct 12, 2024 | All | Code Available | 1 |
| LogLM: From Task-based to Instruction-based Automated Log Analysis | Oct 12, 2024 | Anomaly Detection, Log Parsing | Code Available | 1 |
| DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention | Oct 11, 2024 | image-classification, Image Classification | Code Available | 1 |
| OpenCity: A Scalable Platform to Simulate Urban Activities with Massive LLM Agents | Oct 11, 2024 | | Code Available | 1 |
| Synth-SONAR: Sonar Image Synthesis with Enhanced Diversity and Realism via Dual Diffusion Models and GPT Prompting | Oct 11, 2024 | Diversity, Image Generation | Code Available | 1 |
| Learning General Representation of 12-Lead Electrocardiogram with a Joint-Embedding Predictive Architecture | Oct 11, 2024 | ECG Classification, Electrocardiography (ECG) | Code Available | 1 |
| Cross-Modal Bidirectional Interaction Model for Referring Remote Sensing Image Segmentation | Oct 11, 2024 | Benchmarking, Image Segmentation | Code Available | 1 |
| AttnGCG: Enhancing Jailbreaking Attacks on LLMs with Attention Manipulation | Oct 11, 2024 | Safety Alignment | Code Available | 1 |
| Distillation of Discrete Diffusion through Dimensional Correlations | Oct 11, 2024 | | Code Available | 1 |
| Refusal-Trained LLMs Are Easily Jailbroken As Browser Agents | Oct 11, 2024 | Chatbot, Red Teaming | Code Available | 1 |
| Retraining-Free Merging of Sparse MoE via Hierarchical Clustering | Oct 11, 2024 | Clustering, Language Modeling | Code Available | 1 |
| Parameter-Efficient Fine-Tuning of State Space Models | Oct 11, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Chain-of-Restoration: Multi-Task Image Restoration Models are Zero-Shot Step-by-Step Universal Image Restorers | Oct 11, 2024 | Image Restoration | Code Available | 1 |
| SPORTU: A Comprehensive Sports Understanding Benchmark for Multimodal Large Language Models | Oct 11, 2024 | Few-Shot Learning, Multiple-choice | Code Available | 1 |
| Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization | Oct 11, 2024 | | Code Available | 1 |
| DiffPO: A causal diffusion model for learning distributions of potential outcomes | Oct 11, 2024 | Causal Inference, Decision Making | Code Available | 1 |
| E-Motion: Future Motion Simulation via Event Sequence Diffusion | Oct 11, 2024 | | Code Available | 1 |
| Zeroth-Order Fine-Tuning of LLMs in Random Subspaces | Oct 11, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Conjugated Semantic Pool Improves OOD Detection with Pre-trained Vision-Language Models | Oct 11, 2024 | Out of Distribution (OOD) Detection | Code Available | 1 |
| Learning Interaction-aware 3D Gaussian Splatting for One-shot Hand Avatars | Oct 11, 2024 | | Code Available | 1 |
| Hespi: A pipeline for automatically detecting information from hebarium specimen sheets | Oct 11, 2024 | Handwritten Text Recognition, HTR | Code Available | 1 |
| Do Unlearning Methods Remove Information from Language Model Weights? | Oct 11, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| KinDEL: DNA-Encoded Library Dataset for Kinase Inhibitors | Oct 11, 2024 | Drug Discovery | Code Available | 1 |
| VERIFIED: A Video Corpus Moment Retrieval Benchmark for Fine-Grained Video Understanding | Oct 11, 2024 | Hallucination, Moment Retrieval | Code Available | 1 |
| Language Imbalance Driven Rewarding for Multilingual Self-improving | Oct 11, 2024 | Arithmetic Reasoning, Instruction Following | Code Available | 1 |
| MATCH: Model-Aware TVM-based Compilation for Heterogeneous Edge Devices | Oct 11, 2024 | | Code Available | 1 |
| Low-complexity Attention-based Unsupervised Anomalous Sound Detection exploiting Separable Convolutions and Angular Loss | Oct 11, 2024 | Anomaly Detection, Task 2 | Code Available | 1 |
| When Graph meets Multimodal: Benchmarking on Multimodal Attributed Graphs Learning | Oct 11, 2024 | Attribute, Benchmarking | Code Available | 1 |
| Kaleidoscope: Learnable Masks for Heterogeneous Multi-agent Reinforcement Learning | Oct 11, 2024 | Diversity, MuJoCo | Code Available | 1 |
| Dynamic Multimodal Evaluation with Flexible Complexity by Vision-Language Bootstrapping | Oct 11, 2024 | MME, Question Answering | Code Available | 1 |
| Batched Energy-Entropy acquisition for Bayesian Optimization | Oct 11, 2024 | Bayesian Optimization, Gaussian Processes | Code Available | 1 |
| MiRAGeNews: Multimodal Realistic AI-Generated News Detection | Oct 11, 2024 | | Code Available | 1 |
| Mentor-KD: Making Small Language Models Better Multi-step Reasoners | Oct 11, 2024 | Knowledge Distillation | Code Available | 1 |
| Learning to Walk from Three Minutes of Real-World Data with Semi-structured Dynamics Models | Oct 11, 2024 | Model-based Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| Zero-Shot Offline Imitation Learning via Optimal Transport | Oct 11, 2024 | Imitation Learning | Code Available | 1 |
| Drama: Mamba-Enabled Model-Based Reinforcement Learning Is Sample and Parameter Efficient | Oct 11, 2024 | Mamba, Model-based Reinforcement Learning | Code Available | 1 |
| PEAR: A Robust and Flexible Automation Framework for Ptychography Enabled by Multiple Large Language Model Agents | Oct 11, 2024 | Code Generation, Language Modeling | Code Available | 1 |
| PoisonBench: Assessing Large Language Model Vulnerability to Data Poisoning | Oct 11, 2024 | Data Poisoning, Language Modeling | Code Available | 1 |
| A foundation model for generalizable disease diagnosis in chest X-ray images | Oct 11, 2024 | Self-Supervised Learning | Code Available | 1 |
| SmartPretrain: Model-Agnostic and Dataset-Agnostic Representation Learning for Motion Prediction | Oct 11, 2024 | Autonomous Vehicles, motion prediction | Code Available | 1 |
| Recovering complex ecological dynamics from time series using state-space universal dynamic equations | Oct 11, 2024 | Time Series | Code Available | 1 |
| DA-Ada: Learning Domain-Aware Adapter for Domain Adaptive Object Detection | Oct 11, 2024 | General Knowledge, object-detection | Code Available | 1 |
| CrackSegDiff: Diffusion Probability Model-based Multi-modal Crack Segmentation | Oct 10, 2024 | Crack Segmentation, Denoising | Code Available | 1 |
| Divide and Translate: Compositional First-Order Logic Translation and Verification for Complex Logical Reasoning | Oct 10, 2024 | Language Modelling, Large Language Model | Code Available | 1 |
| Understanding the Interplay between Parametric and Contextual Knowledge for Large Language Models | Oct 10, 2024 | | Code Available | 1 |
| Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines | Oct 10, 2024 | | Code Available | 1 |
| Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining | Oct 10, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| SPA: 3D Spatial-Awareness Enables Effective Embodied Representation | Oct 10, 2024 | GPU, Neural Rendering | Code Available | 1 |