| TI-PREGO: Chain of Thought and In-Context Learning for Online Mistake Detection in PRocedural EGOcentric Videos | Nov 4, 2024 | In-Context Learning, Mistake Detection | Code Available | 1 |
| LayerDAG: A Layerwise Autoregressive Diffusion Model for Directed Acyclic Graph Generation | Nov 4, 2024 | Benchmarking, Graph Generation | Code Available | 1 |
| Revisiting K-mer Profile for Effective and Scalable Genome Representation Learning | Nov 4, 2024 | Representation Learning | Code Available | 1 |
| Improving Steering Vectors by Targeting Sparse Autoencoder Features | Nov 4, 2024 | | Code Available | 1 |
| Expanding Sparse Tuning for Low Memory Usage | Nov 4, 2024 | parameter-efficient fine-tuning | Code Available | 1 |
| Context-Informed Machine Translation of Manga using Multimodal Large Language Models | Nov 4, 2024 | Machine Translation, Translation | Code Available | 1 |
| On Targeted Manipulation and Deception when Optimizing LLMs for User Feedback | Nov 4, 2024 | | Code Available | 1 |
| Can Language Models Learn to Skip Steps? | Nov 4, 2024 | | Code Available | 1 |
| The LLM Language Network: A Neuroscientific Approach for Identifying Causally Task-Relevant Units | Nov 4, 2024 | Logical Reasoning | Code Available | 1 |
| Multi-Transmotion: Pre-trained Model for Human Motion Prediction | Nov 4, 2024 | Human motion prediction, motion prediction | Code Available | 1 |
| TeleOracle: Fine-Tuned Retrieval-Augmented Generation with Long-Context Support for Network | Nov 4, 2024 | Chunking, Language Modelling | Code Available | 1 |
| Constrained Human-AI Cooperation: An Inclusive Embodied Social Intelligence Challenge | Nov 4, 2024 | | Code Available | 1 |
| Continual LLaVA: Continual Instruction Tuning in Large Vision-Language Models | Nov 4, 2024 | | Code Available | 1 |
| VQ-Map: Bird's-Eye-View Map Layout Estimation in Tokenized Discrete Space via Vector Quantization | Nov 3, 2024 | Quantization, Representation Learning | Code Available | 1 |
| Classifier-guided Gradient Modulation for Enhanced Multimodal Learning | Nov 3, 2024 | | Code Available | 1 |
| Polar R-CNN: End-to-End Lane Detection with Fewer Anchors | Nov 3, 2024 | Autonomous Driving, Lane Detection | Code Available | 1 |
| ROAD-Waymo: Action Awareness at Scale for Autonomous Driving | Nov 3, 2024 | Autonomous Driving, Benchmarking | Code Available | 1 |
| Activating Self-Attention for Multi-Scene Absolute Pose Regression | Nov 3, 2024 | Camera Pose Estimation, Pose Estimation | Code Available | 1 |
| EDformer: Transformer-Based Event Denoising Across Varied Noise Levels | Nov 3, 2024 | Denoising | Code Available | 1 |
| Conditional Controllable Image Fusion | Nov 3, 2024 | Denoising | Code Available | 1 |
| Large-Scale Multi-Robot Coverage Path Planning on Grids with Path Deconfliction | Nov 3, 2024 | Multi-Agent Path Finding | Code Available | 1 |
| Co-clustering for Federated Recommender System | Nov 3, 2024 | Clustering, Collaborative Filtering | Code Available | 1 |
| FactorizePhys: Matrix Factorization for Multidimensional Attention in Remote Physiological Sensing | Nov 3, 2024 | | Code Available | 1 |
| GraphXForm: Graph transformer for computer-aided molecular design | Nov 3, 2024 | Drug Design, Drug Discovery | Code Available | 1 |
| LinRec: Linear Attention Mechanism for Long-term Sequential Recommender Systems | Nov 3, 2024 | Recommendation Systems, Sequential Recommendation | Code Available | 1 |
| Rethinking Weight Decay for Robust Fine-Tuning of Foundation Models | Nov 3, 2024 | | Code Available | 1 |
| Explaining and Improving Contrastive Decoding by Extrapolating the Probabilities of a Huge and Hypothetical LM | Nov 3, 2024 | LAMBADA, Text Generation | Code Available | 1 |
| HeightMapNet: Explicit Height Modeling for End-to-End HD Map Learning | Nov 3, 2024 | | Code Available | 1 |
| Task-Aware Harmony Multi-Task Decision Transformer for Offline Reinforcement Learning | Nov 2, 2024 | Meta-Learning | Code Available | 1 |
| MonoPlane: Exploiting Monocular Geometric Cues for Generalizable 3D Plane Reconstruction | Nov 2, 2024 | 3D Plane Detection | Code Available | 1 |
| Leveraging LLM and Text-Queried Separation for Noise-Robust Sound Event Detection | Nov 2, 2024 | Audio Source Separation, Event Detection | Code Available | 1 |
| Test-Time Adaptation in Point Clouds: Leveraging Sampling Variation with Weight Averaging | Nov 2, 2024 | 3D Point Cloud Classification, Point Cloud Classification | Code Available | 1 |
| CmdCaliper: A Semantic-Aware Command-Line Embedding Model and Dataset for Security Research | Nov 2, 2024 | Line Detection, Semantic Similarity | Code Available | 1 |
| Visual Fourier Prompt Tuning | Nov 2, 2024 | Visual Prompt Tuning | Code Available | 1 |
| Use Digital Twins to Support Fault Diagnosis From System-level Condition-monitoring Data | Nov 2, 2024 | Deep Learning, Fault Diagnosis | Code Available | 1 |
| AutoPT: How Far Are We from the End2End Automated Web Penetration Testing? | Nov 2, 2024 | | Code Available | 1 |
| Fast and Memory-Efficient Video Diffusion Using Streamlined Inference | Nov 2, 2024 | GPU, Video Generation | Code Available | 1 |
| What Features in Prompts Jailbreak LLMs? Investigating the Mechanisms Behind Attacks | Nov 2, 2024 | | Code Available | 1 |
| TaxaBind: A Unified Embedding Space for Ecological Applications | Nov 1, 2024 | Audio Classification, Cross-Modal Retrieval | Code Available | 1 |
| Beyond Utility: Evaluating LLM as Recommender | Nov 1, 2024 | Position, Re-Ranking | Code Available | 1 |
| PatternBoost: Constructions in Mathematics with a Little Help from AI | Nov 1, 2024 | | Code Available | 1 |
| MetaMetrics-MT: Tuning Meta-Metrics for Machine Translation via Human Preference Calibration | Nov 1, 2024 | Bayesian Optimization, Gaussian Processes | Code Available | 1 |
| LogiCity: Advancing Neuro-Symbolic AI with Abstract Urban Simulation | Nov 1, 2024 | Logical Reasoning, Sequential Decision Making | Code Available | 1 |
| Abstracted Shapes as Tokens -- A Generalizable and Interpretable Model for Time-series Classification | Nov 1, 2024 | Quantization, Representation Learning | Code Available | 1 |
| LIBMoE: A Library for comprehensive benchmarking Mixture of Experts in Large Language Models | Nov 1, 2024 | Benchmarking, Mixture-of-Experts | Code Available | 1 |
| Attention Tracker: Detecting Prompt Injection Attacks in LLMs | Nov 1, 2024 | | Code Available | 1 |
| Self-Evolved Reward Learning for LLMs | Nov 1, 2024 | | Code Available | 1 |
| C2A: Client-Customized Adaptation for Parameter-Efficient Federated Learning | Nov 1, 2024 | Federated Learning, parameter-efficient fine-tuning | Code Available | 1 |
| Constant Acceleration Flow | Nov 1, 2024 | | Code Available | 1 |
| Contrasting with Symile: Simple Model-Agnostic Representation Learning for Unlimited Modalities | Nov 1, 2024 | Contrastive Learning, Representation Learning | Code Available | 1 |