| Toward Enhancing Vehicle Color Recognition in Adverse Conditions: A Dataset and Benchmark | Aug 21, 2024 | AttributeFine-Grained Vehicle Classification | CodeCode Available | 1 |
| Low-Light Object Tracking: A Benchmark | Aug 21, 2024 | ObjectObject Tracking | CodeCode Available | 1 |
| Robust 3D Gaussian Splatting for Novel View Synthesis in Presence of Distractors | Aug 21, 2024 | Novel View Synthesis | CodeCode Available | 1 |
| Great Memory, Shallow Reasoning: Limits of kNN-LMs | Aug 21, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| CHOTA: A Higher Order Accuracy Metric for Cell Tracking | Aug 21, 2024 | Cell TrackingMultiple Object Tracking | CodeCode Available | 1 |
| UniFashion: A Unified Vision-Language Model for Multimodal Fashion Retrieval and Generation | Aug 21, 2024 | Image GenerationImage Retrieval | CodeCode Available | 1 |
| FUSELOC: Fusing Global and Local Descriptors to Disambiguate 2D-3D Matching in Visual Localization | Aug 21, 2024 | Visual Localization | CodeCode Available | 1 |
| Sum of Squares Circuits | Aug 21, 2024 | | CodeCode Available | 1 |
| TWLV-I: Analysis and Insights from Holistic Evaluation on Video Foundation Models | Aug 21, 2024 | Action RecognitionEmbeddings Evaluation | CodeCode Available | 1 |
| MSCPT: Few-shot Whole Slide Image Classification with Multi-scale and Context-focused Prompt Tuning | Aug 21, 2024 | image-classificationImage Classification | CodeCode Available | 1 |
| Interpretable Long-term Action Quality Assessment | Aug 21, 2024 | Action Quality AssessmentDiversity | CodeCode Available | 1 |
| A Benchmark for AI-based Weather Data Assimilation | Aug 21, 2024 | Weather Forecasting | CodeCode Available | 1 |
| NuSegDG: Integration of Heterogeneous Space and Gaussian Kernel for Domain-Generalized Nuclei Segmentation | Aug 21, 2024 | DecoderDomain Generalization | CodeCode Available | 1 |
| CoPRA: Bridging Cross-domain Pretrained Sequence Models with Complex Structures for Protein-RNA Binding Affinity Prediction | Aug 21, 2024 | Drug DesignPrediction | CodeCode Available | 1 |
| OAPT: Offset-Aware Partition Transformer for Double JPEG Artifacts Removal | Aug 21, 2024 | Image Restoration | CodeCode Available | 1 |
| V-RoAst: Visual Road Assessment. Can VLM be a Road Safety Assessor Using the iRAP Standard? | Aug 20, 2024 | Few-Shot LearningIn-Context Learning | CodeCode Available | 1 |
| A toolbox for calculating objective image properties in aesthetics research | Aug 20, 2024 | | CodeCode Available | 1 |
| Security Attacks on LLM-based Code Completion Tools | Aug 20, 2024 | Code Completion | CodeCode Available | 1 |
| Makeup-Guided Facial Privacy Protection via Untrained Neural Network Priors | Aug 20, 2024 | DecoderFace Recognition | CodeCode Available | 1 |
| OpenScan: A Benchmark for Generalized Open-Vocabulary 3D Scene Understanding | Aug 20, 2024 | ObjectScene Understanding | CodeCode Available | 1 |
| Event Stream based Sign Language Translation: A High-Definition Benchmark Dataset and A New Algorithm | Aug 20, 2024 | MambaSign Language Translation | CodeCode Available | 1 |
| Training Matting Models without Alpha Labels | Aug 20, 2024 | 2kImage Matting | CodeCode Available | 1 |
| Generalizable Facial Expression Recognition | Aug 20, 2024 | Domain AdaptationFacial Expression Recognition | CodeCode Available | 1 |
| Navigating Spatio-Temporal Heterogeneity: A Graph Transformer Approach for Traffic Forecasting | Aug 20, 2024 | AttributeMixture-of-Experts | CodeCode Available | 1 |
| Neural Exploratory Landscape Analysis for Meta-Black-Box-Optimization | Aug 20, 2024 | | CodeCode Available | 1 |
| Subspace Prototype Guidance for Mitigating Class Imbalance in Point Cloud Semantic Segmentation | Aug 20, 2024 | SegmentationSemantic Segmentation | CodeCode Available | 1 |
| MUSES: 3D-Controllable Image Generation via Multi-Modal Agent Collaboration | Aug 20, 2024 | Image GenerationText to Image Generation | CodeCode Available | 1 |
| Multi-view Hand Reconstruction with a Point-Embedded Transformer | Aug 20, 2024 | | CodeCode Available | 1 |
| CrossFi: A Cross Domain Wi-Fi Sensing Framework Based on Siamese Network | Aug 20, 2024 | Action ClassificationAction Classification (1-shot) | CodeCode Available | 1 |
| TDS-CLIP: Temporal Difference Side Network for Image-to-Video Transfer Learning | Aug 20, 2024 | Action Recognitionparameter-efficient fine-tuning | CodeCode Available | 1 |
| DAAD: Dynamic Analysis and Adaptive Discriminator for Fake News Detection | Aug 20, 2024 | Fake News DetectionImage Manipulation | CodeCode Available | 1 |
| Recurrent Neural Networks Learn to Store and Generate Sequences using Non-Linear Representations | Aug 20, 2024 | Position | CodeCode Available | 1 |
| EPiC: Cost-effective Search-based Prompt Engineering of LLMs for Code Generation | Aug 20, 2024 | Code GenerationPrompt Engineering | CodeCode Available | 1 |
| Hologram Reasoning for Solving Algebra Problems with Geometry Diagrams | Aug 20, 2024 | Deep Reinforcement LearningModel Selection | CodeCode Available | 1 |
| SubgoalXL: Subgoal-based Expert Learning for Theorem Proving | Aug 20, 2024 | Automated Theorem Proving | CodeCode Available | 1 |
| An Efficient Sign Language Translation Using Spatial Configuration and Motion Dynamics with LLMs | Aug 20, 2024 | Gloss-free Sign Language TranslationSign Language Translation | CodeCode Available | 1 |
| SenPa-MAE: Sensor Parameter Aware Masked Autoencoder for Multi-Satellite Self-Supervised Pretraining | Aug 20, 2024 | Data AugmentationDiversity | CodeCode Available | 1 |
| SysBench: Can Large Language Models Follow System Messages? | Aug 20, 2024 | | CodeCode Available | 1 |
| Hierarchical Retrieval-Augmented Generation Model with Rethink for Multi-hop Question Answering | Aug 20, 2024 | Multi-hop Question AnsweringQuestion Answering | CodeCode Available | 1 |
| Prompt-Agnostic Adversarial Perturbation for Customized Diffusion Models | Aug 20, 2024 | Image GenerationText to Image Generation | CodeCode Available | 1 |
| MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval | Aug 20, 2024 | MambaNatural Language Queries | CodeCode Available | 1 |
| HiRED: Attention-Guided Token Dropping for Efficient Inference of High-Resolution Vision-Language Models | Aug 20, 2024 | GPULanguage Modelling | CodeCode Available | 1 |
| Language Modeling on Tabular Data: A Survey of Foundations, Techniques and Evolution | Aug 20, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Wave-Mask/Mix: Exploring Wavelet-Based Augmentations for Time Series Forecasting | Aug 20, 2024 | Data AugmentationTime Series | CodeCode Available | 1 |
| ViLReF: An Expert Knowledge Enabled Vision-Language Retinal Foundation Model | Aug 20, 2024 | DiagnosticTransfer Learning | CodeCode Available | 1 |
| MPL: Lifting 3D Human Pose from Multi-view 2D Poses | Aug 20, 2024 | 2D Pose EstimationPose Estimation | CodeCode Available | 1 |
| Task-level Distributionally Robust Optimization for Large Language Model-based Dense Retrieval | Aug 20, 2024 | Domain GeneralizationLanguage Modeling | CodeCode Available | 1 |
| Prompt-Guided Image-Adaptive Neural Implicit Lookup Tables for Interpretable Image Enhancement | Aug 20, 2024 | Image EnhancementLanguage Modeling | CodeCode Available | 1 |
| CHECKWHY: Causal Fact Verification via Argument Structure | Aug 20, 2024 | Fact VerificationLogical Reasoning | CodeCode Available | 1 |
| Ferret: Faster and Effective Automated Red Teaming with Reward-Based Scoring Technique | Aug 20, 2024 | AI and SafetyDiversity | CodeCode Available | 1 |