| Community Forensics: Using Thousands of Generators to Train Fake Image Detectors | Nov 6, 2024 | Diversity | Code Available | 1 |
| MEG: Medical Knowledge-Augmented Large Language Models for Question Answering | Nov 6, 2024 | Knowledge Graph Embeddings, Multiple-choice | Code Available | 1 |
| The Recurrent Sticky Hierarchical Dirichlet Process Hidden Markov Model | Nov 6, 2024 | | Code Available | 1 |
| Beyond Model Adaptation at Test Time: A Survey | Nov 6, 2024 | Domain Adaptation, Domain Generalization | Code Available | 1 |
| Learning Generalizable Policy for Obstacle-Aware Autonomous Drone Racing | Nov 6, 2024 | Deep Reinforcement Learning, Drone navigation | Code Available | 1 |
| Energy-based physics-informed neural network for frictionless contact problems under large deformation | Nov 6, 2024 | Computational Efficiency, Contact mechanics | Code Available | 1 |
| PocoLoco: A Point Cloud Diffusion Model of Human Shape in Loose Clothing | Nov 6, 2024 | Denoising, Human Animation | Code Available | 1 |
| Number Cookbook: Number Understanding of Language Models and How to Improve It | Nov 6, 2024 | | Code Available | 1 |
| RaVL: Discovering and Mitigating Spurious Correlations in Fine-Tuned Vision-Language Models | Nov 6, 2024 | image-classification, Image Classification | Code Available | 1 |
| DiMSUM: Diffusion Mamba -- A Scalable and Unified Spatial-Frequency Method for Image Generation | Nov 6, 2024 | Image Generation, Inductive Bias | Code Available | 1 |
| MambaPEFT: Exploring Parameter-Efficient Fine-Tuning for Mamba | Nov 6, 2024 | Mamba, parameter-efficient fine-tuning | Code Available | 1 |
| Efficient Feature Aggregation and Scale-Aware Regression for Monocular 3D Object Detection | Nov 5, 2024 | 3D Object Detection, Monocular 3D Object Detection | Code Available | 1 |
| Time-Causal VAE: Robust Financial Time Series Generator | Nov 5, 2024 | Decoder, Stochastic Optimization | Code Available | 1 |
| MetRex: A Benchmark for Verilog Code Metric Reasoning Using LLMs | Nov 5, 2024 | Bug fixing, Code Generation | Code Available | 1 |
| Membership Inference Attacks against Large Vision-Language Models | Nov 5, 2024 | Inference Attack, Membership Inference Attack | Code Available | 1 |
| Privacy-Preserving Graph-Based Machine Learning with Fully Homomorphic Encryption for Collaborative Anti-Money Laundering | Nov 5, 2024 | Computational Efficiency, Graph Neural Network | Code Available | 1 |
| Generative Artificial Intelligence Meets Synthetic Aperture Radar: A Survey | Nov 5, 2024 | Survey | Code Available | 1 |
| SMoA: Improving Multi-agent Large Language Models with Sparse Mixture-of-Agents | Nov 5, 2024 | Diversity, Fairness | Code Available | 1 |
| Adversarial multi-task underwater acoustic target recognition: towards robustness against various influential factors | Nov 5, 2024 | | Code Available | 1 |
| GitChameleon: Unmasking the Version-Switching Capabilities of Code Generation Models | Nov 5, 2024 | Code Completion, Code Generation | Code Available | 1 |
| Inference Optimal VLMs Need Fewer Visual Tokens and More Parameters | Nov 5, 2024 | Token Reduction, Visual Reasoning | Code Available | 1 |
| CRT-Fusion: Camera, Radar, Temporal Fusion Using Motion Information for 3D Object Detection | Nov 5, 2024 | 3D Object Detection, Autonomous Vehicles | Code Available | 1 |
| PACE: Pacing Operator Learning to Accurate Optical Field Simulation for Complicated Photonic Devices | Nov 5, 2024 | Operator learning | Code Available | 1 |
| Rethinking Decoders for Transformer-based Semantic Segmentation: A Compression Perspective | Nov 5, 2024 | Decoder, Segmentation | Code Available | 1 |
| Grounding Natural Language to SQL Translation with Data-Based Self-Explanations | Nov 5, 2024 | Translation | Code Available | 1 |
| Self-Calibrated Tuning of Vision-Language Models for Out-of-Distribution Detection | Nov 5, 2024 | Out-of-Distribution Detection, Out of Distribution (OOD) Detection | Code Available | 1 |
| LiVOS: Light Video Object Segmentation with Gated Linear Matching | Nov 5, 2024 | GPU, Semantic Segmentation | Code Available | 1 |
| Label Critic: Design Data Before Models | Nov 5, 2024 | | Code Available | 1 |
| Benchmarking Vision Language Model Unlearning via Fictitious Facial Identity Dataset | Nov 5, 2024 | Benchmarking, Language Modeling | Code Available | 1 |
| Breaking the Reclustering Barrier in Centroid-based Deep Clustering | Nov 4, 2024 | Clustering, Deep Clustering | Code Available | 1 |
| Improving Steering Vectors by Targeting Sparse Autoencoder Features | Nov 4, 2024 | | Code Available | 1 |
| QCS: Feature Refining from Quadruplet Cross Similarity for Facial Expression Recognition | Nov 4, 2024 | Facial Expression Recognition, Facial Expression Recognition (FER) | Code Available | 1 |
| Zebra-Llama: A Context-Aware Large Language Model for Democratizing Rare Disease Knowledge | Nov 4, 2024 | Diagnostic, Language Modeling | Code Available | 1 |
| Multi-Transmotion: Pre-trained Model for Human Motion Prediction | Nov 4, 2024 | Human motion prediction, motion prediction | Code Available | 1 |
| MILU: A Multi-task Indic Language Understanding Benchmark | Nov 4, 2024 | Multiple-choice, Question Answering | Code Available | 1 |
| TeleOracle: Fine-Tuned Retrieval-Augmented Generation with Long-Context Support for Network | Nov 4, 2024 | Chunking, Language Modelling | Code Available | 1 |
| PIAST: A Multimodal Piano Dataset with Audio, Symbolic and Text | Nov 4, 2024 | Information Retrieval, Music Information Retrieval | Code Available | 1 |
| The LLM Language Network: A Neuroscientific Approach for Identifying Causally Task-Relevant Units | Nov 4, 2024 | Logical Reasoning | Code Available | 1 |
| On Targeted Manipulation and Deception when Optimizing LLMs for User Feedback | Nov 4, 2024 | | Code Available | 1 |
| Learning to Assist Humans without Inferring Rewards | Nov 4, 2024 | Chatbot, reinforcement-learning | Code Available | 1 |
| Not Just Object, But State: Compositional Incremental Learning without Forgetting | Nov 4, 2024 | Diversity, Incremental Learning | Code Available | 1 |
| Sparsing Law: Towards Large Language Models with Greater Activation Sparsity | Nov 4, 2024 | | Code Available | 1 |
| Continual LLaVA: Continual Instruction Tuning in Large Vision-Language Models | Nov 4, 2024 | | Code Available | 1 |
| Bridge-IF: Learning Inverse Protein Folding with Markov Bridges | Nov 4, 2024 | Protein Design, Protein Folding | Code Available | 1 |
| Benchmarking Vision, Language, & Action Models on Robotic Learning Tasks | Nov 4, 2024 | Action Generation, Benchmarking | Code Available | 1 |
| GraphXAIN: Narratives to Explain Graph Neural Networks | Nov 4, 2024 | Descriptive, Feature Importance | Code Available | 1 |
| Constrained Human-AI Cooperation: An Inclusive Embodied Social Intelligence Challenge | Nov 4, 2024 | | Code Available | 1 |
| Expanding Sparse Tuning for Low Memory Usage | Nov 4, 2024 | parameter-efficient fine-tuning | Code Available | 1 |
| Can Language Models Learn to Skip Steps? | Nov 4, 2024 | | Code Available | 1 |
| Unified Speech Recognition: A Single Model for Auditory, Visual, and Audiovisual Inputs | Nov 4, 2024 | Lipreading, speech-recognition | Code Available | 1 |