| Rethinking HTG Evaluation: Bridging Generation and Recognition | Sep 4, 2024 | Diversity, Handwriting generation | Code Available | 1 |
| Exploring Low-Dimensional Subspaces in Diffusion Models for Controllable Image Editing | Sep 4, 2024 | Image Generation | Code Available | 1 |
| "Yes, My LoRD." Guiding Language Model Extraction with Locality Reinforced Distillation | Sep 4, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Explainable AI for computational pathology identifies model limitations and tissue biomarkers | Sep 4, 2024 | Bias Detection, counterfactual | Code Available | 1 |
| iRangeGraph: Improvising Range-dedicated Graphs for Range-filtering Nearest Neighbor Search | Sep 4, 2024 | | Code Available | 1 |
| HiPrompt: Tuning-free Higher-Resolution Generation with Hierarchical MLLM Prompts | Sep 4, 2024 | 4k, Denoising | Code Available | 1 |
| NUDGE: Lightweight Non-Parametric Fine-Tuning of Embeddings for Retrieval | Sep 4, 2024 | Image Retrieval, RAG | Code Available | 1 |
| RTLRewriter: Methodologies for Large Models aided RTL Code Optimization | Sep 4, 2024 | Benchmarking | Code Available | 1 |
| NESTFUL: A Benchmark for Evaluating LLMs on Nested Sequences of API Calls | Sep 4, 2024 | | Code Available | 1 |
| Evaluation Study on SAM 2 for Class-agnostic Instance-level Segmentation | Sep 4, 2024 | Dichotomous Image Segmentation, Image Segmentation | Code Available | 1 |
| TASAR: Transfer-based Attack on Skeletal Action Recognition | Sep 4, 2024 | Action Recognition, Activity Recognition | Code Available | 1 |
| RouterRetriever: Routing over a Mixture of Expert Embedding Models | Sep 4, 2024 | Information Retrieval, Language Modeling | Code Available | 1 |
| Topological Methods in Machine Learning: A Tutorial for Practitioners | Sep 4, 2024 | | Code Available | 1 |
| Snapshot: Towards Application-centered Models for Pedestrian Trajectory Prediction in Urban Traffic Environments | Sep 3, 2024 | Autonomous Driving, Pedestrian Trajectory Prediction | Code Available | 1 |
| FC-KAN: Function Combinations in Kolmogorov-Arnold Networks | Sep 3, 2024 | Image Classification, Kolmogorov-Arnold Networks | Code Available | 1 |
| Towards Real-World Adverse Weather Image Restoration: Enhancing Clearness and Semantics with Vision-Language Models | Sep 3, 2024 | Image Restoration, Language Modeling | Code Available | 1 |
| LSTMSE-Net: Long Short Term Speech Enhancement Network for Audio-visual Speech Enhancement | Sep 3, 2024 | Decoder, Speech Enhancement | Code Available | 1 |
| UNSURE: self-supervised learning with Unknown Noise level and Stein's Unbiased Risk Estimate | Sep 3, 2024 | Image Reconstruction, Self-Supervised Learning | Code Available | 1 |
| Frequency-Spatial Entanglement Learning for Camouflaged Object Detection | Sep 3, 2024 | Object, object-detection | Code Available | 1 |
| FuzzCoder: Byte-level Fuzzing Test via Large Language Model | Sep 3, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Designing Large Foundation Models for Efficient Training and Inference: A Survey | Sep 3, 2024 | Knowledge Distillation, Model Compression | Code Available | 1 |
| EvoChart: A Benchmark and a Self-Training Approach Towards Real-World Chart Understanding | Sep 3, 2024 | Chart Understanding | Code Available | 1 |
| VProChart: Answering Chart Question through Visual Perception Alignment Agent and Programmatic Solution Reasoning | Sep 3, 2024 | Chart Question Answering, Data Visualization | Code Available | 1 |
| PMLBmini: A Tabular Classification Benchmark Suite for Data-Scarce Applications | Sep 3, 2024 | AutoML, Binary Classification | Code Available | 1 |
| LUK: Empowering Log Understanding with Expert Knowledge from Large Language Models | Sep 3, 2024 | | Code Available | 1 |
| Generative Principal Component Regression via Variational Inference | Sep 3, 2024 | regression, Variational Inference | Code Available | 1 |
| Decoding finger velocity from cortical spike trains with recurrent spiking neural networks | Sep 3, 2024 | Low-latency processing | Code Available | 1 |
| Training on the Benchmark Is Not All You Need | Sep 3, 2024 | All, Multiple-choice | Code Available | 1 |
| Map-Assisted Remote-Sensing Image Compression at Extremely Low Bitrates | Sep 3, 2024 | Image Compression | Code Available | 1 |
| LongGenBench: Benchmarking Long-Form Generation in Long Context LLMs | Sep 3, 2024 | 16k, Benchmarking | Code Available | 1 |
| Booster: Tackling Harmful Fine-tuning for Large Language Models via Attenuating Harmful Perturbation | Sep 3, 2024 | | Code Available | 1 |
| SFA-Net: Semantic Feature Adjustment Network for Remote Sensing Image Segmentation | Sep 3, 2024 | Change Detection, Decoder | Code Available | 1 |
| Unveiling Advanced Frequency Disentanglement Paradigm for Low-Light Image Enhancement | Sep 3, 2024 | Disentanglement, Image Enhancement | Code Available | 1 |
| GeoBEV: Learning Geometric BEV Representation for Multi-view 3D Object Detection | Sep 3, 2024 | 3D Object Detection, object-detection | Code Available | 1 |
| Latent Distillation for Continual Object Detection at the Edge | Sep 3, 2024 | Class-Incremental Object Detection, Continual Learning | Code Available | 1 |
| Early Design Exploration of Aerospace Systems Using Assume-Guarantee Contracts | Sep 3, 2024 | Management | Code Available | 1 |
| Mahalanobis Distance-based Multi-view Optimal Transport for Multi-view Crowd Localization | Sep 3, 2024 | Multiview Detection | Code Available | 1 |
| SPiKE: 3D Human Pose from Point Cloud Sequences | Sep 3, 2024 | 3D Human Pose Estimation, Pose Estimation | Code Available | 1 |
| CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval and Augmentation | Sep 3, 2024 | Dataset Generation, Question Answering | Code Available | 1 |
| What are the Essential Factors in Crafting Effective Long Context Multi-Hop Instruction Datasets? Insights and Best Practices | Sep 3, 2024 | Question Answering, Question Generation | Code Available | 1 |
| CLIBE: Detecting Dynamic Backdoors in Transformer-based NLP Models | Sep 2, 2024 | Text Classification, Text Generation | Code Available | 1 |
| Real-Time Recurrent Learning using Trace Units in Reinforcement Learning | Sep 2, 2024 | reinforcement-learning, Reinforcement Learning | Code Available | 1 |
| Towards Student Actions in Classroom Scenes: New Dataset and Baseline | Sep 2, 2024 | Action Detection, Benchmarking | Code Available | 1 |
| AMG: Avatar Motion Guided Video Generation | Sep 2, 2024 | Video Generation | Code Available | 1 |
| Co-Learning: Code Learning for Multi-Agent Reinforcement Collaborative Framework with Conversational Natural Language Interfaces | Sep 2, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Solving Integrated Process Planning and Scheduling Problem via Graph Neural Network Based Deep Reinforcement Learning | Sep 2, 2024 | Deep Reinforcement Learning, Graph Neural Network | Code Available | 1 |
| Recoverable Compression: A Multimodal Vision Token Recovery Mechanism Guided by Text Information | Sep 2, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Diffusion-Driven Data Replay: A Novel Approach to Combat Forgetting in Federated Class Continual Learning | Sep 2, 2024 | Continual Learning, Contrastive Learning | Code Available | 1 |
| The Compressor-Retriever Architecture for Language Model OS | Sep 2, 2024 | CPU, In-Context Learning | Code Available | 1 |
| Prompt Compression with Context-Aware Sentence Encoding for Fast and Improved LLM Inference | Sep 2, 2024 | Computational Efficiency, Sentence | Code Available | 1 |