| Ferret: Federated Full-Parameter Tuning at Scale for Large Language Models | Sep 10, 2024 | Computational Efficiencyparameter-efficient fine-tuning | CodeCode Available | 1 |
| Sources of Uncertainty in 3D Scene Reconstruction | Sep 10, 2024 | 3D Reconstruction3D Scene Reconstruction | CodeCode Available | 1 |
| EasyST: A Simple Framework for Spatio-Temporal Prediction | Sep 10, 2024 | Knowledge DistillationPrediction | CodeCode Available | 1 |
| Structure-Aware Single-Source Generalization with Pixel-Level Disentanglement for Joint Optic Disc and Cup Segmentation | Sep 10, 2024 | DisentanglementDomain Generalization | CodeCode Available | 1 |
| ALSS-YOLO: An Adaptive Lightweight Channel Split and Shuffling Network for TIR Wildlife Detection in UAV Imagery | Sep 10, 2024 | | CodeCode Available | 1 |
| Neural Laplacian Operator for 3D Point Clouds | Sep 10, 2024 | 3D geometry | CodeCode Available | 1 |
| Can Large Language Models Unlock Novel Scientific Research Ideas? | Sep 10, 2024 | | CodeCode Available | 1 |
| A Likelihood Ratio-Based Approach to Segmenting Unknown Objects | Sep 10, 2024 | Segmentation | CodeCode Available | 1 |
| Confident Teacher, Confident Student? A Novel User Study Design for Investigating the Didactic Potential of Explanations and their Impact on Uncertainty | Sep 10, 2024 | Experimental DesignExplainable artificial intelligence | CodeCode Available | 1 |
| LEROjD: Lidar Extended Radar-Only Object Detection | Sep 9, 2024 | 3D Object DetectionKnowledge Distillation | CodeCode Available | 1 |
| M3-Jepa: Multimodal Alignment via Multi-directional MoE based on the JEPA framework | Sep 9, 2024 | Computational EfficiencyCross-Modal Retrieval | CodeCode Available | 1 |
| Tele-LLMs: A Series of Specialized Large Language Models for Telecommunications | Sep 9, 2024 | | CodeCode Available | 1 |
| The Unseen AI Disruptions for Power Grids: LLM-Induced Transients | Sep 9, 2024 | | CodeCode Available | 1 |
| Generative Recommender with End-to-End Learnable Item Tokenization | Sep 9, 2024 | DecoderRecommendation Systems | CodeCode Available | 1 |
| OneEdit: A Neural-Symbolic Collaboratively Knowledge Editing System | Sep 9, 2024 | knowledge editingKnowledge Graphs | CodeCode Available | 1 |
| UAVDB: Trajectory-Guided Adaptable Bounding Boxes for UAV Detection | Sep 9, 2024 | 2D Object DetectionDiversity | CodeCode Available | 1 |
| Retrofitting Temporal Graph Neural Networks with Transformer | Sep 9, 2024 | Graph AttentionGraph Sampling | CodeCode Available | 1 |
| Prototype-Driven Multi-Feature Generation for Visible-Infrared Person Re-identification | Sep 9, 2024 | DiversityPerson Re-Identification | CodeCode Available | 1 |
| Early-exit Convolutional Neural Networks | Sep 9, 2024 | | CodeCode Available | 1 |
| DatAasee -- A Metadata-Lake as Metadata Catalog for a Virtual Data-Lake | Sep 9, 2024 | Management | CodeCode Available | 1 |
| A Survey of Multimodal Composite Editing and Retrieval | Sep 9, 2024 | RetrievalSurvey | CodeCode Available | 1 |
| FacialFlowNet: Advancing Facial Optical Flow Estimation with a Diverse Dataset and a Decomposed Model | Sep 9, 2024 | DecoderOptical Flow Estimation | CodeCode Available | 1 |
| Extracting the U.S. building types from OpenStreetMap data | Sep 9, 2024 | Classification | CodeCode Available | 1 |
| KAN-Based Fusion of Dual-Domain for Audio-Driven Facial Landmarks Generation | Sep 9, 2024 | Face GenerationSpeech to Facial Landmark | CodeCode Available | 1 |
| TextToucher: Fine-Grained Text-to-Touch Generation | Sep 9, 2024 | Language ModellingLarge Language Model | CodeCode Available | 1 |
| AbGPT: De Novo Antibody Design via Generative Language Modeling | Sep 9, 2024 | DiversityLanguage Modeling | CodeCode Available | 1 |
| Evaluating Multiview Object Consistency in Humans and Image Models | Sep 9, 2024 | Experimental Design | CodeCode Available | 1 |
| STLM Engineering Report: Dropout | Sep 9, 2024 | Language Modelling | CodeCode Available | 1 |
| Online 3D reconstruction and dense tracking in endoscopic videos | Sep 9, 2024 | 3D Reconstruction3D Scene Reconstruction | CodeCode Available | 1 |
| EndoOmni: Zero-Shot Cross-Dataset Depth Estimation in Endoscopy by Robust Self-Learning from Noisy Labels | Sep 9, 2024 | Depth EstimationSelf-Learning | CodeCode Available | 1 |
| RIRAG: Regulatory Information Retrieval and Answer Generation | Sep 9, 2024 | Answer GenerationInformation Retrieval | CodeCode Available | 1 |
| Celcomen: spatial causal disentanglement for single-cell and tissue perturbation modeling | Sep 9, 2024 | counterfactualDisentanglement | CodeCode Available | 1 |
| Deep Generic Representations for Domain-Generalized Anomalous Sound Detection | Sep 8, 2024 | | CodeCode Available | 1 |
| Insights from Benchmarking Frontier Language Models on Web App Code Generation | Sep 8, 2024 | BenchmarkingCode Generation | CodeCode Available | 1 |
| Towards Patronizing and Condescending Language in Chinese Videos: A Multimodal Dataset and Detector | Sep 8, 2024 | Form | CodeCode Available | 1 |
| Mamba-Enhanced Text-Audio-Video Alignment Network for Emotion Recognition in Conversations | Sep 8, 2024 | Emotion RecognitionMamba | CodeCode Available | 1 |
| PMT: Progressive Mean Teacher via Exploring Temporal Consistency for Semi-Supervised Medical Image Segmentation | Sep 8, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 1 |
| BBS: Bi-directional Bit-level Sparsity for Deep Learning Acceleration | Sep 8, 2024 | Deep LearningQuantization | CodeCode Available | 1 |
| Difference-in-Differences with Multiple Events | Sep 8, 2024 | | CodeCode Available | 1 |
| Can OOD Object Detectors Learn from Foundation Models? | Sep 8, 2024 | Objectobject-detection | CodeCode Available | 1 |
| Time-independent Spiking Neuron via Membrane Potential Estimation for Efficient Spiking Neural Networks | Sep 8, 2024 | Computational Efficiency | CodeCode Available | 1 |
| Dual convolutional neural network with attention for image blind denoising | Sep 8, 2024 | DenoisingImage Denoising | CodeCode Available | 1 |
| READoc: A Unified Benchmark for Realistic Document Structured Extraction | Sep 8, 2024 | | CodeCode Available | 1 |
| Visual Grounding with Multi-modal Conditional Adaptation | Sep 8, 2024 | object-detectionObject Detection | CodeCode Available | 1 |
| PatchAlign:Fair and Accurate Skin Disease Image Classification by Alignment with Clinical Labels | Sep 8, 2024 | Fairnessimage-classification | CodeCode Available | 1 |
| SGSeg: Enabling Text-free Inference in Language-guided Segmentation of Chest X-rays via Self-guidance | Sep 7, 2024 | Image SegmentationPseudo Label | CodeCode Available | 1 |
| Cross-attention Inspired Selective State Space Models for Target Sound Extraction | Sep 7, 2024 | Computational EfficiencyMamba | CodeCode Available | 1 |
| SSFam: Scribble Supervised Salient Object Detection Family | Sep 7, 2024 | DecoderObject | CodeCode Available | 1 |
| Triple equivalence for the emergence of biological intelligence | Sep 7, 2024 | Bayesian InferenceModel Selection | CodeCode Available | 1 |
| Dual-stream Feature Augmentation for Domain Generalization | Sep 7, 2024 | Contrastive LearningDomain Generalization | CodeCode Available | 1 |