| A is for Absorption: Studying Feature Splitting and Absorption in Sparse Autoencoders | Sep 22, 2024 | | CodeCode Available | 1 |
| PISR: Polarimetric Neural Implicit Surface Reconstruction for Textureless and Specular Objects | Sep 22, 2024 | Surface Reconstruction | CodeCode Available | 1 |
| TabGraphs: A Benchmark and Strong Baselines for Learning on Graphs with Tabular Node Features | Sep 22, 2024 | | CodeCode Available | 1 |
| Lidar Panoptic Segmentation in an Open World | Sep 22, 2024 | Autonomous VehiclesInstance Segmentation | CodeCode Available | 1 |
| Towards Model-Agnostic Dataset Condensation by Heterogeneous Models | Sep 22, 2024 | Dataset Condensation | CodeCode Available | 1 |
| MQM-APE: Toward High-Quality Error Annotation Predictors with Automatic Post-Editing in LLM Translation Evaluators | Sep 22, 2024 | Automatic Post-EditingMachine Translation | CodeCode Available | 1 |
| What Are They Doing? Joint Audio-Speech Co-Reasoning | Sep 22, 2024 | | CodeCode Available | 1 |
| UU-Mamba: Uncertainty-aware U-Mamba for Cardiovascular Segmentation | Sep 22, 2024 | MambaSegmentation | CodeCode Available | 1 |
| BurstM: Deep Burst Multi-scale SR using Fourier Space with Optical Flow | Sep 21, 2024 | Burst Image Super-ResolutionImage Super-Resolution | CodeCode Available | 1 |
| Instruction Following without Instruction Tuning | Sep 21, 2024 | Instruction FollowingLanguage Modeling | CodeCode Available | 1 |
| MSDet: Receptive Field Enhanced Multiscale Detection for Tiny Pulmonary Nodule | Sep 21, 2024 | Lung Cancer Diagnosisobject-detection | CodeCode Available | 1 |
| StateAct: State Tracking and Reasoning for Acting and Planning with Large Language Models | Sep 21, 2024 | In-Context Learning | CodeCode Available | 1 |
| GAInS: Gradient Anomaly-aware Biomedical Instance Segmentation | Sep 21, 2024 | Instance SegmentationSegmentation | CodeCode Available | 1 |
| Content-aware Tile Generation using Exterior Boundary Inpainting | Sep 21, 2024 | Diversity | CodeCode Available | 1 |
| Accelerated Multi-Contrast MRI Reconstruction via Frequency and Spatial Mutual Learning | Sep 21, 2024 | MRI Reconstruction | CodeCode Available | 1 |
| BRep Boundary and Junction Detection for CAD Reverse Engineering | Sep 21, 2024 | Junction Detection | CodeCode Available | 1 |
| ChronoGAN: Supervised and Embedded Generative Adversarial Networks for Time Series Generation | Sep 21, 2024 | Time SeriesTime Series Generation | CodeCode Available | 1 |
| PromptTA: Prompt-driven Text Adapter for Source-free Domain Generalization | Sep 21, 2024 | Domain GeneralizationSource-free Domain Generalization | CodeCode Available | 1 |
| ChemEval: A Comprehensive Multi-Level Chemical Evaluation for Large Language Models | Sep 21, 2024 | Few-Shot LearningInstruction Following | CodeCode Available | 1 |
| SURf: Teaching Large Vision-Language Models to Selectively Utilize Retrieved Information | Sep 21, 2024 | RAGRetrieval-augmented Generation | CodeCode Available | 1 |
| FracGM: A Fast Fractional Programming Technique for Geman-McClure Robust Estimator | Sep 21, 2024 | Point Cloud Registration | CodeCode Available | 1 |
| LLMs Still Can't Plan; Can LRMs? A Preliminary Evaluation of OpenAI's o1 on PlanBench | Sep 20, 2024 | | CodeCode Available | 1 |
| FAIR GPT: A virtual consultant for research data management in ChatGPT | Sep 20, 2024 | FairnessHallucination | CodeCode Available | 1 |
| Leveraging Text Localization for Scene Text Removal via Text-aware Masked Image Modeling | Sep 20, 2024 | Text Detection | CodeCode Available | 1 |
| Advancing Event Causality Identification via Heuristic Semantic Dependency Inquiry Network | Sep 20, 2024 | Event Causality Identification | CodeCode Available | 1 |
| "I Never Said That": A dataset, taxonomy and baselines on response clarity classification | Sep 20, 2024 | | CodeCode Available | 1 |
| Alternate Preference Optimization for Unlearning Factual Knowledge in Large Language Models | Sep 20, 2024 | Machine Unlearning | CodeCode Available | 1 |
| YesBut: A High-Quality Annotated Multimodal Dataset for evaluating Satire Comprehension capability of Vision-Language Models | Sep 20, 2024 | BenchmarkingImage Captioning | CodeCode Available | 1 |
| Temporally Aligned Audio for Video with Autoregression | Sep 20, 2024 | Audio GenerationVideo-to-Sound Generation | CodeCode Available | 1 |
| AVG-LLaVA: A Large Multimodal Model with Adaptive Visual Granularity | Sep 20, 2024 | Avg | CodeCode Available | 1 |
| Demystifying and Extracting Fault-indicating Information from Logs for Failure Diagnosis | Sep 20, 2024 | Anomaly DetectionFault Diagnosis | CodeCode Available | 1 |
| Neural-Symbolic Collaborative Distillation: Advancing Small Language Models for Complex Reasoning Tasks | Sep 20, 2024 | ARCGSM8K | CodeCode Available | 1 |
| OneBEV: Using One Panoramic Image for Bird's-Eye-View Semantic Mapping | Sep 20, 2024 | Autonomous DrivingMamba | CodeCode Available | 1 |
| Federated Learning with Label-Masking Distillation | Sep 20, 2024 | Federated LearningPrivacy Preserving | CodeCode Available | 1 |
| MaPPER: Multimodal Prior-guided Parameter Efficient Tuning for Referring Expression Comprehension | Sep 20, 2024 | cross-modal alignmentReferring Expression | CodeCode Available | 1 |
| Efficient and Discriminative Image Feature Extraction for Universal Image Retrieval | Sep 20, 2024 | Image RetrievalMetric Learning | CodeCode Available | 1 |
| A preliminary study on continual learning in computer vision using Kolmogorov-Arnold Networks | Sep 20, 2024 | class-incremental learningClass Incremental Learning | CodeCode Available | 1 |
| Contextual Compression in Retrieval-Augmented Generation for Large Language Models: A Survey | Sep 20, 2024 | RAGRetrieval | CodeCode Available | 1 |
| PlainUSR: Chasing Faster ConvNet for Efficient Super-Resolution | Sep 20, 2024 | Super-Resolution | CodeCode Available | 1 |
| Intrinsic Single-Image HDR Reconstruction | Sep 20, 2024 | HDR ReconstructionSingle-Image-Based Hdr Reconstruction | CodeCode Available | 1 |
| Instruction-guided Multi-Granularity Segmentation and Captioning with Large Multimodal Model | Sep 20, 2024 | Image CaptioningPanoptic Segmentation | CodeCode Available | 1 |
| A Personalised 3D+t Mesh Generative Model for Unveiling Normal Heart Dynamics | Sep 20, 2024 | | CodeCode Available | 1 |
| Multiscale Encoder and Omni-Dimensional Dynamic Convolution Enrichment in nnU-Net for Brain Tumor Segmentation | Sep 20, 2024 | Brain Tumor SegmentationSegmentation | CodeCode Available | 1 |
| Cross-Domain Knowledge Transfer for Underwater Acoustic Classification Using Pre-trained Models | Sep 20, 2024 | Transfer LearningUnderwater Acoustic Classification | CodeCode Available | 1 |
| OATS: Outlier-Aware Pruning Through Sparse and Low Rank Decomposition | Sep 20, 2024 | CPUNetwork Pruning | CodeCode Available | 1 |
| Prithvi WxC: Foundation Model for Weather and Climate | Sep 20, 2024 | model | CodeCode Available | 1 |
| ShizishanGPT: An Agricultural Large Language Model Integrating Tools and Resources | Sep 20, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Augmenting the Interpretability of GraphCodeBERT for Code Similarity Tasks | Sep 20, 2024 | Clone Detection | CodeCode Available | 1 |
| Exploring Text-Queried Sound Event Detection with Audio Source Separation | Sep 20, 2024 | Audio Source SeparationEvent Detection | CodeCode Available | 1 |
| Exploring Fine-Grained Image-Text Alignment for Referring Remote Sensing Image Segmentation | Sep 20, 2024 | Image SegmentationReferring Expression | CodeCode Available | 1 |