| Self-Supervised Speech Quality Estimation and Enhancement Using Only Clean Speech | Feb 26, 2024 | QuantizationSpeech Enhancement | CodeCode Available | 2 |
| HOISDF: Constraining 3D Hand-Object Pose Estimation with Global Signed Distance Fields | Feb 26, 2024 | 3D Hand Pose Estimationhand-object pose | CodeCode Available | 2 |
| Pretrained Visual Uncertainties | Feb 26, 2024 | Retrieval | CodeCode Available | 2 |
| DenseMamba: State Space Models with Dense Hidden Connection for Efficient Large Language Models | Feb 26, 2024 | MambaState Space Models | CodeCode Available | 2 |
| CLAP: Learning Transferable Binary Code Representations with Natural Language Supervision | Feb 26, 2024 | Representation LearningTransfer Learning | CodeCode Available | 2 |
| Pandora's White-Box: Precise Training Data Detection and Extraction in Large Language Models | Feb 26, 2024 | Language Modelling | CodeCode Available | 2 |
| TEaR: Improving LLM-based Machine Translation with Systematic Self-Refinement | Feb 26, 2024 | Machine TranslationTranslation | CodeCode Available | 2 |
| CodeS: Towards Building Open-source Language Models for Text-to-SQL | Feb 26, 2024 | Data AugmentationDiagnostic | CodeCode Available | 2 |
| Feedback Efficient Online Fine-Tuning of Diffusion Models | Feb 26, 2024 | reinforcement-learningReinforcement Learning | CodeCode Available | 2 |
| DecompDiff: Diffusion Models with Decomposed Priors for Structure-Based Drug Design | Feb 26, 2024 | AvgDrug Design | CodeCode Available | 2 |
| Defending LLMs against Jailbreaking Attacks via Backtranslation | Feb 26, 2024 | Language Modelling | CodeCode Available | 2 |
| CARTE: Pretraining and Transfer for Tabular Learning | Feb 26, 2024 | Data IntegrationTransfer Learning | CodeCode Available | 2 |
| DEYO: DETR with YOLO for End-to-End Object Detection | Feb 26, 2024 | DecoderGPU | CodeCode Available | 2 |
| An Integrated Data Processing Framework for Pretraining Foundation Models | Feb 26, 2024 | | CodeCode Available | 2 |
| DrAttack: Prompt Decomposition and Reconstruction Makes Powerful LLM Jailbreakers | Feb 25, 2024 | In-Context LearningSafety Alignment | CodeCode Available | 2 |
| GenNBV: Generalizable Next-Best-View Policy for Active 3D Reconstruction | Feb 25, 2024 | 3D ReconstructionActive 3D Reconstruction | CodeCode Available | 2 |
| VOLoc: Visual Place Recognition by Querying Compressed Lidar Map | Feb 25, 2024 | Pose EstimationTransfer Learning | CodeCode Available | 2 |
| HiGPT: Heterogeneous Graph Language Model | Feb 25, 2024 | Graph LearningLanguage Modeling | CodeCode Available | 2 |
| GraphWiz: An Instruction-Following Language Model for Graph Problems | Feb 25, 2024 | Instruction FollowingLanguage Modeling | CodeCode Available | 2 |
| Deep Homography Estimation for Visual Place Recognition | Feb 25, 2024 | Homography EstimationRe-Ranking | CodeCode Available | 2 |
| GAOKAO-MM: A Chinese Human-Level Benchmark for Multimodal Models Evaluation | Feb 24, 2024 | | CodeCode Available | 2 |
| Res-VMamba: Fine-Grained Food Category Visual Classification Using Selective State Space Models with Deep Residual Learning | Feb 24, 2024 | ClassificationFine-Grained Image Recognition | CodeCode Available | 2 |
| Reliable Conflictive Multi-View Learning | Feb 24, 2024 | MULTI-VIEW LEARNING | CodeCode Available | 2 |
| HIR-Diff: Unsupervised Hyperspectral Image Restoration Via Improved Diffusion Models | Feb 24, 2024 | DenoisingImage Restoration | CodeCode Available | 2 |
| MACRec: a Multi-Agent Collaboration Framework for Recommendation | Feb 23, 2024 | Conversational RecommendationDecision Making | CodeCode Available | 2 |
| Morphological Symmetries in Robotics | Feb 23, 2024 | Data Augmentation | CodeCode Available | 2 |
| EasyRL4Rec: An Easy-to-use Library for Reinforcement Learning Based Recommender Systems | Feb 23, 2024 | Recommendation SystemsReinforcement Learning (RL) | CodeCode Available | 2 |
| Machine Unlearning of Pre-trained Large Language Models | Feb 23, 2024 | Machine Unlearning | CodeCode Available | 2 |
| An Empirical Study of Data Ability Boundary in LLMs' Math Reasoning | Feb 23, 2024 | Arithmetic ReasoningAutomated Theorem Proving | CodeCode Available | 2 |
| ToMBench: Benchmarking Theory of Mind in Large Language Models | Feb 23, 2024 | BenchmarkingMultiple-choice | CodeCode Available | 2 |
| Foundation Policies with Hilbert Representations | Feb 23, 2024 | Reinforcement Learning (RL)Unsupervised Pre-training | CodeCode Available | 2 |
| EMIFF: Enhanced Multi-scale Image Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object Detection | Feb 23, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| GraphEdit: Large Language Models for Graph Structure Learning | Feb 23, 2024 | Graph structure learning | CodeCode Available | 2 |
| Fast Adversarial Attacks on Language Models In One GPU Minute | Feb 23, 2024 | Adversarial AttackComputational Efficiency | CodeCode Available | 2 |
| RoboEXP: Action-Conditioned Scene Graph via Interactive Exploration for Robotic Manipulation | Feb 23, 2024 | | CodeCode Available | 2 |
| ChunkAttention: Efficient Self-Attention with Prefix-Aware KV Cache and Two-Phase Partition | Feb 23, 2024 | | CodeCode Available | 2 |
| Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition | Feb 23, 2024 | Image GenerationPersonalized Image Generation | CodeCode Available | 2 |
| The Good and The Bad: Exploring Privacy Issues in Retrieval-Augmented Generation (RAG) | Feb 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Grasp, See, and Place: Efficient Unknown Object Rearrangement with Policy Structure Prior | Feb 23, 2024 | ObjectObject Rearrangement | CodeCode Available | 2 |
| Symbolic Music Generation with Non-Differentiable Rule Guided Diffusion | Feb 22, 2024 | Music Generation | CodeCode Available | 2 |
| HyperFast: Instant Classification for Tabular Data | Feb 22, 2024 | AutoMLClassification | CodeCode Available | 2 |
| Not All Experts are Equal: Efficient Expert Pruning and Skipping for Mixture-of-Experts Large Language Models | Feb 22, 2024 | AllMixture-of-Experts | CodeCode Available | 2 |
| Measuring Multimodal Mathematical Reasoning with MATH-Vision Dataset | Feb 22, 2024 | DiversityMath | CodeCode Available | 2 |
| Batch and match: black-box variational inference with a score-based divergence | Feb 22, 2024 | Variational Inference | CodeCode Available | 2 |
| HINT: High-quality INPainting Transformer with Mask-Aware Encoding and Enhanced Attention | Feb 22, 2024 | Image Inpaintingspeech-recognition | CodeCode Available | 2 |
| GeneOH Diffusion: Towards Generalizable Hand-Object Interaction Denoising via Denoising Diffusion | Feb 22, 2024 | Denoising | CodeCode Available | 2 |
| tinyBenchmarks: evaluating LLMs with fewer examples | Feb 22, 2024 | MMLUMultiple-choice | CodeCode Available | 2 |
| WeakSAM: Segment Anything Meets Weakly-supervised Instance-level Recognition | Feb 22, 2024 | Image-level Supervised Instance Segmentationobject-detection | CodeCode Available | 2 |
| Data Science with LLMs and Interpretable Models | Feb 22, 2024 | Additive modelsQuestion Answering | CodeCode Available | 2 |
| Stable Neural Stochastic Differential Equations in Analyzing Irregular Time Series Data | Feb 22, 2024 | Irregular Time SeriesMissing Values | CodeCode Available | 2 |