| 3DGraphLLM: Combining Semantic Graphs and Large Language Models for 3D Scene Understanding | Dec 24, 2024 | Natural Language Understanding, Scene Understanding | Code Available | 2 | 5 |
| SEGAN: Speech Enhancement Generative Adversarial Network | Mar 28, 2017 | Generative Adversarial Network, Speech Enhancement | Code Available | 2 | 5 |
| Knowledge Distillation in YOLOX-ViT for Side-Scan Sonar Object Detection | Mar 14, 2024 | Knowledge Distillation, Novel Object Detection | Code Available | 2 | 5 |
| Progressive Distillation for Fast Sampling of Diffusion Models | Feb 1, 2022 | Density Estimation, Image Generation | Code Available | 2 | 5 |
| Contrastive learning of Class-agnostic Activation Map for Weakly Supervised Object Localization and Semantic Segmentation | Mar 25, 2022 | Contrastive Learning, image-classification | Code Available | 2 | 5 |
| SC-DepthV3: Robust Self-supervised Monocular Depth Estimation for Dynamic Scenes | Nov 7, 2022 | Depth Estimation, Indoor Monocular Depth Estimation | Code Available | 2 | 5 |
| Think While You Generate: Discrete Diffusion with Planned Denoising | Oct 8, 2024 | Denoising, Image Generation | Code Available | 2 | 5 |
| VDT: General-purpose Video Diffusion Transformers via Mask Modeling | May 22, 2023 | Autonomous Driving, Video Generation | Code Available | 2 | 5 |
| Rethinking Efficient and Effective Point-based Networks for Event Camera Classification and Regression: EventMamba | May 9, 2024 | Action Recognition, Mamba | Code Available | 2 | 5 |
| Lost in the Middle: How Language Models Use Long Contexts | Jul 6, 2023 | Language Modelling, Position | Code Available | 2 | 5 |
| Representation Engineering: A Top-Down Approach to AI Transparency | Oct 2, 2023 | Question Answering | Code Available | 2 | 5 |
| WeatherGS: 3D Scene Reconstruction in Adverse Weather Conditions via Gaussian Splatting | Dec 25, 2024 | 3DGS, 3D Reconstruction | Code Available | 2 | 5 |
| The Surprising Effectiveness of Multimodal Large Language Models for Video Moment Retrieval | Jun 26, 2024 | Action Localization, Moment Retrieval | Code Available | 2 | 5 |
| Aligning Text-to-Image Diffusion Models with Reward Backpropagation | Oct 5, 2023 | Denoising, Image Generation | Code Available | 2 | 5 |
| Temporal Graph Benchmark for Machine Learning on Temporal Graphs | Jul 3, 2023 | Node Property Prediction, Property Prediction | Code Available | 2 | 5 |
| A Survey on Data Augmentation in Large Model Era | Jan 27, 2024 | Audio Signal Processing, Data Augmentation | Code Available | 2 | 5 |
| On the Efficacy of Eviction Policy for Key-Value Constrained Generative Language Model Inference | Feb 9, 2024 | GPU, Language Modeling | Code Available | 2 | 5 |
| Active-Learning-as-a-Service: An Automatic and Efficient MLOps System for Data-Centric AI | Jul 19, 2022 | Active Learning, AutoML | Code Available | 2 | 5 |
| XRAG: eXamining the Core -- Benchmarking Foundational Components in Advanced Retrieval-Augmented Generation | Dec 20, 2024 | Benchmarking, Diagnostic | Code Available | 2 | 5 |
| GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher | Aug 12, 2023 | Ethics, Red Teaming | Code Available | 2 | 5 |
| AlphaMaze: Enhancing Large Language Models' Spatial Intelligence via GRPO | Feb 20, 2025 | Autonomous Navigation, Navigate | Code Available | 2 | 5 |
| Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs | Apr 21, 2025 | Attribute, Camera Pose Estimation | Code Available | 2 | 5 |
| AnyLoc: Towards Universal Visual Place Recognition | Aug 1, 2023 | Image Retrieval, Visual Place Recognition | Code Available | 2 | 5 |
| Anchor3DLane++: 3D Lane Detection via Sample-Adaptive Sparse 3D Anchor Regression | Dec 22, 2024 | 3D Lane Detection, Lane Detection | Code Available | 2 | 5 |
| Flow Priors for Linear Inverse Problems via Iterative Corrupted Trajectory Matching | May 29, 2024 | compressed sensing, Deblurring | Code Available | 2 | 5 |