| Perceptually Transparent Binaural Auralization of Simulated Sound Fields | Dec 6, 2024 | | CodeCode Available | 2 | 5 |
| Holistic Autonomous Driving Understanding by Bird's-Eye-View Injected Multi-Modal Large Models | Jan 2, 2024 | Autonomous Driving | CodeCode Available | 2 | 5 |
| GaussianGrasper: 3D Language Gaussian Splatting for Open-vocabulary Robotic Grasping | Mar 14, 2024 | Contrastive LearningNeRF | CodeCode Available | 2 | 5 |
| RAR: Retrieving And Ranking Augmented MLLMs for Visual Recognition | Mar 20, 2024 | Contrastive LearningFine-Grained Visual Recognition | CodeCode Available | 2 | 5 |
| BackFed: An Efficient & Standardized Benchmark Suite for Backdoor Attacks in Federated Learning | Jul 7, 2025 | Federated Learning | CodeCode Available | 2 | 5 |
| An End-to-End Robust Point Cloud Semantic Segmentation Network with Single-Step Conditional Diffusion Models | Nov 25, 2024 | DenoisingScene Understanding | CodeCode Available | 2 | 5 |
| SimplyRetrieve: A Private and Lightweight Retrieval-Centric Generative AI Tool | Aug 8, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection | Feb 2, 2022 | Audio ClassificationEvent Detection | CodeCode Available | 2 | 5 |
| Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models | Jan 31, 2023 | Generative Semantic Nursing | CodeCode Available | 2 | 5 |
| HiKER-SGG: Hierarchical Knowledge Enhanced Robust Scene Graph Generation | Mar 18, 2024 | Scene Graph Generation | CodeCode Available | 2 | 5 |
| StyleTTS: A Style-Based Generative Model for Natural and Diverse Text-to-Speech Synthesis | May 30, 2022 | Data AugmentationSelf-Supervised Learning | CodeCode Available | 2 | 5 |
| SATO: Stable Text-to-Motion Framework | May 2, 2024 | | CodeCode Available | 2 | 5 |
| TopoNets: High Performing Vision and Language Models with Brain-Like Topography | Jan 27, 2025 | | CodeCode Available | 2 | 5 |
| Gaussian in the Dark: Real-Time View Synthesis From Inconsistent Dark Images Using Gaussian Splatting | Aug 17, 2024 | | CodeCode Available | 2 | 5 |
| 3DAffordSplat: Efficient Affordance Reasoning with 3D Gaussians | Apr 15, 2025 | 3DGSAffordance Recognition | CodeCode Available | 2 | 5 |
| AlphaNet: Scaling Up Local-frame-based Atomistic Interatomic Potential | Jan 13, 2025 | Computational Efficiency | CodeCode Available | 2 | 5 |
| Exploring the Compositional Deficiency of Large Language Models in Mathematical Reasoning | May 5, 2024 | GSM8KMath | CodeCode Available | 2 | 5 |
| Open-Source Ground-based Sky Image Datasets for Very Short-term Solar Forecasting, Cloud Analysis and Modeling: A Comprehensive Survey | Nov 27, 2022 | motion prediction | CodeCode Available | 2 | 5 |
| Syllabus: Portable Curricula for Reinforcement Learning Agents | Nov 18, 2024 | NetHackreinforcement-learning | CodeCode Available | 2 | 5 |
| ORFD: A Dataset and Benchmark for Off-Road Freespace Detection | Jun 20, 2022 | Autonomous DrivingSemantic Segmentation | CodeCode Available | 2 | 5 |
| NeuroNet: A Novel Hybrid Self-Supervised Learning Framework for Sleep Stage Classification Using Single-Channel EEG | Apr 10, 2024 | Contrastive LearningEEG | CodeCode Available | 2 | 5 |
| Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions | Dec 20, 2022 | HallucinationQuestion Answering | CodeCode Available | 2 | 5 |
| DreamCar: Leveraging Car-specific Prior for in-the-wild 3D Car Reconstruction | Jul 24, 2024 | Camera Pose EstimationPose Estimation | CodeCode Available | 2 | 5 |
| EyecareGPT: Boosting Comprehensive Ophthalmology Understanding with Tailored Dataset, Benchmark and Model | Apr 18, 2025 | Diagnostic | CodeCode Available | 2 | 5 |
| Contourlet Refinement Gate Framework for Thermal Spectrum Distribution Regularized Infrared Image Super-Resolution | Nov 19, 2024 | Image EnhancementImage Super-Resolution | CodeCode Available | 2 | 5 |
| Predictive Data Selection: The Data That Predicts Is the Data That Teaches | Mar 2, 2025 | | CodeCode Available | 2 | 5 |
| Graph Meets LLMs: Towards Large Graph Models | Aug 28, 2023 | | CodeCode Available | 2 | 5 |
| The first Cadenza challenges: using machine learning competitions to improve music for listeners with a hearing loss | Sep 8, 2024 | | CodeCode Available | 2 | 5 |
| AutoSDF: Shape Priors for 3D Completion, Reconstruction and Generation | Mar 17, 2022 | | CodeCode Available | 2 | 5 |
| Noise Calibration: Plug-and-play Content-Preserving Video Enhancement using Pre-trained Video Diffusion Models | Jul 14, 2024 | DenoisingVideo Enhancement | CodeCode Available | 2 | 5 |
| Heron-Bench: A Benchmark for Evaluating Vision Language Models in Japanese | Apr 11, 2024 | | CodeCode Available | 2 | 5 |
| Foundational Models in Medical Imaging: A Comprehensive Survey and Future Vision | Oct 28, 2023 | | CodeCode Available | 2 | 5 |
| PDE Generalization of In-Context Operator Networks: A Study on 1D Scalar Nonlinear Conservation Laws | Jan 14, 2024 | Operator learning | CodeCode Available | 2 | 5 |
| GreedyViG: Dynamic Axial Graph Construction for Efficient Vision GNNs | May 10, 2024 | graph constructionimage-classification | CodeCode Available | 2 | 5 |
| DistiLLM-2: A Contrastive Approach Boosts the Distillation of LLMs | Mar 10, 2025 | Code GenerationInstruction Following | CodeCode Available | 2 | 5 |
| IDRNet: Intervention-Driven Relation Network for Semantic Segmentation | Oct 16, 2023 | RelationRelation Network | CodeCode Available | 2 | 5 |
| HyperSteer: Activation Steering at Scale with Hypernetworks | Jun 3, 2025 | Dictionary LearningText Generation | CodeCode Available | 2 | 5 |
| CSL: A Large-scale Chinese Scientific Literature Dataset | Sep 12, 2022 | text-classificationText Classification | CodeCode Available | 2 | 5 |
| Bag of Tricks: Benchmarking of Jailbreak Attacks on LLMs | Jun 13, 2024 | BenchmarkingGPU | CodeCode Available | 2 | 5 |
| Matryoshka Representation Learning | May 26, 2022 | 4kImage Classification | CodeCode Available | 2 | 5 |
| UniSim: A Neural Closed-Loop Sensor Simulator | Aug 3, 2023 | | CodeCode Available | 2 | 5 |
| Multi-Space Alignments Towards Universal LiDAR Segmentation | May 2, 2024 | Autonomous DrivingDiversity | CodeCode Available | 2 | 5 |
| CoGenAV: Versatile Audio-Visual Representation Learning via Contrastive-Generative Synchronization | May 6, 2025 | Active Speaker DetectionAudio-Visual Speech Recognition | CodeCode Available | 2 | 5 |
| Omni-Dimensional Dynamic Convolution | Sep 16, 2022 | | CodeCode Available | 2 | 5 |
| Towards Relation-centered Pooling and Convolution for Heterogeneous Graph Learning Networks | Oct 31, 2022 | Graph LearningGraph Neural Network | CodeCode Available | 2 | 5 |
| Solving Challenging Math Word Problems Using GPT-4 Code Interpreter with Code-based Self-Verification | Aug 15, 2023 | Arithmetic ReasoningMath | CodeCode Available | 2 | 5 |
| SegRefiner: Towards Model-Agnostic Segmentation Refinement with Discrete Diffusion Process | Dec 19, 2023 | DenoisingDichotomous Image Segmentation | CodeCode Available | 2 | 5 |
| Superpoint Gaussian Splatting for Real-Time High-Fidelity Dynamic Scene Reconstruction | Jun 6, 2024 | NeRF | CodeCode Available | 2 | 5 |
| GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI | Aug 6, 2024 | Question AnsweringVisual Question Answering | CodeCode Available | 2 | 5 |
| NNetscape Navigator: Complex Demonstrations for Web Agents Without a Demonstrator | Oct 3, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |