| A Closer Look at Learned Optimization: Stability, Robustness, and Inductive Biases | Sep 22, 2022 | Inductive Bias | Code Available | 2 |
| Rec-R1: Bridging Generative Large Language Models and User-Centric Recommendation Systems via Reinforcement Learning | Mar 31, 2025 | General Reinforcement Learning, Instruction Following | Code Available | 2 |
| MultiZoo & MultiBench: A Standardized Toolkit for Multimodal Deep Learning | Jun 28, 2023 | Deep Learning, Multimodal Deep Learning | Code Available | 2 |
| RecDiff: Diffusion Model for Social Recommendation | Jun 1, 2024 | Denoising, model | Code Available | 2 |
| BEVHeight: A Robust Framework for Vision-based Roadside 3D Object Detection | Mar 15, 2023 | 3D Object Detection, Autonomous Driving | Code Available | 2 |
| Computational Life: How Well-formed, Self-replicating Programs Emerge from Simple Interaction | Jun 27, 2024 | Artificial Life | Code Available | 2 |
| PEM: Prototype-based Efficient MaskFormer for Image Segmentation | Feb 29, 2024 | Image Segmentation, Panoptic Segmentation | Code Available | 2 |
| Beyond Text-Visual Attention: Exploiting Visual Cues for Effective Token Pruning in VLMs | Dec 2, 2024 | All, Language Modeling | Code Available | 2 |
| A Survey of Personalized Large Language Models: Progress and Future Directions | Feb 17, 2025 | Emotion Recognition, General Knowledge | Code Available | 2 |
| TIES-Merging: Resolving Interference When Merging Models | Jun 2, 2023 | Transfer Learning | Code Available | 2 |
| CLIP-Mesh: Generating textured meshes from text using pretrained image-text models | Mar 24, 2022 | | Code Available | 2 |
| What's In My Big Data? | Oct 31, 2023 | Benchmarking | Code Available | 2 |
| Continual Pre-training of Language Models | Feb 7, 2023 | Continual Learning, Continual Pretraining | Code Available | 2 |
| Multi-Modal UAV Detection, Classification and Tracking Algorithm -- Technical Report for CVPR 2024 UG2 Challenge | May 26, 2024 | Classification, Edge Classification | Code Available | 2 |
| HyperReel: High-Fidelity 6-DoF Video with Ray-Conditioned Sampling | Jan 5, 2023 | Novel View Synthesis, Vocal Bursts Intensity Prediction | Code Available | 2 |
| F^2-NeRF: Fast Neural Radiance Field Training with Free Camera Trajectories | Mar 28, 2023 | NeRF, Novel View Synthesis | Code Available | 2 |
| Details or Artifacts: A Locally Discriminative Learning Approach to Realistic Image Super-Resolution | Mar 17, 2022 | Image Super-Resolution, Super-Resolution | Code Available | 2 |
| Omegance: A Single Parameter for Various Granularities in Diffusion-Based Synthesis | Nov 26, 2024 | Denoising | Code Available | 2 |
| Star-convex Polyhedra for 3D Object Detection and Segmentation in Microscopy | Aug 9, 2019 | 3D Object Detection, object-detection | Code Available | 2 |
| Recent Advances in Speech Language Models: A Survey | Oct 1, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 2 |
| Fast-BEV: Towards Real-time On-vehicle Bird's-Eye View Perception | Jan 19, 2023 | Autonomous Driving, Data Augmentation | Code Available | 2 |
| SRFormer: Text Detection Transformer with Incorporated Segmentation and Regression | Aug 21, 2023 | Decoder, regression | Code Available | 2 |
| Exploring CLIP's Dense Knowledge for Weakly Supervised Semantic Segmentation | Mar 26, 2025 | Attribute, Semantic Segmentation | Code Available | 2 |
| LASP-2: Rethinking Sequence Parallelism for Linear Attention and Its Hybrid | Feb 11, 2025 | | Code Available | 2 |
| Large Language Model Safety: A Holistic Survey | Dec 23, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Attentions Help CNNs See Better: Attention-based Hybrid Image Quality Assessment Network | Apr 22, 2022 | Generative Adversarial Network, Image Quality Assessment | Code Available | 2 |
| MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens | Oct 3, 2023 | Image Generation, multimodal generation | Code Available | 2 |
| pyrtklib: An open-source package for tightly coupled deep learning and GNSS integration for positioning in urban canyons | Sep 19, 2024 | Deep Learning | Code Available | 2 |
| MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group Quantization | Jul 10, 2025 | 2k, Quantization | Code Available | 2 |
| BayesFlow: Amortized Bayesian Workflows With Neural Networks | Jun 28, 2023 | Bayesian Inference, Data Compression | Code Available | 2 |
| DuPL: Dual Student with Trustworthy Progressive Learning for Robust Weakly Supervised Semantic Segmentation | Mar 17, 2024 | Semantic Segmentation, Weakly supervised Semantic Segmentation | Code Available | 2 |
| AGLA: Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention | Jun 18, 2024 | Object, Response Generation | Code Available | 2 |
| Equivariant 3D-Conditional Diffusion Models for Molecular Linker Design | Oct 11, 2022 | Drug Discovery, valid | Code Available | 2 |
| NoiseCollage: A Layout-Aware Text-to-Image Diffusion Model Based on Noise Cropping and Merging | Mar 6, 2024 | Denoising, Image Generation | Code Available | 2 |
| InFoBench: Evaluating Instruction Following Ability in Large Language Models | Jan 7, 2024 | Instruction Following | Code Available | 2 |
| ForecastBench: A Dynamic Benchmark of AI Forecasting Capabilities | Sep 30, 2024 | Decision Making | Code Available | 2 |
| BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion | Jul 20, 2023 | Conditional Text-to-Image Synthesis, Denoising | Code Available | 2 |
| Improving Text-guided Object Inpainting with Semantic Pre-inpainting | Sep 12, 2024 | Denoising, Object | Code Available | 2 |
| INQUIRE: A Natural World Text-to-Image Retrieval Benchmark | Nov 4, 2024 | Image Retrieval, Reranking | Code Available | 2 |
| Adaptive Personalized Federated Learning | Mar 30, 2020 | Bilevel Optimization, Federated Learning | Code Available | 2 |
| CodeBERTScore: Evaluating Code Generation with Pretrained Models of Code | Feb 10, 2023 | Code Generation | Code Available | 2 |
| A real-time dynamic obstacle tracking and mapping system for UAV navigation and collision avoidance with an RGB-D camera | Sep 17, 2022 | Autonomous Driving, Collision Avoidance | Code Available | 2 |
| Predictive Dynamic Fusion | Jun 7, 2024 | Decision Making | Code Available | 2 |
| MetaFormer: A Unified Meta Framework for Fine-Grained Recognition | Mar 5, 2022 | Attribute, Fine-Grained Image Classification | Code Available | 2 |
| Decomposing Disease Descriptions for Enhanced Pathology Detection: A Multi-Aspect Vision-Language Pre-training Framework | Mar 12, 2024 | Language Modelling, Large Language Model | Code Available | 2 |
| Deep Patch Visual Odometry | Aug 8, 2022 | Monocular Visual Odometry, Visual Odometry | Code Available | 2 |
| PKU-DyMVHumans: A Multi-View Video Benchmark for High-Fidelity Dynamic Human Modeling | Mar 24, 2024 | NeRF, Novel View Synthesis | Code Available | 2 |
| MemEngine: A Unified and Modular Library for Developing Advanced Memory of LLM-based Agents | May 4, 2025 | Language Modeling, Language Modelling | Code Available | 2 |
| LingoQA: Visual Question Answering for Autonomous Driving | Dec 21, 2023 | Autonomous Driving, Decision Making | Code Available | 2 |
| MemLong: Memory-Augmented Retrieval for Long Text Modeling | Aug 30, 2024 | 4k, Decoder | Code Available | 2 |