| Seg-metrics: a Python package to compute segmentation metrics | Jan 12, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 |
| SDSTrack: Self-Distillation Symmetric Adapter Learning for Multi-Modal Visual Object Tracking | Mar 24, 2024 | Object TrackingRgb-T Tracking | CodeCode Available | 2 |
| GAN Prior Embedded Network for Blind Face Restoration in the Wild | May 13, 2021 | Blind Face RestorationDecoder | CodeCode Available | 2 |
| Fortuna: A Library for Uncertainty Quantification in Deep Learning | Feb 8, 2023 | Bayesian InferenceBenchmarking | CodeCode Available | 2 |
| Parallel Bayesian Optimization of Multiple Noisy Objectives with Expected Hypervolume Improvement | May 17, 2021 | Bayesian Optimization | CodeCode Available | 2 |
| BinsFormer: Revisiting Adaptive Bins for Monocular Depth Estimation | Apr 3, 2022 | DecoderDepth Estimation | CodeCode Available | 2 |
| CogView: Mastering Text-to-Image Generation via Transformers | May 26, 2021 | Image GenerationSuper-Resolution | CodeCode Available | 2 |
| TURNA: A Turkish Encoder-Decoder Language Model for Enhanced Understanding and Generation | Jan 25, 2024 | DecoderLanguage Modeling | CodeCode Available | 2 |
| RGL: A Graph-Centric, Modular Framework for Efficient Retrieval-Augmented Generation on Graphs | Mar 25, 2025 | Abstract generation | CodeCode Available | 2 |
| A Contrastive Framework for Neural Text Generation | Feb 13, 2022 | DiversityText Generation | CodeCode Available | 2 |
| Understanding Multi-Granularity for Open-Vocabulary Part Segmentation | Jun 17, 2024 | Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation | CodeCode Available | 2 |
| Etalon: Holistic Performance Evaluation Framework for LLM Inference Systems | Jul 9, 2024 | | CodeCode Available | 2 |
| FairMedFM: Fairness Benchmarking for Medical Imaging Foundation Models | Jul 1, 2024 | BenchmarkingFairness | CodeCode Available | 2 |
| Cap4Video: What Can Auxiliary Captions Do for Text-Video Retrieval? | Dec 31, 2022 | Data AugmentationRetrieval | CodeCode Available | 2 |
| PsycoLLM: Enhancing LLM for Psychological Understanding and Evaluation | Jul 8, 2024 | EthicsLanguage Modeling | CodeCode Available | 2 |
| A Diffusion-Based Generative Equalizer for Music Restoration | Mar 27, 2024 | Bandwidth ExtensionHallucination | CodeCode Available | 2 |
| Efficient and Modular Implicit Differentiation | May 31, 2021 | Meta-Learning | CodeCode Available | 2 |
| Omnizart: A General Toolbox for Automatic Music Transcription | Jun 1, 2021 | Chord RecognitionDownbeat Tracking | CodeCode Available | 2 |
| PrimeQA: The Prime Repository for State-of-the-Art Multilingual Question Answering Research and Development | Jan 23, 2023 | Question AnsweringReading Comprehension | CodeCode Available | 2 |
| PolyRoom: Room-aware Transformer for Floorplan Reconstruction | Jul 15, 2024 | | CodeCode Available | 2 |
| deepmriprep: Voxel-based Morphometry (VBM) Preprocessing via Deep Neural Networks | Aug 20, 2024 | GPUImage Registration | CodeCode Available | 2 |
| Thermal half-lives of azobenzene derivatives: virtual screening based on intersystem crossing using a machine learning potential | Jul 23, 2022 | | CodeCode Available | 2 |
| Revisiting Contrastive Methods for Unsupervised Learning of Visual Representations | Jun 10, 2021 | Instance Segmentationobject-detection | CodeCode Available | 2 |
| LiDAR-PTQ: Post-Training Quantization for Point Cloud 3D Object Detection | Jan 29, 2024 | 3D Object DetectionAutonomous Vehicles | CodeCode Available | 2 |
| Realistic and Efficient Face Swapping: A Unified Approach with Diffusion Models | Sep 11, 2024 | DenoisingDisentanglement | CodeCode Available | 2 |
| Interpretable Machine Learning for Science with PySR and SymbolicRegression.jl | May 2, 2023 | Interpretable Machine Learningregression | CodeCode Available | 2 |
| ReVersion: Diffusion-Based Relation Inversion from Images | Mar 23, 2023 | Contrastive LearningFew-Shot Learning | CodeCode Available | 2 |
| Towards Scalable Automated Alignment of LLMs: A Survey | Jun 3, 2024 | Survey | CodeCode Available | 2 |
| AxBench: Steering LLMs? Even Simple Baselines Outperform Sparse Autoencoders | Jan 28, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| ViTime: A Visual Intelligence-Based Foundation Model for Time Series Forecasting | Jul 10, 2024 | Time SeriesTime Series Analysis | CodeCode Available | 2 |
| Less is More: Fewer Interpretable Region via Submodular Subset Selection | Feb 14, 2024 | Error UnderstandingImage Attribution | CodeCode Available | 2 |
| LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Inference | Jun 26, 2024 | multimodal interaction | CodeCode Available | 2 |
| NeuS: Learning Neural Implicit Surfaces by Volume Rendering for Multi-view Reconstruction | Jun 20, 2021 | NeRFNovel View Synthesis | CodeCode Available | 2 |
| Understanding Performance of Long-Document Ranking Models through Comprehensive Evaluation and Leaderboarding | Jul 4, 2022 | BenchmarkingDocument Ranking | CodeCode Available | 2 |
| VSA: Learning Varied-Size Window Attention in Vision Transformers | Apr 18, 2022 | Instance SegmentationObject Detection | CodeCode Available | 2 |
| Pano2Room: Novel View Synthesis from a Single Indoor Panorama | Aug 21, 2024 | Novel View Synthesis | CodeCode Available | 2 |
| BBDM: Image-to-image Translation with Brownian Bridge Diffusion Models | May 16, 2022 | Image GenerationImage-to-Image Translation | CodeCode Available | 2 |
| Let Images Give You More:Point Cloud Cross-Modal Training for Shape Analysis | Oct 9, 2022 | 3D Point Cloud ClassificationKnowledge Distillation | CodeCode Available | 2 |
| KARMA: Leveraging Multi-Agent LLMs for Automated Knowledge Graph Enrichment | Feb 10, 2025 | ArticlesKnowledge Graphs | CodeCode Available | 2 |
| 6Img-to-3D: Few-Image Large-Scale Outdoor Driving Scene Reconstruction | Apr 18, 2024 | 3D ReconstructionImage to 3D | CodeCode Available | 2 |
| iSEARLE: Improving Textual Inversion for Zero-Shot Composed Image Retrieval | May 5, 2024 | BenchmarkingComposed Image Retrieval (CoIR) | CodeCode Available | 2 |
| FedCache 2.0: Federated Edge Learning with Knowledge Caching and Dataset Distillation | May 22, 2024 | Dataset DistillationFederated Learning | CodeCode Available | 2 |
| Hawkeye: A PyTorch-based Library for Fine-Grained Image Recognition with Deep Learning | Oct 14, 2023 | Fine-Grained Image Recognition | CodeCode Available | 2 |
| L-AutoDA: Leveraging Large Language Models for Automated Decision-based Adversarial Attacks | Jan 27, 2024 | Adversarial AttackComputational Efficiency | CodeCode Available | 2 |
| Learning to Route Among Specialized Experts for Zero-Shot Generalization | Feb 8, 2024 | parameter-efficient fine-tuningZero-shot Generalization | CodeCode Available | 2 |
| GRiT: A Generative Region-to-text Transformer for Object Understanding | Dec 1, 2022 | DecoderDense Captioning | CodeCode Available | 2 |
| RemoteCLIP: A Vision Language Foundation Model for Remote Sensing | Jun 19, 2023 | ClassificationCross-Modal Retrieval | CodeCode Available | 2 |
| BioDiscoveryAgent: An AI Agent for Designing Genetic Perturbation Experiments | May 27, 2024 | AI AgentBayesian Optimization | CodeCode Available | 2 |
| 3DFaceShop: Explicitly Controllable 3D-Aware Portrait Generation | Sep 12, 2022 | 3D Face AnimationDisentanglement | CodeCode Available | 2 |
| ExpeL: LLM Agents Are Experiential Learners | Aug 20, 2023 | Decision MakingTransfer Learning | CodeCode Available | 2 |