| OpenPifPaf: Composite Fields for Semantic Keypoint Detection and Spatio-Temporal Association | Mar 3, 2021 | Car Pose EstimationKeypoint Detection | CodeCode Available | 2 |
| MedViT: A Robust Vision Transformer for Generalized Medical Image Classification | Feb 19, 2023 | image-classificationImage Classification | CodeCode Available | 2 |
| FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores | Nov 10, 2023 | | CodeCode Available | 2 |
| H3T: Efficient Integration of Memory Optimization and Parallelism for Large-scale Transformer Training | Sep 21, 2023 | | CodeCode Available | 2 |
| Training-free CryoET Tomogram Segmentation | Jul 8, 2024 | Contrastive LearningCryogenic Electron Tomography | CodeCode Available | 2 |
| Beyond Next Token Prediction: Patch-Level Training for Large Language Models | Jul 17, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Articulated Object Interaction in Unknown Scenes with Whole-Body Mobile Manipulation | Mar 18, 2021 | Object | CodeCode Available | 2 |
| BAD-NeRF: Bundle Adjusted Deblur Neural Radiance Fields | Nov 23, 2022 | 3D Scene ReconstructionDeblurring | CodeCode Available | 2 |
| BLASER: A Text-Free Speech-to-Speech Translation Evaluation Metric | Dec 16, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 2 |
| FacTool: Factuality Detection in Generative AI -- A Tool Augmented Framework for Multi-Task and Multi-Domain Scenarios | Jul 25, 2023 | Code GenerationFact Checking | CodeCode Available | 2 |
| ReDel: A Toolkit for LLM-Powered Recursive Multi-Agent Systems | Aug 5, 2024 | AI Agent | CodeCode Available | 2 |
| A Review of Graph Neural Networks in Epidemic Modeling | Mar 28, 2024 | Epidemiology | CodeCode Available | 2 |
| LongEmbed: Extending Embedding Models for Long Context Retrieval | Apr 18, 2024 | 4k8k | CodeCode Available | 2 |
| Towards Interpreting Visual Information Processing in Vision-Language Models | Oct 9, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Seg-metrics: a Python package to compute segmentation metrics | Jan 12, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 |
| SDSTrack: Self-Distillation Symmetric Adapter Learning for Multi-Modal Visual Object Tracking | Mar 24, 2024 | Object TrackingRgb-T Tracking | CodeCode Available | 2 |
| GAN Prior Embedded Network for Blind Face Restoration in the Wild | May 13, 2021 | Blind Face RestorationDecoder | CodeCode Available | 2 |
| Fortuna: A Library for Uncertainty Quantification in Deep Learning | Feb 8, 2023 | Bayesian InferenceBenchmarking | CodeCode Available | 2 |
| Parallel Bayesian Optimization of Multiple Noisy Objectives with Expected Hypervolume Improvement | May 17, 2021 | Bayesian Optimization | CodeCode Available | 2 |
| BinsFormer: Revisiting Adaptive Bins for Monocular Depth Estimation | Apr 3, 2022 | DecoderDepth Estimation | CodeCode Available | 2 |
| CogView: Mastering Text-to-Image Generation via Transformers | May 26, 2021 | Image GenerationSuper-Resolution | CodeCode Available | 2 |
| TURNA: A Turkish Encoder-Decoder Language Model for Enhanced Understanding and Generation | Jan 25, 2024 | DecoderLanguage Modeling | CodeCode Available | 2 |
| RGL: A Graph-Centric, Modular Framework for Efficient Retrieval-Augmented Generation on Graphs | Mar 25, 2025 | Abstract generation | CodeCode Available | 2 |
| A Contrastive Framework for Neural Text Generation | Feb 13, 2022 | DiversityText Generation | CodeCode Available | 2 |
| Understanding Multi-Granularity for Open-Vocabulary Part Segmentation | Jun 17, 2024 | Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation | CodeCode Available | 2 |
| Etalon: Holistic Performance Evaluation Framework for LLM Inference Systems | Jul 9, 2024 | | CodeCode Available | 2 |
| FairMedFM: Fairness Benchmarking for Medical Imaging Foundation Models | Jul 1, 2024 | BenchmarkingFairness | CodeCode Available | 2 |
| Cap4Video: What Can Auxiliary Captions Do for Text-Video Retrieval? | Dec 31, 2022 | Data AugmentationRetrieval | CodeCode Available | 2 |
| PsycoLLM: Enhancing LLM for Psychological Understanding and Evaluation | Jul 8, 2024 | EthicsLanguage Modeling | CodeCode Available | 2 |
| A Diffusion-Based Generative Equalizer for Music Restoration | Mar 27, 2024 | Bandwidth ExtensionHallucination | CodeCode Available | 2 |
| Efficient and Modular Implicit Differentiation | May 31, 2021 | Meta-Learning | CodeCode Available | 2 |
| Omnizart: A General Toolbox for Automatic Music Transcription | Jun 1, 2021 | Chord RecognitionDownbeat Tracking | CodeCode Available | 2 |
| PrimeQA: The Prime Repository for State-of-the-Art Multilingual Question Answering Research and Development | Jan 23, 2023 | Question AnsweringReading Comprehension | CodeCode Available | 2 |
| PolyRoom: Room-aware Transformer for Floorplan Reconstruction | Jul 15, 2024 | | CodeCode Available | 2 |
| deepmriprep: Voxel-based Morphometry (VBM) Preprocessing via Deep Neural Networks | Aug 20, 2024 | GPUImage Registration | CodeCode Available | 2 |
| Thermal half-lives of azobenzene derivatives: virtual screening based on intersystem crossing using a machine learning potential | Jul 23, 2022 | | CodeCode Available | 2 |
| Revisiting Contrastive Methods for Unsupervised Learning of Visual Representations | Jun 10, 2021 | Instance Segmentationobject-detection | CodeCode Available | 2 |
| LiDAR-PTQ: Post-Training Quantization for Point Cloud 3D Object Detection | Jan 29, 2024 | 3D Object DetectionAutonomous Vehicles | CodeCode Available | 2 |
| Realistic and Efficient Face Swapping: A Unified Approach with Diffusion Models | Sep 11, 2024 | DenoisingDisentanglement | CodeCode Available | 2 |
| Interpretable Machine Learning for Science with PySR and SymbolicRegression.jl | May 2, 2023 | Interpretable Machine Learningregression | CodeCode Available | 2 |
| ReVersion: Diffusion-Based Relation Inversion from Images | Mar 23, 2023 | Contrastive LearningFew-Shot Learning | CodeCode Available | 2 |
| Towards Scalable Automated Alignment of LLMs: A Survey | Jun 3, 2024 | Survey | CodeCode Available | 2 |
| AxBench: Steering LLMs? Even Simple Baselines Outperform Sparse Autoencoders | Jan 28, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| ViTime: A Visual Intelligence-Based Foundation Model for Time Series Forecasting | Jul 10, 2024 | Time SeriesTime Series Analysis | CodeCode Available | 2 |
| Less is More: Fewer Interpretable Region via Submodular Subset Selection | Feb 14, 2024 | Error UnderstandingImage Attribution | CodeCode Available | 2 |
| LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Inference | Jun 26, 2024 | multimodal interaction | CodeCode Available | 2 |
| NeuS: Learning Neural Implicit Surfaces by Volume Rendering for Multi-view Reconstruction | Jun 20, 2021 | NeRFNovel View Synthesis | CodeCode Available | 2 |
| Understanding Performance of Long-Document Ranking Models through Comprehensive Evaluation and Leaderboarding | Jul 4, 2022 | BenchmarkingDocument Ranking | CodeCode Available | 2 |
| VSA: Learning Varied-Size Window Attention in Vision Transformers | Apr 18, 2022 | Instance SegmentationObject Detection | CodeCode Available | 2 |
| Pano2Room: Novel View Synthesis from a Single Indoor Panorama | Aug 21, 2024 | Novel View Synthesis | CodeCode Available | 2 |