| Nemo: First Glimpse of a New Rule Engine | Aug 30, 2023 | Knowledge Graphs | Code Available | 2 |
| Softpick: No Attention Sink, No Massive Activations with Rectified Softmax | Apr 29, 2025 | Quantization | Code Available | 2 |
| Interpretability at Scale: Identifying Causal Mechanisms in Alpaca | May 15, 2023 | | Code Available | 2 |
| BEVCar: Camera-Radar Fusion for BEV Map and Object Segmentation | Mar 18, 2024 | Decision Making, Scene Segmentation | Code Available | 2 |
| Point Segment and Count: A Generalized Framework for Object Counting | Jan 1, 2024 | Few-shot Object Counting and Detection, Knowledge Distillation | Code Available | 2 |
| XrayGPT: Chest Radiographs Summarization using Medical Vision-Language Models | Jun 13, 2023 | Language Modeling, Language Modelling | Code Available | 2 |
| QQQ: Quality Quattuor-Bit Quantization for Large Language Models | Jun 14, 2024 | Quantization | Code Available | 2 |
| CV-Cities: Advancing Cross-View Geo-Localization in Global Cities | Nov 19, 2024 | Cross-View Geo-Localisation, Drone-view target localization | Code Available | 2 |
| A Unified Framework for 3D Scene Understanding | Jul 3, 2024 | Contrastive Learning, Knowledge Distillation | Code Available | 2 |
| Differentiable Reward Optimization for LLM based TTS system | Jul 8, 2025 | text-to-speech, Text to Speech | Code Available | 2 |
| SF-V: Single Forward Video Generation Model | Jun 6, 2024 | Denoising, model | Code Available | 2 |
| Graph Neural Networks in TensorFlow and Keras with Spektral | Jun 22, 2020 | Deep Learning, General Classification | Code Available | 2 |
| ArtGS: Building Interactable Replicas of Complex Articulated Objects via Gaussian Splatting | Feb 26, 2025 | parameter estimation | Code Available | 2 |
| TensorOpt: Exploring the Tradeoffs in Distributed DNN Training with Auto-Parallelism | Apr 16, 2020 | | Code Available | 2 |
| Rank-based Non-dominated Sorting | Mar 25, 2022 | Evolutionary Algorithms | Code Available | 2 |
| Uncovering What, Why and How: A Comprehensive Benchmark for Causation Understanding of Video Anomaly | Apr 30, 2024 | Anomaly Detection | Code Available | 2 |
| Tractable Probabilistic Graph Representation Learning with Graph-Induced Sum-Product Networks | May 17, 2023 | Graph Classification, Graph Representation Learning | Code Available | 2 |
| PillarNeXt: Rethinking Network Designs for 3D Object Detection in LiDAR Point Clouds | May 8, 2023 | 2D Object Detection, 3D Object Detection | Code Available | 2 |
| Torch-Struct: Deep Structured Prediction Library | Jul 1, 2020 | Deep Learning, Prediction | Code Available | 2 |
| FaceDancer: Pose- and Occlusion-Aware High Fidelity Face Swapping | Oct 19, 2022 | Attribute, Decoder | Code Available | 2 |
| Curriculum Learning for ab initio Deep Learned Refractive Optics | Feb 2, 2023 | | Code Available | 2 |
| ZeroQuant(4+2): Redefining LLMs Quantization with a New FP6-Centric Strategy for Diverse Generative Tasks | Dec 14, 2023 | Abstractive Text Summarization, Code Generation | Code Available | 2 |
| MP-SENet: A Speech Enhancement Model with Parallel Denoising of Magnitude and Phase Spectra | May 23, 2023 | Decoder, Denoising | Code Available | 2 |
| Data is all you need: Finetuning LLMs for Chip Design via an Automated design-data augmentation framework | Mar 17, 2024 | All, Data Augmentation | Code Available | 2 |
| Revisiting Scene Text Recognition: A Data Perspective | Jul 17, 2023 | Scene Text Recognition | Code Available | 2 |
| Spanish Pre-trained BERT Model and Evaluation Data | Aug 6, 2023 | Language Modeling, Language Modelling | Code Available | 2 |
| Large Language Models for Information Retrieval: A Survey | Aug 14, 2023 | Information Retrieval, Question Answering | Code Available | 2 |
| Simplifying Paragraph-level Question Generation via Transformer Language Models | May 3, 2020 | Language Modeling, Language Modelling | Code Available | 2 |
| Diffusion Enhancement for Cloud Removal in Ultra-Resolution Remote Sensing Imagery | Jan 25, 2024 | Cloud Removal, Image Generation | Code Available | 2 |
| OGNI-DC: Robust Depth Completion with Optimization-Guided Neural Iterations | Jun 17, 2024 | Depth Completion | Code Available | 2 |
| Prioritized Training on Points that are Learnable, Worth Learning, and Not Yet Learnt | Jun 14, 2022 | | Code Available | 2 |
| Receding Moving Object Segmentation in 3D LiDAR Data Using Sparse 4D Convolutions | Jun 8, 2022 | Autonomous Vehicles, Navigate | Code Available | 2 |
| Scalable 3D Captioning with Pretrained Models | Jun 12, 2023 | Descriptive, Image Captioning | Code Available | 2 |
| Open High-Resolution Satellite Imagery: The WorldStrat Dataset -- With Application to Super-Resolution | Jul 13, 2022 | Humanitarian, Multi-Frame Super-Resolution | Code Available | 2 |
| Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model | Nov 19, 2019 | Atari Games, Atari Games 100k | Code Available | 2 |
| YAYI-UIE: A Chat-Enhanced Instruction Tuning Framework for Universal Information Extraction | Dec 24, 2023 | UIE | Code Available | 2 |
| Complex-YOLO: Real-time 3D Object Detection on Point Clouds | Mar 16, 2018 | 3D Object Detection, Autonomous Driving | Code Available | 2 |
| MLE-Dojo: Interactive Environments for Empowering LLM Agents in Machine Learning Engineering | May 12, 2025 | Large Language Model, reinforcement-learning | Code Available | 2 |
| Rényi Differential Privacy of the Sampled Gaussian Mechanism | Aug 28, 2019 | | Code Available | 2 |
| AmpleGCG: Learning a Universal and Transferable Generative Model of Adversarial Suffixes for Jailbreaking Both Open and Closed LLMs | Apr 11, 2024 | Safety Alignment | Code Available | 2 |
| A Temporal Kolmogorov-Arnold Transformer for Time Series Forecasting | Jun 4, 2024 | Decoder, Kolmogorov-Arnold Networks | Code Available | 2 |
| Block Transformer: Global-to-Local Language Modeling for Fast Inference | Jun 4, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Voxel Mamba: Group-Free State Space Models for Point Cloud based 3D Object Detection | Jun 15, 2024 | 3D Object Detection, Computational Efficiency | Code Available | 2 |
| YUAN 2.0: A Large Language Model with Localized Filtering-based Attention | Nov 27, 2023 | Code Generation, Language Modeling | Code Available | 2 |
| Positive-Augmented Contrastive Learning for Vision-and-Language Evaluation and Training | Oct 9, 2024 | Caption Generation, Contrastive Learning | Code Available | 2 |
| Decomposition Betters Tracking Everything Everywhere | Jul 9, 2024 | Motion Estimation, Point Tracking | Code Available | 2 |
| Kani: A Lightweight and Highly Hackable Framework for Building Language Model Applications | Sep 11, 2023 | Language Modeling, Language Modelling | Code Available | 2 |
| Multimodal Prototyping for cancer survival prediction | Jun 28, 2024 | Prediction, Survival Prediction | Code Available | 2 |
| WSI-VQA: Interpreting Whole Slide Images by Generative Visual Question Answering | Jul 8, 2024 | Diagnostic, Generative Visual Question Answering | Code Available | 2 |
| Trans-Tokenization and Cross-lingual Vocabulary Transfers: Language Adaptation of LLMs for Low-Resource NLP | Aug 8, 2024 | Language Modeling, Language Modelling | Code Available | 2 |