| Knowledge Distillation in YOLOX-ViT for Side-Scan Sonar Object Detection | Mar 14, 2024 | Knowledge DistillationNovel Object Detection | CodeCode Available | 2 |
| SEGAN: Speech Enhancement Generative Adversarial Network | Mar 28, 2017 | Generative Adversarial NetworkSpeech Enhancement | CodeCode Available | 2 |
| SeerAttention: Learning Intrinsic Sparse Attention in Your LLMs | Oct 17, 2024 | | CodeCode Available | 2 |
| CPED: A Large-Scale Chinese Personalized and Emotional Dialogue Dataset for Conversational AI | May 29, 2022 | Chinese Sentiment AnalysisConversational Response Generation | CodeCode Available | 2 |
| 3DGraphLLM: Combining Semantic Graphs and Large Language Models for 3D Scene Understanding | Dec 24, 2024 | Natural Language UnderstandingScene Understanding | CodeCode Available | 2 |
| Standing Between Past and Future: Spatio-Temporal Modeling for Multi-Camera 3D Multi-Object Tracking | Feb 7, 2023 | 3D Multi-Object TrackingMulti-Object Tracking | CodeCode Available | 2 |
| AutoSoccerPose: Automated 3D posture Analysis of Soccer Shot Movements | May 20, 2024 | 3D Pose EstimationPose Estimation | CodeCode Available | 2 |
| Global Estimation of Building-Integrated Facade and Rooftop Photovoltaic Potential by Integrating 3D Building Footprint and Spatio-Temporal Datasets | Dec 2, 2024 | | CodeCode Available | 2 |
| ByT5 model for massively multilingual grapheme-to-phoneme conversion | Apr 6, 2022 | Grapheme-to-Phoneme Conversion | CodeCode Available | 2 |
| Unwrapping The Black Box of Deep ReLU Networks: Interpretability, Diagnostics, and Simplification | Nov 8, 2020 | | CodeCode Available | 2 |
| StreetSurf: Extending Multi-view Implicit Surface Reconstruction to Street Views | Jun 8, 2023 | Autonomous DrivingGPU | CodeCode Available | 2 |
| A Data-scalable Transformer for Medical Image Segmentation: Architecture, Model Efficiency, and Benchmark | Feb 28, 2022 | Image SegmentationInductive Bias | CodeCode Available | 2 |
| Neural interval-censored survival regression with feature selection | Jun 14, 2022 | feature selectionregression | CodeCode Available | 2 |
| DynamicRAG: Leveraging Outputs of Large Language Model as Feedback for Dynamic Reranking in Retrieval-Augmented Generation | May 12, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models | Nov 28, 2022 | DenoisingLanguage Modeling | CodeCode Available | 2 |
| Executing your Commands via Motion Diffusion in Latent Space | Dec 8, 2022 | Motion GenerationMotion Synthesis | CodeCode Available | 2 |
| NMS Strikes Back | Dec 12, 2022 | Attributeobject-detection | CodeCode Available | 2 |
| YOLOMG: Vision-based Drone-to-Drone Detection with Appearance and Pixel-Level Motion Fusion | Mar 10, 2025 | | CodeCode Available | 2 |
| DiffFace: Diffusion-based Face Swapping with Facial Guidance | Dec 27, 2022 | Face Swapping | CodeCode Available | 2 |
| CodeJudge: Evaluating Code Generation with Large Language Models | Oct 3, 2024 | Code Generation | CodeCode Available | 2 |
| Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes | May 3, 2023 | | CodeCode Available | 2 |
| Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context | Sep 15, 2023 | | CodeCode Available | 2 |
| Temporally Consistent Unbalanced Optimal Transport for Unsupervised Action Segmentation | Apr 1, 2024 | Action SegmentationSegmentation | CodeCode Available | 2 |
| One-Step Diffusion Distillation through Score Implicit Matching | Oct 22, 2024 | | CodeCode Available | 2 |
| Leveraging Reasoning Model Answers to Enhance Non-Reasoning Model Capability | Apr 13, 2025 | model | CodeCode Available | 2 |
| Efficient Speech Enhancement via Embeddings from Pre-trained Generative Audioencoders | Jun 13, 2025 | Speech Enhancement | CodeCode Available | 2 |
| Learning local equivariant representations for quantum operators | Jul 8, 2024 | Computational Efficiency | CodeCode Available | 2 |
| BYOL for Audio: Exploring Pre-trained General-purpose Audio Representations | Apr 15, 2022 | Self-Supervised Learning | CodeCode Available | 2 |
| Scaling Relationship on Learning Mathematical Reasoning with Large Language Models | Aug 3, 2023 | Arithmetic ReasoningGSM8K | CodeCode Available | 2 |
| ETSformer: Exponential Smoothing Transformers for Time-series Forecasting | Feb 3, 2022 | Time SeriesTime Series Analysis | CodeCode Available | 2 |
| Investigating Affective Use and Emotional Well-being on ChatGPT | Apr 4, 2025 | Privacy Preserving | CodeCode Available | 2 |
| LLM Processes: Numerical Predictive Distributions Conditioned on Natural Language | May 21, 2024 | regression | CodeCode Available | 2 |
| AnySat: One Earth Observation Model for Many Resolutions, Scales, and Modalities | Dec 18, 2024 | Change DetectionDiversity | CodeCode Available | 2 |
| ZnTrack -- Data as Code | Jan 19, 2024 | Management | CodeCode Available | 2 |
| Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism | Sep 17, 2019 | GPULAMBADA | CodeCode Available | 2 |
| Autonomous Improvement of Instruction Following Skills via Foundation Models | Jul 30, 2024 | Image GenerationInstruction Following | CodeCode Available | 2 |
| MemoryBank: Enhancing Large Language Models with Long-Term Memory | May 17, 2023 | Chatbot | CodeCode Available | 2 |
| FlashST: A Simple and Universal Prompt-Tuning Framework for Traffic Prediction | May 28, 2024 | In-Context LearningPrediction | CodeCode Available | 2 |
| Sailing AI by the Stars: A Survey of Learning from Rewards in Post-Training and Test-Time Scaling of Large Language Models | May 5, 2025 | Active Learning | CodeCode Available | 2 |
| SuperEdit: Rectifying and Facilitating Supervision for Instruction-Based Image Editing | May 5, 2025 | Triplet | CodeCode Available | 2 |
| GauSS-MI: Gaussian Splatting Shannon Mutual Information for Active 3D Reconstruction | Apr 29, 2025 | 3DGS3D Reconstruction | CodeCode Available | 2 |
| Unified Continuous Generative Models | May 12, 2025 | Image Generation | CodeCode Available | 2 |
| Text-based Animatable 3D Avatars with Morphable Model Alignment | Apr 22, 2025 | 3D Generation3DGS | CodeCode Available | 2 |
| Miipher-2: A Universal Speech Restoration Model for Million-Hour Scale Data Restoration | May 7, 2025 | Computational Efficiency | CodeCode Available | 2 |
| SAS-Bench: A Fine-Grained Benchmark for Evaluating Short Answer Scoring with Large Language Models | May 12, 2025 | | CodeCode Available | 2 |
| Text-to-CadQuery: A New Paradigm for CAD Generation with Scalable Large Model Capabilities | May 10, 2025 | Spatial Reasoning | CodeCode Available | 2 |
| Boosting Global-Local Feature Matching via Anomaly Synthesis for Multi-Class Point Cloud Anomaly Detection | May 12, 2025 | Anomaly Detection | CodeCode Available | 2 |
| A Tutorial on Structural Identifiability of Epidemic Models Using StructuralIdentifiability.jl | May 15, 2025 | parameter estimation | CodeCode Available | 2 |
| DexGarmentLab: Dexterous Garment Manipulation Environment with Generalizable Policy | May 16, 2025 | Reinforcement Learning (RL) | CodeCode Available | 2 |
| RBF++: Quantifying and Optimizing Reasoning Boundaries across Measurable and Unmeasurable Capabilities for Chain-of-Thought Reasoning | May 19, 2025 | | CodeCode Available | 2 |