| Isaac Gym: High Performance GPU-Based Physics Simulation For Robot Learning | Aug 24, 2021 | CPUGPU | CodeCode Available | 2 | 5 |
| LibriSpeech-PC: Benchmark for Evaluation of Punctuation and Capitalization Capabilities of end-to-end ASR Models | Oct 4, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 2 | 5 |
| SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning | Feb 7, 2025 | | CodeCode Available | 2 | 5 |
| Harder Tasks Need More Experts: Dynamic Routing in MoE Models | Mar 12, 2024 | Computational EfficiencyMixture-of-Experts | CodeCode Available | 2 | 5 |
| SAM4MLLM: Enhance Multi-Modal Large Language Model for Referring Expression Segmentation | Sep 1, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| Towards Language Models That Can See: Computer Vision Through the LENS of Natural Language | Jun 28, 2023 | DescriptiveLanguage Modeling | CodeCode Available | 2 | 5 |
| PDF-WuKong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling | Oct 8, 2024 | document understandingLanguage Modeling | CodeCode Available | 2 | 5 |
| Adaptive Keyframe Sampling for Long Video Understanding | Jan 1, 2025 | Video Understanding | CodeCode Available | 2 | 5 |
| A Survey of Deep Learning for Mathematical Reasoning | Dec 20, 2022 | Deep LearningMath | CodeCode Available | 2 | 5 |
| SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction | Oct 31, 2023 | PredictionSemantic Similarity | CodeCode Available | 2 | 5 |
| Foundation Models for Spatio-Temporal Data Science: A Tutorial and Survey | Mar 12, 2025 | Management | CodeCode Available | 2 | 5 |
| ByteTransformer: A High-Performance Transformer Boosted for Variable-Length Inputs | Oct 6, 2022 | GPUVocal Bursts Intensity Prediction | CodeCode Available | 2 | 5 |
| DDPM-CD: Denoising Diffusion Probabilistic Models as Feature Extractors for Change Detection | Jun 23, 2022 | Change DetectionDecision Making | CodeCode Available | 2 | 5 |
| Data Science with LLMs and Interpretable Models | Feb 22, 2024 | Additive modelsQuestion Answering | CodeCode Available | 2 | 5 |
| Chat-3D: Data-efficiently Tuning Large Language Model for Universal Dialogue of 3D Scenes | Aug 17, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| Speech Foundation Model Ensembles for the Controlled Singing Voice Deepfake Detection (CtrSVDD) Challenge 2024 | Sep 3, 2024 | DeepFake DetectionFace Swapping | CodeCode Available | 2 | 5 |
| Preventing Local Pitfalls in Vector Quantization via Optimal Transport | Dec 19, 2024 | Image ReconstructionQuantization | CodeCode Available | 2 | 5 |
| PersFormer: 3D Lane Detection via Perspective Transformer and the OpenLane Benchmark | Mar 21, 2022 | 3D Lane DetectionAutonomous Driving | CodeCode Available | 2 | 5 |
| A Survey of Financial AI: Architectures, Advances and Open Challenges | Nov 1, 2024 | Decision MakingPortfolio Optimization | CodeCode Available | 2 | 5 |
| IMKGA-SM: Interpretable Multimodal Knowledge Graph Answer Prediction via Sequence Modeling | Jan 6, 2023 | Link PredictionOptical Character Recognition | CodeCode Available | 2 | 5 |
| Habitat: A Platform for Embodied AI Research | Apr 2, 2019 | BenchmarkingGPU | CodeCode Available | 2 | 5 |
| Masked Siamese Networks for Label-Efficient Learning | Apr 14, 2022 | image-classificationImage Classification | CodeCode Available | 2 | 5 |
| Scaling Language Models: Methods, Analysis & Insights from Training Gopher | Dec 8, 2021 | Abstract AlgebraAnachronisms | CodeCode Available | 2 | 5 |
| Mamba-ST: State Space Model for Efficient Style Transfer | Sep 16, 2024 | MambaStyle Transfer | CodeCode Available | 2 | 5 |
| recommenderlab: An R Framework for Developing and Testing Recommendation Algorithms | May 24, 2022 | Recommendation Systems | CodeCode Available | 2 | 5 |
| GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities | Jun 17, 2024 | Audio Question AnsweringInstruction Following | CodeCode Available | 2 | 5 |
| LayerTracer: Cognitive-Aligned Layered SVG Synthesis via Diffusion Transformer | Feb 3, 2025 | | CodeCode Available | 2 | 5 |
| RET-CLIP: A Retinal Image Foundation Model Pre-trained with Clinical Diagnostic Reports | May 23, 2024 | DiagnosticMulti-Label Classification | CodeCode Available | 2 | 5 |
| MaGGIe: Masked Guided Gradual Human Instance Matting | Apr 24, 2024 | Image MattingVideo Matting | CodeCode Available | 2 | 5 |
| GAOKAO-MM: A Chinese Human-Level Benchmark for Multimodal Models Evaluation | Feb 24, 2024 | | CodeCode Available | 2 | 5 |
| Phi-4 Technical Report | Dec 12, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| 3D UX-Net: A Large Kernel Volumetric ConvNet Modernizing Hierarchical Transformer for Medical Image Segmentation | Sep 29, 2022 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 | 5 |
| PubTables-1M: Towards comprehensive table extraction from unstructured documents | Sep 30, 2021 | Articlesobject-detection | CodeCode Available | 2 | 5 |
| CoqPilot, a plugin for LLM-based generation of proofs | Oct 25, 2024 | Benchmarking | CodeCode Available | 2 | 5 |
| Formalizing and Benchmarking Prompt Injection Attacks and Defenses | Oct 19, 2023 | Benchmarking | CodeCode Available | 2 | 5 |
| AI-Newton: A Concept-Driven Physical Law Discovery System without Prior Physical Knowledge | Apr 2, 2025 | scientific discovery | CodeCode Available | 2 | 5 |
| Wind Noise Reduction with a Diffusion-based Stochastic Regeneration Model | Jun 22, 2023 | | CodeCode Available | 2 | 5 |
| DeeperHistReg: Robust Whole Slide Images Registration Framework | Apr 19, 2024 | whole slide images | CodeCode Available | 2 | 5 |
| Not All Tokens Are Equal: Human-centric Visual Analysis via Token Clustering Transformer | Apr 19, 2022 | 2D Human Pose Estimation3D Human Pose Estimation | CodeCode Available | 2 | 5 |
| Common Diffusion Noise Schedules and Sample Steps are Flawed | May 15, 2023 | | CodeCode Available | 2 | 5 |
| Multi-Target XGBoostLSS Regression | Oct 13, 2022 | regression | CodeCode Available | 2 | 5 |
| Recent advances in the Self-Referencing Embedding Strings (SELFIES) library | Feb 7, 2023 | | CodeCode Available | 2 | 5 |
| RETVec: Resilient and Efficient Text Vectorizer | Feb 18, 2023 | Adversarial TextMetric Learning | CodeCode Available | 2 | 5 |
| Document Expansion by Query Prediction | Apr 17, 2019 | Passage Re-RankingPrediction | CodeCode Available | 2 | 5 |
| Benchmarking Synthetic Tabular Data: A Multi-Dimensional Evaluation Framework | Apr 2, 2025 | BenchmarkingSynthetic Data Generation | CodeCode Available | 2 | 5 |
| EdgeGaussians -- 3D Edge Mapping via Gaussian Splatting | Sep 19, 2024 | | CodeCode Available | 2 | 5 |
| OR-LLM-Agent: Automating Modeling and Solving of Operations Research Optimization Problem with Reasoning Large Language Model | Mar 13, 2025 | AI AgentLanguage Modeling | CodeCode Available | 2 | 5 |
| RobustNeRF: Ignoring Distractors with Robust Losses | Feb 2, 2023 | NeRF | CodeCode Available | 2 | 5 |
| Building Normalizing Flows with Stochastic Interpolants | Sep 30, 2022 | BenchmarkingDensity Estimation | CodeCode Available | 2 | 5 |
| Efficient World Models with Context-Aware Tokenization | Jun 27, 2024 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 2 | 5 |