| PersFormer: 3D Lane Detection via Perspective Transformer and the OpenLane Benchmark | Mar 21, 2022 | 3D Lane DetectionAutonomous Driving | CodeCode Available | 2 |
| A Survey of Financial AI: Architectures, Advances and Open Challenges | Nov 1, 2024 | Decision MakingPortfolio Optimization | CodeCode Available | 2 |
| IMKGA-SM: Interpretable Multimodal Knowledge Graph Answer Prediction via Sequence Modeling | Jan 6, 2023 | Link PredictionOptical Character Recognition | CodeCode Available | 2 |
| Habitat: A Platform for Embodied AI Research | Apr 2, 2019 | BenchmarkingGPU | CodeCode Available | 2 |
| Masked Siamese Networks for Label-Efficient Learning | Apr 14, 2022 | image-classificationImage Classification | CodeCode Available | 2 |
| Scaling Language Models: Methods, Analysis & Insights from Training Gopher | Dec 8, 2021 | Abstract AlgebraAnachronisms | CodeCode Available | 2 |
| Mamba-ST: State Space Model for Efficient Style Transfer | Sep 16, 2024 | MambaStyle Transfer | CodeCode Available | 2 |
| recommenderlab: An R Framework for Developing and Testing Recommendation Algorithms | May 24, 2022 | Recommendation Systems | CodeCode Available | 2 |
| GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities | Jun 17, 2024 | Audio Question AnsweringInstruction Following | CodeCode Available | 2 |
| LayerTracer: Cognitive-Aligned Layered SVG Synthesis via Diffusion Transformer | Feb 3, 2025 | | CodeCode Available | 2 |
| RET-CLIP: A Retinal Image Foundation Model Pre-trained with Clinical Diagnostic Reports | May 23, 2024 | DiagnosticMulti-Label Classification | CodeCode Available | 2 |
| MaGGIe: Masked Guided Gradual Human Instance Matting | Apr 24, 2024 | Image MattingVideo Matting | CodeCode Available | 2 |
| GAOKAO-MM: A Chinese Human-Level Benchmark for Multimodal Models Evaluation | Feb 24, 2024 | | CodeCode Available | 2 |
| Phi-4 Technical Report | Dec 12, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| 3D UX-Net: A Large Kernel Volumetric ConvNet Modernizing Hierarchical Transformer for Medical Image Segmentation | Sep 29, 2022 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 |
| PubTables-1M: Towards comprehensive table extraction from unstructured documents | Sep 30, 2021 | Articlesobject-detection | CodeCode Available | 2 |
| CoqPilot, a plugin for LLM-based generation of proofs | Oct 25, 2024 | Benchmarking | CodeCode Available | 2 |
| Formalizing and Benchmarking Prompt Injection Attacks and Defenses | Oct 19, 2023 | Benchmarking | CodeCode Available | 2 |
| AI-Newton: A Concept-Driven Physical Law Discovery System without Prior Physical Knowledge | Apr 2, 2025 | scientific discovery | CodeCode Available | 2 |
| Wind Noise Reduction with a Diffusion-based Stochastic Regeneration Model | Jun 22, 2023 | | CodeCode Available | 2 |
| DeeperHistReg: Robust Whole Slide Images Registration Framework | Apr 19, 2024 | whole slide images | CodeCode Available | 2 |
| Not All Tokens Are Equal: Human-centric Visual Analysis via Token Clustering Transformer | Apr 19, 2022 | 2D Human Pose Estimation3D Human Pose Estimation | CodeCode Available | 2 |
| Common Diffusion Noise Schedules and Sample Steps are Flawed | May 15, 2023 | | CodeCode Available | 2 |
| Multi-Target XGBoostLSS Regression | Oct 13, 2022 | regression | CodeCode Available | 2 |
| Recent advances in the Self-Referencing Embedding Strings (SELFIES) library | Feb 7, 2023 | | CodeCode Available | 2 |
| RETVec: Resilient and Efficient Text Vectorizer | Feb 18, 2023 | Adversarial TextMetric Learning | CodeCode Available | 2 |
| Document Expansion by Query Prediction | Apr 17, 2019 | Passage Re-RankingPrediction | CodeCode Available | 2 |
| Benchmarking Synthetic Tabular Data: A Multi-Dimensional Evaluation Framework | Apr 2, 2025 | BenchmarkingSynthetic Data Generation | CodeCode Available | 2 |
| EdgeGaussians -- 3D Edge Mapping via Gaussian Splatting | Sep 19, 2024 | | CodeCode Available | 2 |
| OR-LLM-Agent: Automating Modeling and Solving of Operations Research Optimization Problem with Reasoning Large Language Model | Mar 13, 2025 | AI AgentLanguage Modeling | CodeCode Available | 2 |
| RobustNeRF: Ignoring Distractors with Robust Losses | Feb 2, 2023 | NeRF | CodeCode Available | 2 |
| Building Normalizing Flows with Stochastic Interpolants | Sep 30, 2022 | BenchmarkingDensity Estimation | CodeCode Available | 2 |
| Efficient World Models with Context-Aware Tokenization | Jun 27, 2024 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 2 |
| Empowering Large Language Models to Set up a Knowledge Retrieval Indexer via Self-Learning | May 27, 2024 | Question AnsweringRAG | CodeCode Available | 2 |
| PanoDreamer: Optimization-Based Single Image to 360 3D Scene With Diffusion | Dec 6, 2024 | 3D Scene ReconstructionDepth Estimation | CodeCode Available | 2 |
| Flaming-hot Initiation with Regular Execution Sampling for Large Language Models | Oct 28, 2024 | DiversityMath | CodeCode Available | 2 |
| aMUSEd: An Open MUSE Reproduction | Jan 3, 2024 | Image GenerationText to Image Generation | CodeCode Available | 2 |
| SeaFormer++: Squeeze-enhanced Axial Transformer for Mobile Visual Recognition | Jan 30, 2023 | Feature Upsamplingimage-classification | CodeCode Available | 2 |
| Once-for-All: Controllable Generative Image Compression with Dynamic Granularity Adaption | Jun 2, 2024 | AllImage Compression | CodeCode Available | 2 |
| Distillation-Supervised Convolutional Low-Rank Adaptation for Efficient Image Super-Resolution | Apr 15, 2025 | Image Super-ResolutionKnowledge Distillation | CodeCode Available | 2 |
| Generative Modeling for Mathematical Discovery | Mar 14, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| CybORG++: An Enhanced Gym for the Development of Autonomous Cyber Agents | Oct 18, 2024 | | CodeCode Available | 2 |
| Physics-based Deep Learning | Sep 11, 2021 | Deep LearningPhysical Simulations | CodeCode Available | 2 |
| LHRS-Bot: Empowering Remote Sensing with VGI-Enhanced Large Multimodal Language Model | Feb 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks | Oct 28, 2024 | Quantization | CodeCode Available | 2 |
| Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable 2D Repainting | Dec 20, 2023 | 3D GenerationImage Generation | CodeCode Available | 2 |
| Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Prior for Zero-shot Speaker Adaptation | Nov 8, 2023 | Style TransferVoice Conversion | CodeCode Available | 2 |
| MARS: An Instance-aware, Modular and Realistic Simulator for Autonomous Driving | Jul 27, 2023 | Autonomous DrivingNeRF | CodeCode Available | 2 |
| Text-to-3D using Gaussian Splatting | Sep 28, 2023 | 3D GenerationText to 3D | CodeCode Available | 2 |
| UniPose: A Unified Multimodal Framework for Human Pose Comprehension, Generation and Editing | Nov 25, 2024 | | CodeCode Available | 2 |