| Knowledge-tuning Large Language Models with Structured Medical Knowledge Bases for Reliable Response Generation in Chinese | Sep 8, 2023 | Domain AdaptationHallucination | CodeCode Available | 4 | 5 |
| MV-DUSt3R+: Single-Stage Scene Reconstruction from Sparse Views In 2 Seconds | Dec 9, 2024 | Camera CalibrationCamera Pose Estimation | CodeCode Available | 4 | 5 |
| BLOOM: A 176B-Parameter Open-Access Multilingual Language Model | Nov 9, 2022 | DecoderLanguage Modeling | CodeCode Available | 4 | 5 |
| Gender Representation in TV and Radio: Automatic Information Extraction methods versus Manual Analyses | Jun 14, 2024 | | CodeCode Available | 4 | 5 |
| BEVFormer v2: Adapting Modern Image Backbones to Bird's-Eye-View Recognition via Perspective Supervision | Nov 18, 2022 | 3D Object Detection | CodeCode Available | 4 | 5 |
| NeRDi: Single-View NeRF Synthesis with Language-Guided Diffusion as General Image Priors | Dec 6, 2022 | 3D Generation3D geometry | CodeCode Available | 4 | 5 |
| RealisDance-DiT: Simple yet Strong Baseline towards Controllable Character Animation in the Wild | Apr 21, 2025 | | CodeCode Available | 4 | 5 |
| COS-Mix: Cosine Similarity and Distance Fusion for Improved Information Retrieval | Jun 2, 2024 | Information RetrievalRAG | CodeCode Available | 4 | 5 |
| UniScene: Unified Occupancy-centric Driving Scene Generation | Dec 6, 2024 | Autonomous DrivingScene Generation | CodeCode Available | 4 | 5 |
| Motion-X++: A Large-Scale Multimodal 3D Whole-body Human Motion Dataset | Jan 9, 2025 | Human Mesh RecoveryMotion Generation | CodeCode Available | 4 | 5 |
| Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction | Sep 26, 2024 | 3D ReconstructionDenoising | CodeCode Available | 4 | 5 |
| Goldfish: Vision-Language Understanding of Arbitrarily Long Videos | Jul 17, 2024 | RetrievalVideo Understanding | CodeCode Available | 4 | 5 |
| When Does Perceptual Alignment Benefit Vision Representations? | Oct 14, 2024 | Depth EstimationImage Generation | CodeCode Available | 4 | 5 |
| MLPerf Power: Benchmarking the Energy Efficiency of Machine Learning Systems from Microwatts to Megawatts for Sustainable AI | Oct 15, 2024 | Benchmarking | CodeCode Available | 4 | 5 |
| Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models | Jan 14, 2025 | BenchmarkingText-to-Video Generation | CodeCode Available | 4 | 5 |
| A foundation model for human-AI collaboration in medical literature mining | Jan 27, 2025 | Literature MiningSystematic Literature Review | CodeCode Available | 4 | 5 |
| Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation | Oct 9, 2023 | Action RecognitionImage Generation | CodeCode Available | 4 | 5 |
| PRISM: A Multi-Modal Generative Foundation Model for Slide-Level Histopathology | May 16, 2024 | whole slide images | CodeCode Available | 4 | 5 |
| FFCV: Accelerating Training by Removing Data Bottlenecks | Jun 21, 2023 | CPUGPU | CodeCode Available | 4 | 5 |
| Trace is the Next AutoDiff: Generative Optimization with Rich Feedback, Execution Traces, and LLMs | Jun 23, 2024 | | CodeCode Available | 4 | 5 |
| Building a Culture of Reproducibility in Academic Research | Dec 27, 2022 | Cultural Vocal Bursts Intensity Prediction | CodeCode Available | 4 | 5 |
| A deep learning framework for efficient pathology image analysis | Feb 18, 2025 | BenchmarkingDeep Learning | CodeCode Available | 4 | 5 |
| Story-Adapter: A Training-free Iterative Framework for Long Story Visualization | Oct 8, 2024 | Image GenerationStory Visualization | CodeCode Available | 4 | 5 |
| Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention | Feb 16, 2025 | | CodeCode Available | 4 | 5 |
| CRUXEval: A Benchmark for Code Reasoning, Understanding and Execution | Jan 5, 2024 | HumanEvalPrediction | CodeCode Available | 4 | 5 |
| VideoEval-Pro: Robust and Realistic Long Video Understanding Evaluation | May 20, 2025 | MMEMultiple-choice | CodeCode Available | 4 | 5 |
| CitationMap: A Python Tool to Identify and Visualize Your Google Scholar Citations Around the World | Aug 2, 2024 | Citation VisualizationData Visualization | CodeCode Available | 4 | 5 |
| Real-time volumetric rendering of dynamic humans | Mar 21, 2023 | 3D ReconstructionGPU | CodeCode Available | 4 | 5 |
| Improving Parallel Program Performance with LLM Optimizers via Agent-System Interfaces | Oct 21, 2024 | Code Generationscientific discovery | CodeCode Available | 4 | 5 |
| DeepFakes and Beyond: A Survey of Face Manipulation and Fake Detection | Jan 1, 2020 | AttributeDeepFake Detection | CodeCode Available | 4 | 5 |
| Inductive Moment Matching | Mar 10, 2025 | | CodeCode Available | 4 | 5 |
| Polysemous codes | Sep 7, 2016 | Quantization | CodeCode Available | 4 | 5 |
| SWE-bench: Can Language Models Resolve Real-World GitHub Issues? | Oct 10, 2023 | Bug fixingCode Generation | CodeCode Available | 4 | 5 |
| RUMI: Rummaging Using Mutual Information | Aug 19, 2024 | Model Predictive ControlObject | CodeCode Available | 4 | 5 |
| ChatGPT Outperforms Crowd-Workers for Text-Annotation Tasks | Mar 27, 2023 | text annotationText Classification | CodeCode Available | 4 | 5 |
| A General Theoretical Paradigm to Understand Learning from Human Preferences | Oct 18, 2023 | | CodeCode Available | 4 | 5 |
| Self-Supervised Geometry-Guided Initialization for Robust Monocular Visual Odometry | Jun 3, 2024 | Depth EstimationMonocular Depth Estimation | CodeCode Available | 4 | 5 |
| MUSE: Machine Unlearning Six-Way Evaluation for Language Models | Jul 8, 2024 | ArticlesMachine Unlearning | CodeCode Available | 4 | 5 |
| Stock Price Prediction via Discovering Multi-Frequency Trading Patterns | Aug 13, 2017 | PredictionStock Price Prediction | CodeCode Available | 4 | 5 |
| The Model Openness Framework: Promoting Completeness and Openness for Reproducibility, Transparency, and Usability in Artificial Intelligence | Mar 20, 2024 | | CodeCode Available | 4 | 5 |
| Fast Transformer Decoding: One Write-Head is All You Need | Nov 6, 2019 | AllLanguage Modelling | CodeCode Available | 4 | 5 |
| OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data | Oct 2, 2024 | Arithmetic ReasoningLarge Language Model | CodeCode Available | 4 | 5 |
| DisCo-DSO: Coupling Discrete and Continuous Optimization for Efficient Generative Design in Hybrid Spaces | Dec 15, 2024 | Symbolic Regression | CodeCode Available | 4 | 5 |
| Ideas in Inference-time Scaling can Benefit Generative Pre-training Algorithms | Mar 10, 2025 | | CodeCode Available | 4 | 5 |
| Tiny-PULP-Dronets: Squeezing Neural Networks for Faster and Lighter Inference on Multi-Tasking Autonomous Nano-Drones | Jul 2, 2024 | Autonomous Navigation | CodeCode Available | 4 | 5 |
| ReARTeR: Retrieval-Augmented Reasoning with Trustworthy Process Rewarding | Jan 14, 2025 | RAGRetrieval | CodeCode Available | 4 | 5 |
| PointVLA: Injecting the 3D World into Vision-Language-Action Models | Mar 10, 2025 | Imitation LearningSpatial Reasoning | CodeCode Available | 4 | 5 |
| ViViD: Video Virtual Try-on using Diffusion Models | May 20, 2024 | Virtual Try-on | CodeCode Available | 4 | 5 |
| GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image | Mar 18, 2024 | 3D geometry3D Reconstruction | CodeCode Available | 4 | 5 |
| GeneFace: Generalized and High-Fidelity Audio-Driven 3D Talking Face Synthesis | Jan 31, 2023 | Face GenerationLip Reading | CodeCode Available | 4 | 5 |