| Machine Learning in Asset Management—Part 2: Portfolio Construction—Weight Optimization. The Journal of Financial Data Science | Mar 26, 2020 | ArticlesAsset Management | CodeCode Available | 2 | 5 |
| The N+ Implementation Details of RLHF with PPO: A Case Study on TL;DR Summarization | Mar 24, 2024 | reinforcement-learning | CodeCode Available | 2 | 5 |
| Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention Causality | Oct 7, 2024 | Causal Inferencecounterfactual | CodeCode Available | 2 | 5 |
| InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales | Jun 19, 2024 | DenoisingIn-Context Learning | CodeCode Available | 2 | 5 |
| MMInstruct: A High-Quality Multi-Modal Instruction Tuning Dataset with Extensive Diversity | Jul 22, 2024 | DiversityMultiple-choice | CodeCode Available | 2 | 5 |
| BANet: Bilateral Aggregation Network for Mobile Stereo Matching | Mar 5, 2025 | Stereo Matching | CodeCode Available | 2 | 5 |
| Axes that matter: PCA with a difference | Mar 9, 2025 | regression | CodeCode Available | 2 | 5 |
| Duoduo CLIP: Efficient 3D Understanding with Multi-View Images | Jun 17, 2024 | GPUObject | CodeCode Available | 2 | 5 |
| ScribFormer: Transformer Makes CNN Work Better for Scribble-based Medical Image Segmentation | Feb 3, 2024 | DecoderImage Segmentation | CodeCode Available | 2 | 5 |
| Neural Plasticity-Inspired Multimodal Foundation Model for Earth Observation | Mar 22, 2024 | Earth Observation | CodeCode Available | 2 | 5 |
| PromptDet: Towards Open-vocabulary Detection using Uncurated Images | Mar 30, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| Geometric Transformer for Fast and Robust Point Cloud Registration | Feb 14, 2022 | Metric LearningPoint Cloud Registration | CodeCode Available | 2 | 5 |
| Weak-to-Strong Extrapolation Expedites Alignment | Apr 25, 2024 | | CodeCode Available | 2 | 5 |
| A Survey of Pretraining on Graphs: Taxonomy, Methods, and Applications | Feb 16, 2022 | Drug DiscoveryGraph Representation Learning | CodeCode Available | 2 | 5 |
| Practical Compact Deep Compressed Sensing | Nov 20, 2024 | compressed sensing | CodeCode Available | 2 | 5 |
| PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers | Jun 18, 2024 | Decision MakingRAG | CodeCode Available | 2 | 5 |
| BHViT: Binarized Hybrid Vision Transformer | Mar 4, 2025 | BinarizationQuantization | CodeCode Available | 2 | 5 |
| Observational Scaling Laws and the Predictability of Language Model Performance | May 17, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| Improving Synthetic Image Detection Towards Generalization: An Image Transformation Perspective | Aug 13, 2024 | Image GenerationSynthetic Image Detection | CodeCode Available | 2 | 5 |
| Melody transcription via generative pre-training | Dec 4, 2022 | Chord RecognitionInformation Retrieval | CodeCode Available | 2 | 5 |
| ExT5: Towards Extreme Multi-Task Scaling for Transfer Learning | Nov 22, 2021 | DenoisingMulti-Task Learning | CodeCode Available | 2 | 5 |
| Binding Language Models in Symbolic Languages | Oct 6, 2022 | Language ModellingSemantic Parsing | CodeCode Available | 2 | 5 |
| FreGrad: Lightweight and Fast Frequency-aware Diffusion Vocoder | Jan 18, 2024 | | CodeCode Available | 2 | 5 |
| Isotropic3D: Image-to-3D Generation Based on a Single CLIP Embedding | Mar 15, 2024 | 3D GenerationImage to 3D | CodeCode Available | 2 | 5 |
| ReFinED: An Efficient Zero-shot-capable Approach to End-to-End Entity Linking | Jul 8, 2022 | Entity DisambiguationEntity Linking | CodeCode Available | 2 | 5 |
| DetailCLIP: Detail-Oriented CLIP for Fine-Grained Tasks | Sep 10, 2024 | Contrastive LearningImage Reconstruction | CodeCode Available | 2 | 5 |
| Multi-instrument Music Synthesis with Spectrogram Diffusion | Jun 11, 2022 | DecoderGenerative Adversarial Network | CodeCode Available | 2 | 5 |
| Interpreting Object-level Foundation Models via Visual Precision Search | Nov 25, 2024 | Explainable Artificial Intelligence (XAI)Object | CodeCode Available | 2 | 5 |
| EDGE: Editable Dance Generation From Music | Nov 19, 2022 | DiversityMotion Synthesis | CodeCode Available | 2 | 5 |
| BEVWorld: A Multimodal World Model for Autonomous Driving via Unified BEV Latent Space | Jul 8, 2024 | Autonomous DrivingDecoder | CodeCode Available | 2 | 5 |
| A Systematic Review on the Evaluation of Large Language Models in Theory of Mind Tasks | Feb 12, 2025 | | CodeCode Available | 2 | 5 |
| Conifer: Improving Complex Constrained Instruction-Following Ability of Large Language Models | Apr 3, 2024 | Instruction Following | CodeCode Available | 2 | 5 |
| UltraSam: A Foundation Model for Ultrasound using Large Open-Access Segmentation Datasets | Nov 25, 2024 | Segmentation | CodeCode Available | 2 | 5 |
| Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models | Oct 11, 2023 | Code GenerationImage Generation | CodeCode Available | 2 | 5 |
| Event Stream-based Visual Object Tracking: A High-Resolution Benchmark Dataset and A Novel Baseline | Sep 26, 2023 | Knowledge DistillationObject Tracking | CodeCode Available | 2 | 5 |
| Scalable Diffusion Models with State Space Backbone | Feb 8, 2024 | Conditional Image GenerationImage Generation | CodeCode Available | 2 | 5 |
| Temporally Consistent Transformers for Video Generation | Oct 5, 2022 | MinecraftVideo Generation | CodeCode Available | 2 | 5 |
| Transcending the Limit of Local Window: Advanced Super-Resolution Transformer with Adaptive Token Dictionary | Jan 16, 2024 | Image Super-ResolutionSuper-Resolution | CodeCode Available | 2 | 5 |
| Meta Prompting for AI Systems | Nov 20, 2023 | Data InteractionGSM8K | CodeCode Available | 2 | 5 |
| VideoShield: Regulating Diffusion-based Video Generation Models via Watermarking | Jan 24, 2025 | DenoisingImage Generation | CodeCode Available | 2 | 5 |
| Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation | Feb 16, 2024 | Video Generation | CodeCode Available | 2 | 5 |
| FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion | Oct 27, 2022 | Data Augmentationtext annotation | CodeCode Available | 2 | 5 |
| Mimic before Reconstruct: Enhancing Masked Autoencoders with Feature Mimicking | Mar 9, 2023 | Contrastive LearningDecoder | CodeCode Available | 2 | 5 |
| MS-DETR: Efficient DETR Training with Mixed Supervision | Jan 8, 2024 | DecoderObject | CodeCode Available | 2 | 5 |
| MathCoder: Seamless Code Integration in LLMs for Enhanced Mathematical Reasoning | Oct 5, 2023 | Arithmetic ReasoningGSM8K | CodeCode Available | 2 | 5 |
| Bokehlicious: Photorealistic Bokeh Rendering with Controllable Apertures | Mar 20, 2025 | DeblurringZero-shot Generalization | CodeCode Available | 2 | 5 |
| Accelerating Transformers with Spectrum-Preserving Token Merging | May 25, 2024 | image-classificationImage Classification | CodeCode Available | 2 | 5 |
| Treat Visual Tokens as Text? But Your MLLM Only Needs Fewer Efforts to See | Oct 8, 2024 | | CodeCode Available | 2 | 5 |
| UniFormer: Unified Transformer for Efficient Spatiotemporal Representation Learning | Jan 12, 2022 | Representation Learning | CodeCode Available | 2 | 5 |
| OneIG-Bench: Omni-dimensional Nuanced Evaluation for Image Generation | Jun 9, 2025 | Image Generation | CodeCode Available | 2 | 5 |