| ChatDoctor: A Medical Chat Model Fine-Tuned on a Large Language Model Meta-AI (LLaMA) Using Medical Domain Knowledge | Mar 24, 2023 | Information RetrievalLanguage Modeling | CodeCode Available | 4 |
| A Survey on Large Language Models for Recommendation | May 31, 2023 | Recommendation Systems | CodeCode Available | 4 |
| Segment Anything in Medical Images | Apr 24, 2023 | DiagnosticImage Segmentation | CodeCode Available | 4 |
| mPLUG-Owl: Modularization Empowers Large Language Models with Multimodality | Apr 27, 2023 | Visual Question Answering (VQA)Zero-Shot Video Question Answer | CodeCode Available | 4 |
| The Ideal Continual Learner: An Agent That Never Forgets | Apr 29, 2023 | Continual LearningGeneralization Bounds | CodeCode Available | 4 |
| OK-Robot: What Really Matters in Integrating Open-Knowledge Models for Robotics | Jan 22, 2024 | object-detectionObject Detection | CodeCode Available | 4 |
| The Segment Anything Model (SAM) for Remote Sensing Applications: From Zero to One Shot | Jun 29, 2023 | Image SegmentationSemantic Segmentation | CodeCode Available | 4 |
| Turning Whisper into Real-Time Transcription System | Jul 27, 2023 | speech-recognitionSpeech Recognition | CodeCode Available | 4 |
| EasyJailbreak: A Unified Framework for Jailbreaking Large Language Models | Mar 18, 2024 | | CodeCode Available | 4 |
| Neural general circulation models optimized to predict satellite-based precipitation observations | Dec 16, 2024 | | CodeCode Available | 4 |
| Don't Ignore Dual Logic Ability of LLMs while Privatizing: A Data-Intensive Analysis in Medical Domain | Sep 8, 2023 | Fact CheckingKnowledge Graphs | CodeCode Available | 4 |
| DreamGaussian: Generative Gaussian Splatting for Efficient 3D Content Creation | Sep 28, 2023 | 3D Generation | CodeCode Available | 4 |
| An Empirical Study of Instruction-tuning Large Language Models in Chinese | Oct 11, 2023 | | CodeCode Available | 4 |
| Unifying the Perspectives of NLP and Software Engineering: A Survey on Language Models for Code | Nov 14, 2023 | Language Model EvaluationLanguage Modeling | CodeCode Available | 4 |
| OpenProteinSet: Training data for structural biology at scale | Aug 10, 2023 | Protein DesignProtein Structure Prediction | CodeCode Available | 4 |
| OpenAGI: When LLM Meets Domain Experts | Apr 10, 2023 | BenchmarkingNatural Language Queries | CodeCode Available | 4 |
| Efficient Parameter Optimisation for Quantum Kernel Alignment: A Sub-sampling Approach in Variational Training | Jan 5, 2024 | Quantum Machine Learning | CodeCode Available | 4 |
| PIN-SLAM: LiDAR SLAM Using a Point-Based Implicit Neural Representation for Achieving Global Map Consistency | Jan 17, 2024 | GPUIncremental Learning | CodeCode Available | 4 |
| Open-MAGVIT2: An Open-Source Project Toward Democratizing Auto-regressive Visual Generation | Sep 6, 2024 | Image GenerationImage Reconstruction | CodeCode Available | 4 |
| MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis | Feb 8, 2024 | AttributeConditional Text-to-Image Synthesis | CodeCode Available | 4 |
| Bryndza at ClimateActivism 2024: Stance, Target and Hate Event Detection via Retrieval-Augmented GPT-4 and LLaMA | Feb 9, 2024 | Event DetectionHate Speech Detection | CodeCode Available | 4 |
| AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning | Feb 23, 2024 | | CodeCode Available | 4 |
| shapiq: Shapley Interactions for Machine Learning | Oct 2, 2024 | BenchmarkingData Valuation | CodeCode Available | 4 |
| ResAdapter: Domain Consistent Resolution Adapter for Diffusion Models | Mar 4, 2024 | Image Generation | CodeCode Available | 4 |
| Tiny Machine Learning: Progress and Futures | Mar 28, 2024 | Deep Learning | CodeCode Available | 4 |
| End-to-End Autonomous Driving through V2X Cooperation | Mar 31, 2024 | Autonomous Driving | CodeCode Available | 4 |
| MiniGPT4-Video: Advancing Multimodal LLMs for Video Understanding with Interleaved Visual-Textual Tokens | Apr 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| SkyReels-A1: Expressive Portrait Animation in Video Diffusion Transformers | Feb 15, 2025 | Image AnimationPortrait Animation | CodeCode Available | 4 |
| PLLaVA : Parameter-free LLaVA Extension from Images to Videos for Video Dense Captioning | Apr 25, 2024 | Dense CaptioningMVBench | CodeCode Available | 4 |
| Direct Training High-Performance Deep Spiking Neural Networks: A Review of Theories and Methods | May 6, 2024 | | CodeCode Available | 4 |
| LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit | May 9, 2024 | BenchmarkingComputational Efficiency | CodeCode Available | 4 |
| Morphological Prototyping for Unsupervised Slide Representation Learning in Computational Pathology | May 19, 2024 | Multiple Instance LearningRepresentation Learning | CodeCode Available | 4 |
| OmniGlue: Generalizable Feature Matching with Foundation Model Guidance | May 21, 2024 | | CodeCode Available | 4 |
| LLMs Meet Multimodal Generation and Editing: A Survey | May 29, 2024 | multimodal generationSurvey | CodeCode Available | 4 |
| Grokfast: Accelerated Grokking by Amplifying Slow Gradients | May 30, 2024 | | CodeCode Available | 4 |
| HelpSteer2: Open-source dataset for training top-performing reward models | Jun 12, 2024 | Attribute | CodeCode Available | 4 |
| Nemotron-4 340B Technical Report | Jun 17, 2024 | Synthetic Data Generation | CodeCode Available | 4 |
| Improving Multi-modal Recommender Systems by Denoising and Aligning Multi-modal Content and User Feedback | Jun 18, 2024 | DenoisingRecommendation Systems | CodeCode Available | 4 |
| Unleashing the Emergent Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration | Jul 11, 2023 | HallucinationLogic Grid Puzzle | CodeCode Available | 4 |
| Colossal-AI: A Unified Deep Learning System For Large-Scale Parallel Training | Oct 28, 2021 | Deep LearningGPU | CodeCode Available | 4 |
| SSL4EO-L: Datasets and Foundation Models for Landsat Imagery | Jun 15, 2023 | Cloud DetectionEarth Observation | CodeCode Available | 4 |
| Continual Learning with Pre-Trained Models: A Survey | Jan 29, 2024 | Continual LearningFairness | CodeCode Available | 4 |
| MAVIS: Mathematical Visual Instruction Tuning with an Automatic Data Engine | Jul 11, 2024 | Contrastive LearningLanguage Modelling | CodeCode Available | 4 |
| Tarsier: Recipes for Training and Evaluating Large Video Description Models | Jun 30, 2024 | Video CaptioningVideo Description | CodeCode Available | 4 |
| Wavelet Convolutions for Large Receptive Fields | Jul 8, 2024 | 2D Object Detection2D Semantic Segmentation | CodeCode Available | 4 |
| A Survey of Attacks on Large Vision-Language Models: Resources, Advances, and Future Trends | Jul 10, 2024 | Data Poisoning | CodeCode Available | 4 |
| Stable-Hair: Real-World Hair Transfer via Diffusion Model | Jul 19, 2024 | Triplet | CodeCode Available | 4 |
| Timer: Generative Pre-trained Transformers Are Large Time Series Models | Feb 4, 2024 | Anomaly DetectionImputation | CodeCode Available | 4 |
| Time-MoE: Billion-Scale Time Series Foundation Models with Mixture of Experts | Sep 24, 2024 | Computational EfficiencyMixture-of-Experts | CodeCode Available | 4 |
| Whisper in Medusa's Ear: Multi-head Efficient Decoding for Transformer-based ASR | Sep 24, 2024 | | CodeCode Available | 4 |