| FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects | Dec 13, 2023 | 3D Object Detection3D Object Tracking | CodeCode Available | 4 |
| Unmasking and Improving Data Credibility: A Study with Datasets for Training Harmless Language Models | Nov 19, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| Video-LLaVA: Learning United Visual Representation by Alignment Before Projection | Nov 16, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| Unifying the Perspectives of NLP and Software Engineering: A Survey on Language Models for Code | Nov 14, 2023 | Language Model EvaluationLanguage Modeling | CodeCode Available | 4 |
| SPHINX: The Joint Mixing of Weights, Tasks, and Visual Embeddings for Multi-modal Large Language Models | Nov 13, 2023 | Described Object DetectionLanguage Modeling | CodeCode Available | 4 |
| mPLUG-Owl2: Revolutionizing Multi-modal Large Language Model with Modality Collaboration | Nov 7, 2023 | 1 Image, 2*2 StitchingDecoder | CodeCode Available | 4 |
| Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation | Oct 9, 2023 | Action RecognitionImage Generation | CodeCode Available | 4 |
| Efficient Post-training Quantization with FP8 Formats | Sep 26, 2023 | image-classificationImage Classification | CodeCode Available | 4 |
| Safurai 001: New Qualitative Approach for Code LLM Evaluation | Sep 20, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| A Survey on Large Language Model based Autonomous Agents | Aug 22, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 4 |