| rLLM: Relational Table Learning with LLMs | Jul 29, 2024 | Classification, Node Classification | Code Available | 3 | 5 |
| WildGaussians: 3D Gaussian Splatting in the Wild | Jul 11, 2024 | 3DGS, 3D Scene Reconstruction | Code Available | 3 | 5 |
| VISA: Reasoning Video Object Segmentation via Large Language Models | Jul 16, 2024 | Decoder, Object | Code Available | 3 | 5 |
| Scaling Retrieval-Based Language Models with a Trillion-Token Datastore | Jul 9, 2024 | Language Modeling, Language Modelling | Code Available | 3 | 5 |
| Compact Language Models via Pruning and Knowledge Distillation | Jul 19, 2024 | Knowledge Distillation, Language Modeling | Code Available | 3 | 5 |
| PyABSA: A Modularized Framework for Reproducible Aspect-based Sentiment Analysis | Aug 2, 2022 | Aspect-Based Sentiment Analysis, Aspect-Based Sentiment Analysis (ABSA) | Code Available | 3 | 5 |
| Integer-Valued Training and Spike-Driven Inference Spiking Neural Network for High-performance and Energy-efficient Object Detection | Jul 30, 2024 | object-detection, Object Detection | Code Available | 3 | 5 |
| MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine | Aug 6, 2024 | Medical Visual Question Answering, Organ Detection | Code Available | 3 | 5 |
| 1.5-Pints Technical Report: Pretraining in Days, Not Months -- Your Language Model Thrives on Quality Data | Aug 7, 2024 | 16k, 2k | Code Available | 3 | 5 |
| NeuFlow v2: High-Efficiency Optical Flow Estimation on Edge Devices | Aug 19, 2024 | Optical Flow Estimation | Code Available | 3 | 5 |
| ABQ-LLM: Arbitrary-Bit Quantized Inference Acceleration for Large Language Models | Aug 16, 2024 | GPU, Model Compression | Code Available | 3 | 5 |
| LoopSplat: Loop Closure by Registering 3D Gaussian Splats | Aug 19, 2024 | 3DGS, Point Cloud Registration | Code Available | 3 | 5 |
| Anim-Director: A Large Multimodal Model Powered Agent for Controllable Animation Video Generation | Aug 19, 2024 | Image Generation, Video Generation | Code Available | 3 | 5 |
| AnyGraph: Graph Foundation Model in the Wild | Aug 20, 2024 | Graph Learning, Mixture-of-Experts | Code Available | 3 | 5 |
| LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs | Aug 24, 2024 | Language Modeling, Language Modelling | Code Available | 3 | 5 |
| 3DGS-LM: Faster Gaussian-Splatting Optimization with Levenberg-Marquardt | Sep 19, 2024 | 3DGS, GPU | Code Available | 3 | 5 |
| PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions | Sep 23, 2024 | Image Generation, Image Restoration | Code Available | 3 | 5 |
| TCSinger: Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control | Sep 24, 2024 | Clustering, Language Modelling | Code Available | 3 | 5 |
| Text2CAD: Generating Sequential CAD Models from Beginner-to-Expert Level Text Prompts | Sep 25, 2024 | CAD Reconstruction, Text to 3D | Code Available | 3 | 5 |
| ReMEmbR: Building and Reasoning Over Long-Horizon Spatio-Temporal Memory for Robot Navigation | Sep 20, 2024 | Descriptive, Question Answering | Code Available | 3 | 5 |
| Results of the Big ANN: NeurIPS'23 competition | Sep 25, 2024 | Diversity | Code Available | 3 | 5 |
| Diffusion Models are Evolutionary Algorithms | Oct 3, 2024 | Denoising, Evolutionary Algorithms | Code Available | 3 | 5 |
| ControlAR: Controllable Image Generation with Autoregressive Models | Oct 3, 2024 | Image Generation | Code Available | 3 | 5 |
| CLoSD: Closing the Loop between Simulation and Diffusion for multi-task character control | Oct 4, 2024 | Motion Generation, Reinforcement Learning (RL) | Code Available | 3 | 5 |
| DAWN: Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for Talking Head Video Generation | Oct 17, 2024 | Talking Head Generation, Video Generation | Code Available | 3 | 5 |