| 3D-GRAND: A Million-Scale Dataset for 3D-LLMs with Better Grounding and Less Hallucination | Jun 7, 2024 | Hallucination | CodeCode Available | 2 | 5 |
| Spatio-Temporal Self-Supervised Learning for Traffic Flow Prediction | Dec 7, 2022 | AttributePrediction | CodeCode Available | 2 | 5 |
| Contrastive Decoding: Open-ended Text Generation as Optimization | Oct 27, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting | Nov 19, 2022 | DecoderScene Text Detection | CodeCode Available | 2 | 5 |
| MER 2023: Multi-label Learning, Modality Robustness, and Semi-Supervised Learning | Apr 18, 2023 | Emotion RecognitionMulti-Label Learning | CodeCode Available | 2 | 5 |
| DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation | Jun 24, 2024 | BenchmarkingImage Generation | CodeCode Available | 2 | 5 |
| FreshLLMs: Refreshing Large Language Models with Search Engine Augmentation | Oct 5, 2023 | HallucinationWorld Knowledge | CodeCode Available | 2 | 5 |
| EpiLearn: A Python Library for Machine Learning in Epidemic Modeling | Jun 10, 2024 | | CodeCode Available | 2 | 5 |
| 3D Point Cloud Compression with Recurrent Neural Network and Image Compression Methods | Feb 18, 2024 | Data CompressionImage Compression | CodeCode Available | 2 | 5 |
| It Takes Two to Tango: Directly Optimizing for Constrained Synthesizability in Generative Molecular Design | Oct 15, 2024 | Drug Discoveryreinforcement-learning | CodeCode Available | 2 | 5 |
| FlagVNE: A Flexible and Generalizable Reinforcement Learning Framework for Network Resource Allocation | Apr 19, 2024 | DecoderNetwork Embedding | CodeCode Available | 2 | 5 |
| NIKI: Neural Inverse Kinematics with Invertible Neural Networks for 3D Human Pose and Shape Estimation | May 15, 2023 | 3D human pose and shape estimation3D Human Pose Estimation | CodeCode Available | 2 | 5 |
| Large-scale and Fine-grained Vision-language Pre-training for Enhanced CT Image Understanding | Jan 24, 2025 | AnatomyContrastive Learning | CodeCode Available | 2 | 5 |
| FreeTraj: Tuning-Free Trajectory Control in Video Diffusion Models | Jun 24, 2024 | Video Generation | CodeCode Available | 2 | 5 |
| DV-3DLane: End-to-end Multi-modal 3D Lane Detection with Dual-view Representation | Jun 23, 2024 | 3D Lane DetectionAutonomous Driving | CodeCode Available | 2 | 5 |
| MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning | Jun 25, 2024 | ObjectObject Recognition | CodeCode Available | 2 | 5 |
| Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models | Jun 25, 2024 | DiversityMath | CodeCode Available | 2 | 5 |
| Scattered Mixture-of-Experts Implementation | Mar 13, 2024 | Mixture-of-Experts | CodeCode Available | 2 | 5 |
| EgoVideo: Exploring Egocentric Foundation Model and Downstream Adaptation | Jun 26, 2024 | Action AnticipationAction Recognition | CodeCode Available | 2 | 5 |
| DEX-TTS: Diffusion-based EXpressive Text-to-Speech with Style Modeling on Time Variability | Jun 27, 2024 | Speech Synthesistext-to-speech | CodeCode Available | 2 | 5 |
| RoboUniView: Visual-Language Model with Unified View Representation for Robotic Manipulation | Jun 27, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| Dynamic Spatial Sparsification for Efficient Vision Transformers and Convolutional Neural Networks | Jul 4, 2022 | | CodeCode Available | 2 | 5 |
| Odd-One-Out: Anomaly Detection by Comparing with Neighbors | Jun 28, 2024 | 8kAnomaly Detection | CodeCode Available | 2 | 5 |
| E.T. the Exceptional Trajectories: Text-to-camera-trajectory generation with character awareness | Jul 1, 2024 | 3D Generation | CodeCode Available | 2 | 5 |
| MG-Verilog: Multi-grained Dataset Towards Enhanced LLM-assisted Verilog Generation | Jul 2, 2024 | In-Context Learning | CodeCode Available | 2 | 5 |