| Can Large Language Models Help Multimodal Language Analysis? MMLA: A Comprehensive Benchmark | Apr 23, 2025 | | CodeCode Available | 2 | 5 |
| GaussianAvatar: Towards Realistic Human Avatar Modeling from a Single Video via Animatable 3D Gaussians | Dec 4, 2023 | Motion Estimation | CodeCode Available | 2 | 5 |
| Decoupling Features in Hierarchical Propagation for Video Object Segmentation | Oct 18, 2022 | ObjectSemantic Segmentation | CodeCode Available | 2 | 5 |
| Collaborative Neural Rendering using Anime Character Sheets | Jul 12, 2022 | Image GenerationImage to 3D | CodeCode Available | 2 | 5 |
| PACO: Parts and Attributes of Common Objects | Jan 4, 2023 | 2D Object DetectionAttribute | CodeCode Available | 2 | 5 |
| Neighbourhood Representative Sampling for Efficient End-to-end Video Quality Assessment | Oct 11, 2022 | Video Quality AssessmentVisual Question Answering (VQA) | CodeCode Available | 2 | 5 |
| Bayesian Enhancement Models for One-to-Many Mapping in Image Enhancement | Oct 13, 2024 | Image EnhancementLow-Light Image Enhancement | CodeCode Available | 2 | 5 |
| ET-BERT: A Contextualized Datagram Representation with Pre-training Transformers for Encrypted Traffic Classification | Feb 13, 2022 | ClassificationManagement | CodeCode Available | 2 | 5 |
| READ: Large-Scale Neural Scene Rendering for Autonomous Driving | May 11, 2022 | 3D Scene ReconstructionAutonomous Driving | CodeCode Available | 2 | 5 |
| LVLM-eHub: A Comprehensive Evaluation Benchmark for Large Vision-Language Models | Jun 15, 2023 | HallucinationImage Captioning | CodeCode Available | 2 | 5 |
| Probing the limitations of multimodal language models for chemistry and materials research | Nov 25, 2024 | Experimental DesignSpatial Reasoning | CodeCode Available | 2 | 5 |
| CostFilter-AD: Enhancing Anomaly Detection through Matching Cost Filtering | May 2, 2025 | Anomaly DetectionUnsupervised Anomaly Detection | CodeCode Available | 2 | 5 |
| The Importance of Being Scalable: Improving the Speed and Accuracy of Neural Network Interatomic Potentials Across Chemical Domains | Oct 31, 2024 | GPUPhilosophy | CodeCode Available | 2 | 5 |
| Machine Unlearning in Generative AI: A Survey | Jul 30, 2024 | Machine UnlearningSurvey | CodeCode Available | 2 | 5 |
| A Survey on Diffusion Models for Anomaly Detection | Jan 20, 2025 | Anomaly DetectionComputational Efficiency | CodeCode Available | 2 | 5 |
| PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop | Mar 12, 2025 | DiagnosticVideo Generation | CodeCode Available | 2 | 5 |
| NeuS2: Fast Learning of Neural Implicit Surfaces for Multi-view Reconstruction | Dec 10, 2022 | Surface Reconstruction | CodeCode Available | 2 | 5 |
| DreamText: High Fidelity Scene Text Synthesis | May 23, 2024 | | CodeCode Available | 2 | 5 |
| GlyphDraw2: Automatic Generation of Complex Glyph Posters with Diffusion Models and Large Language Models | Jul 2, 2024 | Marketing | CodeCode Available | 2 | 5 |
| Video in 10 Bits: Few-Bit VideoQA for Efficiency and Privacy | Oct 15, 2022 | Feature CompressionQuestion Answering | CodeCode Available | 2 | 5 |
| Cross-Domain Pre-training with Language Models for Transferable Time Series Representations | Mar 19, 2024 | Language ModellingTime Series | CodeCode Available | 2 | 5 |
| Co-SLAM: Joint Coordinate and Sparse Parametric Encodings for Neural Real-Time SLAM | Apr 27, 2023 | Surface Reconstruction | CodeCode Available | 2 | 5 |
| GP-VTON: Towards General Purpose Virtual Try-on via Collaborative Local-Flow Global-Parsing Learning | Mar 24, 2023 | Virtual Try-on | CodeCode Available | 2 | 5 |
| Decomposing and Editing Predictions by Modeling Model Computation | Apr 17, 2024 | counterfactualmodel | CodeCode Available | 2 | 5 |
| Cooperate or Collapse: Emergence of Sustainable Cooperation in a Society of LLM Agents | Apr 25, 2024 | Decision MakingSpecificity | CodeCode Available | 2 | 5 |