VMamba: Visual State Space Model Jan 18, 2024 Computational Efficiency Language Modeling
Code Code Available 7Full Scaling Automation for Sustainable Development of Green Data Centers May 1, 2023 Cloud Computing CPU
Code Code Available 7ERNIE-SAT: Speech and Text Joint Pretraining for Cross-Lingual Multi-Speaker Text-to-Speech Nov 7, 2022 Representation Learning Speech Representation Learning
Code Code Available 6Orbit: A Unified Simulation Framework for Interactive Robot Learning Environments Jan 10, 2023 GPU Imitation Learning
Code Code Available 5Chinese CLIP: Contrastive Vision-Language Pretraining in Chinese Nov 2, 2022 Contrastive Learning image-classification
Code Code Available 5Point Transformer V3: Simpler Faster Stronger Jan 1, 2024 Representation Learning
Code Code Available 5Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs Jun 24, 2024 Representation Learning Visual Grounding
Code Code Available 5A Time Series is Worth 64 Words: Long-term Forecasting with Transformers Nov 27, 2022 Multivariate Time Series Forecasting Representation Learning
Code Code Available 5CodeGen2: Lessons for Training LLMs on Programming and Natural Languages May 3, 2023 Causal Language Modeling Decoder
Code Code Available 5Masked Completion via Structured Diffusion with White-Box Transformers Apr 3, 2024 Representation Learning
Code Code Available 5Self-Supervised Pre-Training for Table Structure Recognition Transformer Feb 23, 2024 Representation Learning
Code Code Available 4EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything Dec 1, 2023 Decoder image-classification
Code Code Available 4Resources for Brewing BEIR: Reproducible Reference Models and an Official Leaderboard Jun 13, 2023 Information Retrieval Representation Learning
Code Code Available 4ControlVAE: Tuning, Analytical Properties, and Performance Analysis Oct 31, 2020 Disentanglement Image Generation
Code Code Available 4Morphological Prototyping for Unsupervised Slide Representation Learning in Computational Pathology May 19, 2024 Multiple Instance Learning Representation Learning
Code Code Available 4Lightweight Pixel Difference Networks for Efficient Visual Representation Learning Feb 1, 2024 Edge Detection Object Recognition
Code Code Available 4A Comprehensive Survey on Deep Clustering: Taxonomy, Challenges, and Future Directions Jun 15, 2022 Clustering Deep Clustering
Code Code Available 4AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining Aug 10, 2023 Audio Generation In-Context Learning
Code Code Available 42D Matryoshka Sentence Embeddings Feb 22, 2024 RAG Representation Learning
Code Code Available 4BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models Jan 30, 2023 Generative Visual Question Answering Image Captioning
Code Code Available 4Sundial: A Family of Highly Capable Time Series Foundation Models Feb 2, 2025 Representation Learning Time Series
Code Code Available 4SVFR: A Unified Framework for Generalized Video Face Restoration Jan 2, 2025 Colorization Representation Learning
Code Code Available 4LLM2CLIP: Powerful Language Model Unlocks Richer Visual Representation Nov 7, 2024 Contrastive Learning Image Captioning
Code Code Available 4Multi-label Cluster Discrimination for Visual Representation Learning Jul 24, 2024 Contrastive Learning Image-text Retrieval
Code Code Available 4ROLAND: Graph Learning Framework for Dynamic Graphs Aug 15, 2022 Graph Learning Graph Representation Learning
Code Code Available 3Robust and Efficient Medical Imaging with Self-Supervision May 19, 2022 Diagnostic Representation Learning
Code Code Available 3Common Sense Reasoning for Deepfake Detection Jan 31, 2024 Binary Classification Common Sense Reasoning
Code Code Available 3Probabilistic Forecasting with Temporal Convolutional Neural Network Jun 11, 2019 Multivariate Time Series Forecasting Probabilistic Time Series Forecasting
Code Code Available 3Addressing Representation Collapse in Vector Quantized Models with One Linear Layer Nov 4, 2024 Quantization Representation Learning
Code Code Available 3ConvNeXt V2: Co-designing and Scaling ConvNets with Masked Autoencoders Jan 2, 2023 Object Detection Representation Learning
Code Code Available 3OLinear: A Linear Model for Time Series Forecasting in Orthogonally Transformed Domain May 12, 2025 Multivariate Time Series Forecasting Representation Learning
Code Code Available 3Point Transformer V3: Simpler, Faster, Stronger Dec 15, 2023 3D Semantic Segmentation LIDAR Semantic Segmentation
Code Code Available 3Questioning Representational Optimism in Deep Learning: The Fractured Entangled Representation Hypothesis May 16, 2025 Continual Learning Representation Learning
Code Code Available 3SGFormer: Single-Layer Graph Transformers with Approximation-Free Linear Complexity Sep 13, 2024 Deep Attention Representation Learning
Code Code Available 3Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss Oct 22, 2024 GPU Representation Learning
Code Code Available 3Multi-Modality Representation Learning for Antibody-Antigen Interactions Prediction Mar 22, 2025 Graph Attention Prediction
Code Code Available 3Beyond Chain-of-Thought, Effective Graph-of-Thought Reasoning in Language Models May 26, 2023 GSM8K Multimodal Reasoning
Code Code Available 3HEST-1k: A Dataset for Spatial Transcriptomics and Histology Image Analysis Jun 23, 2024 Benchmarking Representation Learning
Code Code Available 3Momentum Contrast for Unsupervised Visual Representation Learning Nov 13, 2019 Contrastive Learning Image Classification
Code Code Available 3MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization Jan 2, 2025 Contrastive Learning Key Detection
Code Code Available 3GaussianOcc: Fully Self-supervised and Efficient 3D Occupancy Estimation with Gaussian Splatting Aug 21, 2024 Representation Learning
Code Code Available 3A Survey on Self-Supervised Learning for Non-Sequential Tabular Data Feb 2, 2024 Contrastive Learning Descriptive
Code Code Available 3Foundation Models for Music: A Survey Aug 26, 2024 In-Context Learning Representation Learning
Code Code Available 3GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding Dec 17, 2024 3D Semantic Occupancy Prediction Autonomous Driving
Code Code Available 3ERNIE-Gram: Pre-Training with Explicitly N-Gram Masked Language Modeling for Natural Language Understanding Oct 23, 2020 Language Modeling Language Modelling
Code Code Available 3Evaluating representation learning on the protein structure universe Jun 19, 2024 Representation Learning
Code Code Available 3EEGPT: Pretrained Transformer for Universal and Reliable Representation of EEG Signals Jan 1, 2024 EEG Representation Learning
Code Code Available 3Computation-Efficient Era: A Comprehensive Survey of State Space Models in Medical Image Analysis Jun 5, 2024 Mamba Medical Image Analysis
Code Code Available 3Elucidating the Design Space of Multimodal Protein Language Models Apr 15, 2025 Diversity Representation Learning
Code Code Available 3Eyes Wide Shut? Exploring the Visual Shortcomings of Multimodal LLMs Jan 11, 2024 Representation Learning Self-Supervised Learning
Code Code Available 3