VMamba: Visual State Space Model Jan 18, 2024 Computational Efficiency Language Modeling
Code Code Available 7Full Scaling Automation for Sustainable Development of Green Data Centers May 1, 2023 Cloud Computing CPU
Code Code Available 7ERNIE-SAT: Speech and Text Joint Pretraining for Cross-Lingual Multi-Speaker Text-to-Speech Nov 7, 2022 Representation Learning Speech Representation Learning
Code Code Available 6Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs Jun 24, 2024 Representation Learning Visual Grounding
Code Code Available 5Masked Completion via Structured Diffusion with White-Box Transformers Apr 3, 2024 Representation Learning
Code Code Available 5Point Transformer V3: Simpler Faster Stronger Jan 1, 2024 Representation Learning
Code Code Available 5CodeGen2: Lessons for Training LLMs on Programming and Natural Languages May 3, 2023 Causal Language Modeling Decoder
Code Code Available 5Orbit: A Unified Simulation Framework for Interactive Robot Learning Environments Jan 10, 2023 GPU Imitation Learning
Code Code Available 5A Time Series is Worth 64 Words: Long-term Forecasting with Transformers Nov 27, 2022 Multivariate Time Series Forecasting Representation Learning
Code Code Available 5Chinese CLIP: Contrastive Vision-Language Pretraining in Chinese Nov 2, 2022 Contrastive Learning image-classification
Code Code Available 5Sundial: A Family of Highly Capable Time Series Foundation Models Feb 2, 2025 Representation Learning Time Series
Code Code Available 4SVFR: A Unified Framework for Generalized Video Face Restoration Jan 2, 2025 Colorization Representation Learning
Code Code Available 4LLM2CLIP: Powerful Language Model Unlocks Richer Visual Representation Nov 7, 2024 Contrastive Learning Image Captioning
Code Code Available 4Multi-label Cluster Discrimination for Visual Representation Learning Jul 24, 2024 Contrastive Learning Image-text Retrieval
Code Code Available 4Morphological Prototyping for Unsupervised Slide Representation Learning in Computational Pathology May 19, 2024 Multiple Instance Learning Representation Learning
Code Code Available 4Self-Supervised Pre-Training for Table Structure Recognition Transformer Feb 23, 2024 Representation Learning
Code Code Available 42D Matryoshka Sentence Embeddings Feb 22, 2024 RAG Representation Learning
Code Code Available 4Lightweight Pixel Difference Networks for Efficient Visual Representation Learning Feb 1, 2024 Edge Detection Object Recognition
Code Code Available 4EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything Dec 1, 2023 Decoder image-classification
Code Code Available 4AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining Aug 10, 2023 Audio Generation In-Context Learning
Code Code Available 4Resources for Brewing BEIR: Reproducible Reference Models and an Official Leaderboard Jun 13, 2023 Information Retrieval Representation Learning
Code Code Available 4BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models Jan 30, 2023 Generative Visual Question Answering Image Captioning
Code Code Available 4A Comprehensive Survey on Deep Clustering: Taxonomy, Challenges, and Future Directions Jun 15, 2022 Clustering Deep Clustering
Code Code Available 4ControlVAE: Tuning, Analytical Properties, and Performance Analysis Oct 31, 2020 Disentanglement Image Generation
Code Code Available 4Questioning Representational Optimism in Deep Learning: The Fractured Entangled Representation Hypothesis May 16, 2025 Continual Learning Representation Learning
Code Code Available 3OLinear: A Linear Model for Time Series Forecasting in Orthogonally Transformed Domain May 12, 2025 Multivariate Time Series Forecasting Representation Learning
Code Code Available 3Elucidating the Design Space of Multimodal Protein Language Models Apr 15, 2025 Diversity Representation Learning
Code Code Available 3GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation Apr 11, 2025 Decoder Image Generation
Code Code Available 3Multi-Modality Representation Learning for Antibody-Antigen Interactions Prediction Mar 22, 2025 Graph Attention Prediction
Code Code Available 3NdLinear Is All You Need for Representation Learning Mar 21, 2025 All Representation Learning
Code Code Available 3MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization Jan 2, 2025 Contrastive Learning Key Detection
Code Code Available 3Towards Generalist Robot Policies: What Matters in Building Vision-Language-Action Models Dec 18, 2024 Representation Learning Robot Manipulation
Code Code Available 3GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding Dec 17, 2024 3D Semantic Occupancy Prediction Autonomous Driving
Code Code Available 3Addressing Representation Collapse in Vector Quantized Models with One Linear Layer Nov 4, 2024 Quantization Representation Learning
Code Code Available 3Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss Oct 22, 2024 GPU Representation Learning
Code Code Available 3SGFormer: Single-Layer Graph Transformers with Approximation-Free Linear Complexity Sep 13, 2024 Deep Attention Representation Learning
Code Code Available 3Foundation Models for Music: A Survey Aug 26, 2024 In-Context Learning Representation Learning
Code Code Available 3GaussianOcc: Fully Self-supervised and Efficient 3D Occupancy Estimation with Gaussian Splatting Aug 21, 2024 Representation Learning
Code Code Available 3HEST-1k: A Dataset for Spatial Transcriptomics and Histology Image Analysis Jun 23, 2024 Benchmarking Representation Learning
Code Code Available 3Evaluating representation learning on the protein structure universe Jun 19, 2024 Representation Learning
Code Code Available 3Computation-Efficient Era: A Comprehensive Survey of State Space Models in Medical Image Analysis Jun 5, 2024 Mamba Medical Image Analysis
Code Code Available 3TSLANet: Rethinking Transformers for Time Series Representation Learning Apr 12, 2024 Anomaly Detection Computational Efficiency
Code Code Available 3A Survey on Self-Supervised Learning for Non-Sequential Tabular Data Feb 2, 2024 Contrastive Learning Descriptive
Code Code Available 3Common Sense Reasoning for Deepfake Detection Jan 31, 2024 Binary Classification Common Sense Reasoning
Code Code Available 3Eyes Wide Shut? Exploring the Visual Shortcomings of Multimodal LLMs Jan 11, 2024 Representation Learning Self-Supervised Learning
Code Code Available 3Universal Time-Series Representation Learning: A Survey Jan 8, 2024 Feature Engineering Representation Learning
Code Code Available 3EEGPT: Pretrained Transformer for Universal and Reliable Representation of EEG Signals Jan 1, 2024 EEG Representation Learning
Code Code Available 3Point Transformer V3: Simpler, Faster, Stronger Dec 15, 2023 3D Semantic Segmentation LIDAR Semantic Segmentation
Code Code Available 3White-Box Transformers via Sparse Rate Reduction: Compression Is All There Is? Nov 22, 2023 All Data Compression
Code Code Available 3White-Box Transformers via Sparse Rate Reduction Jun 1, 2023 Representation Learning
Code Code Available 3