| EduNLP: Towards a Unified and Modularized Library for Educational Resources | Jun 3, 2024 | | Code Available | 2 |
| Boosting Vision-Language Models with Transduction | Jun 3, 2024 | Few-Shot Learning, Transductive Learning | Code Available | 2 |
| Tetrahedron Splatting for 3D Generation | Jun 3, 2024 | 3D Generation, 3DGS | Code Available | 2 |
| TimeCMA: Towards LLM-Empowered Multivariate Time Series Forecasting via Cross-Modality Alignment | Jun 3, 2024 | Multivariate Time Series Forecasting, Time Series | Code Available | 2 |
| SUBLLM: A Novel Efficient Architecture with Token Sequence Subsampling for LLM | Jun 3, 2024 | Decoder, GPU | Code Available | 2 |
| A Diffusion Model Framework for Unsupervised Neural Combinatorial Optimization | Jun 3, 2024 | Combinatorial Optimization | Code Available | 2 |
| Demystifying AI Platform Design for Distributed Inference of Next-Generation LLM models | Jun 3, 2024 | Chunking, Mamba | Code Available | 2 |
| Dragonfly: Multi-Resolution Zoom-In Encoding Enhances Vision-Language Models | Jun 3, 2024 | Image Captioning, Language Modelling | Code Available | 2 |
| GeoReasoner: Geo-localization with Reasoning in Street Views using a Large Vision-Language Model | Jun 3, 2024 | geo-localization, Language Modeling | Code Available | 2 |
| GeminiFusion: Efficient Pixel-wise Multimodal Fusion for Vision Transformer | Jun 3, 2024 | 3D Object Detection, Image-to-Image Translation | Code Available | 2 |
| The Geometry of Categorical and Hierarchical Concepts in Large Language Models | Jun 3, 2024 | Language Modelling, Large Language Model | Code Available | 2 |
| Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching | Jun 3, 2024 | Denoising | Code Available | 2 |
| Long and Short Guidance in Score identity Distillation for One-Step Text-to-Image Generation | Jun 3, 2024 | Image Generation, Text to Image Generation | Code Available | 2 |
| Get my drift? Catching LLM Task Drift with Activation Deltas | Jun 2, 2024 | Text Generation | Code Available | 2 |
| Visual place recognition for aerial imagery: A survey | Jun 2, 2024 | Survey, Visual Localization | Code Available | 2 |
| Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation | Jun 2, 2024 | Segmentation, Semantic Segmentation | Code Available | 2 |
| Aligning Language Models with Demonstrated Feedback | Jun 2, 2024 | Articles, Avg | Code Available | 2 |
| AI-Face: A Million-Scale Demographically Annotated AI-Generated Face Dataset and Fairness Benchmark | Jun 2, 2024 | Face Swapping, Fairness | Code Available | 2 |
| Correlation Matching Transformation Transformers for UHD Image Restoration | Jun 2, 2024 | Deblurring, Image Deblurring | Code Available | 2 |
| Full-Atom Peptide Design based on Multi-modal Flow Matching | Jun 2, 2024 | Drug Discovery | Code Available | 2 |
| Once-for-All: Controllable Generative Image Compression with Dynamic Granularity Adaption | Jun 2, 2024 | All, Image Compression | Code Available | 2 |
| AMUSE: Emotional Speech-driven 3D Body Animation via Disentangled Latent Diffusion | Jun 1, 2024 | Gesture Generation, Rhythm | Code Available | 2 |
| Neural Optimal Transport with Lagrangian Costs | Jun 1, 2024 | | Code Available | 2 |
| DroneVis: Versatile Computer Vision Library for Drones | Jun 1, 2024 | | Code Available | 2 |
| PuzzleFusion++: Auto-agglomerative 3D Fracture Assembly by Denoise and Verify | Jun 1, 2024 | | Code Available | 2 |
| DeepMol: An Automated Machine and Deep Learning Framework for Computational Chemistry | Jun 1, 2024 | Activity Prediction, AutoML | Code Available | 2 |
| A Survey on Large Language Models for Code Generation | Jun 1, 2024 | Code Generation, HumanEval | Code Available | 2 |
| Non-destructive Degradation Pattern Decoupling for Ultra-early Battery Prototype Verification Using Physics-informed Machine Learning | Jun 1, 2024 | Attribute, Physics-informed machine learning | Code Available | 2 |
| FlowIE: Efficient Image Enhancement via Rectified Flow | Jun 1, 2024 | Image Enhancement | Code Available | 2 |
| RecDiff: Diffusion Model for Social Recommendation | Jun 1, 2024 | Denoising, model | Code Available | 2 |
| Neural Combinatorial Optimization Algorithms for Solving Vehicle Routing Problems: A Comprehensive Survey with Perspectives | Jun 1, 2024 | Combinatorial Optimization | Code Available | 2 |
| Frieren: Efficient Video-to-Audio Generation Network with Rectified Flow Matching | Jun 1, 2024 | Audio Generation, Video-to-Sound Generation | Code Available | 2 |
| Learning Manipulation by Predicting Interaction | Jun 1, 2024 | Representation Learning | Code Available | 2 |
| SelfGNN: Self-Supervised Graph Neural Networks for Sequential Recommendation | May 31, 2024 | Graph Neural Network, Recommendation Systems | Code Available | 2 |
| Correlation-aware Coarse-to-fine MLPs for Deformable Medical Image Registration | May 31, 2024 | Deformable Medical Image Registration, Image Registration | Code Available | 2 |
| DeCo: Decoupling Token Compression from Semantic Abstraction in Multimodal Large Language Models | May 31, 2024 | cross-modal alignment, Visual Localization | Code Available | 2 |
| ABodyBuilder3: Improved and scalable antibody structure predictions | May 31, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| LLM-ESR: Large Language Models Enhancement for Long-tailed Sequential Recommendation | May 31, 2024 | Recommendation Systems, Sequential Recommendation | Code Available | 2 |
| TotalVibeSegmentator: Full Body MRI Segmentation for the NAKO and UK Biobank | May 31, 2024 | Epidemiology, Holdout Set | Code Available | 2 |
| Improved Techniques for Optimization-Based Jailbreaking on Large Language Models | May 31, 2024 | Red Teaming | Code Available | 2 |
| Query2CAD: Generating CAD models using natural language queries | May 31, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Hybrid Fourier Score Distillation for Efficient One Image to 3D Object Generation | May 31, 2024 | 3D Generation, Image Generation | Code Available | 2 |
| SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales | May 31, 2024 | | Code Available | 2 |
| Mixed Diffusion for 3D Indoor Scene Synthesis | May 31, 2024 | Denoising, Indoor Scene Synthesis | Code Available | 2 |
| ContextGS: Compact 3D Gaussian Splatting with Anchor Level Context Model | May 31, 2024 | 3DGS, Image Compression | Code Available | 2 |
| LLaMEA: A Large Language Model Evolutionary Algorithm for Automatically Generating Metaheuristics | May 30, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| STHN: Deep Homography Estimation for UAV Thermal Geo-localization with Satellite Imagery | May 30, 2024 | Autonomous Navigation, geo-localization | Code Available | 2 |
| Visual Perception by Large Language Model's Weights | May 30, 2024 | | Code Available | 2 |
| Improving the Training of Rectified Flows | May 30, 2024 | Image Generation, Knowledge Distillation | Code Available | 2 |
| Open-Set Domain Adaptation for Semantic Segmentation | May 30, 2024 | Domain Adaptation, Semantic Segmentation | Code Available | 2 |