| UltraEval: A Lightweight Platform for Flexible and Comprehensive Evaluation for LLMs | Apr 11, 2024 | | Code Available | 3 |
| NeuroNCAP: Photorealistic Closed-loop Safety Testing for Autonomous Driving | Apr 11, 2024 | Autonomous Driving, NeRF | Code Available | 3 |
| Rho-1: Not All Tokens Are What You Need | Apr 11, 2024 | All, Continual Pretraining | Code Available | 3 |
| Graph Chain-of-Thought: Augmenting Large Language Models by Reasoning on Graphs | Apr 10, 2024 | | Code Available | 3 |
| Addressing the Abstraction and Reasoning Corpus via Procedural Example Generation | Apr 10, 2024 | ARC, Diversity | Code Available | 3 |
| MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly Detection | Apr 9, 2024 | Anomaly Detection, Decoder | Code Available | 3 |
| ZeST: Zero-Shot Material Transfer from a Single Image | Apr 9, 2024 | Appearance Transfer, Object | Code Available | 3 |
| RoadBEV: Road Surface Reconstruction in Bird's Eye View | Apr 9, 2024 | Autonomous Driving, Autonomous Vehicles | Code Available | 3 |
| Enhancing Decision Analysis with a Large Language Model: pyDecision a Comprehensive Library of MCDA Methods in Python | Apr 9, 2024 | Decision Making, Language Modeling | Code Available | 3 |
| HPNet: Dynamic Trajectory Forecasting with Historical Prediction Attention | Apr 9, 2024 | Autonomous Driving, Prediction | Code Available | 3 |
| pfl-research: simulation framework for accelerating research in Private Federated Learning | Apr 9, 2024 | Federated Learning | Code Available | 3 |
| MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation | Apr 8, 2024 | Image Generation, Image-to-Image Translation | Code Available | 3 |
| PromptAD: Learning Prompts with only Normal Samples for Few-Shot Anomaly Detection | Apr 8, 2024 | Anomaly Detection, Language Modeling | Code Available | 3 |
| MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding | Apr 8, 2024 | GPU, Multiple-choice | Code Available | 3 |
| AI2Apps: A Visual IDE for Building LLM-based AI Agent Applications | Apr 7, 2024 | AI Agent, Management | Code Available | 3 |
| Allo: A Programming Model for Composable Accelerator Design | Apr 7, 2024 | GPU, High-Level Synthesis | Code Available | 3 |
| Automatic Gradient Estimation for Calibrating Crowd Models with Discrete Decision Making | Apr 6, 2024 | Decision Making | Code Available | 3 |
| Lossless and Near-Lossless Compression for Foundation Models | Apr 5, 2024 | | Code Available | 3 |
| Sigma: Siamese Mamba Network for Multi-Modal Semantic Segmentation | Apr 5, 2024 | Decoder, Mamba | Code Available | 3 |
| 3D Facial Expressions through Analysis-by-Neural-Synthesis | Apr 5, 2024 | 3D Face Reconstruction, Face Reconstruction | Code Available | 3 |
| Foundation Model for Advancing Healthcare: Challenges, Opportunities, and Future Directions | Apr 4, 2024 | Survey | Code Available | 3 |
| LiDAR4D: Dynamic Neural Fields for Novel Space-time View LiDAR Synthesis | Apr 3, 2024 | 3D Reconstruction, 4D reconstruction | Code Available | 3 |
| RS-Mamba for Large Remote Sensing Image Dense Prediction | Apr 3, 2024 | Building change detection for remote sensing images, Change Detection | Code Available | 3 |
| BAdam: A Memory Efficient Full Parameter Optimization Method for Large Language Models | Apr 3, 2024 | GPU, Math | Code Available | 3 |
| Faster Diffusion via Temporal Attention Decomposition | Apr 3, 2024 | | Code Available | 3 |
| PiSSA: Principal Singular Values and Singular Vectors Adaptation of Large Language Models | Apr 3, 2024 | GSM8K, Quantization | Code Available | 3 |
| Bidirectional Multi-Scale Implicit Neural Representations for Image Deraining | Apr 2, 2024 | Image Reconstruction, Rain Removal | Code Available | 3 |
| Tensorized NeuroEvolution of Augmenting Topologies for GPU Acceleration | Apr 2, 2024 | Computational Efficiency, GPU | Code Available | 3 |
| Advancing LLM Reasoning Generalists with Preference Trees | Apr 2, 2024 | Benchmarking, Code Generation | Code Available | 3 |
| SPMamba: State-space model is all you need in speech separation | Apr 2, 2024 | All, Mamba | Code Available | 3 |
| GS2Mesh: Surface Reconstruction from Gaussian Splatting via Novel Stereo Views | Apr 2, 2024 | 3DGS, Novel View Synthesis | Code Available | 3 |
| ViTamin: Designing Scalable Vision Models in the Vision-Language Era | Apr 2, 2024 | | Code Available | 3 |
| Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks | Apr 2, 2024 | In-Context Learning | Code Available | 3 |
| Evalverse: Unified and Accessible Library for Large Language Model Evaluation | Apr 1, 2024 | Language Model Evaluation, Language Modeling | Code Available | 3 |
| GPU-accelerated Evolutionary Multiobjective Optimization Using Tensorized RVEA | Apr 1, 2024 | GPU, Multiobjective Optimization | Code Available | 3 |
| HairFastGAN: Realistic and Robust Hair Transfer with a Fast Encoder-Based Approach | Apr 1, 2024 | | Code Available | 3 |
| An RML-FNML module for Python user-defined functions in Morph-KGC | Apr 1, 2024 | Data Integration, Knowledge Graphs | Code Available | 3 |
| Evaluating Text-to-Visual Generation with Image-to-Text Generation | Apr 1, 2024 | Image to text, Question Answering | Code Available | 3 |
| M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language Models | Mar 31, 2024 | Image-text Retrieval, Language Modeling | Code Available | 3 |
| Towards Realistic Scene Generation with LiDAR Diffusion Models | Mar 31, 2024 | 3D geometry, Image Generation | Code Available | 3 |
| DRCT: Saving Image Super-resolution away from Information Bottleneck | Mar 31, 2024 | Image Super-Resolution, Super-Resolution | Code Available | 3 |
| 94% on CIFAR-10 in 3.29 Seconds on a Single GPU | Mar 30, 2024 | GPU | Code Available | 3 |
| Rewrite the Stars | Mar 29, 2024 | | Code Available | 3 |
| UltraLight VM-UNet: Parallel Vision Mamba Significantly Reduces Parameters for Skin Lesion Segmentation | Mar 29, 2024 | Image Segmentation, Lesion Segmentation | Code Available | 3 |
| Are We on the Right Way for Evaluating Large Vision-Language Models? | Mar 29, 2024 | World Knowledge | Code Available | 3 |
| TableLLM: Enabling Tabular Data Manipulation by LLMs in Real Office Usage Scenarios | Mar 28, 2024 | Language Modeling, Language Modelling | Code Available | 3 |
| RSMamba: Remote Sensing Image Classification with State Space Model | Mar 28, 2024 | Classification, image-classification | Code Available | 3 |
| Navigating Eukaryotic Genome Annotation Pipelines: A Route Map to BRAKER, Galba, and TSEBRA | Mar 28, 2024 | | Code Available | 3 |
| Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models | Mar 28, 2024 | Language Modeling, Language Modelling | Code Available | 3 |
| MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions | Mar 28, 2024 | Image Retrieval, Implicit Relations | Code Available | 3 |