| CAX: Cellular Automata Accelerated in JAX | Oct 3, 2024 | ARCArtificial Life | CodeCode Available | 3 |
| Diffusion Models are Evolutionary Algorithms | Oct 3, 2024 | DenoisingEvolutionary Algorithms | CodeCode Available | 3 |
| How to Train Long-Context Language Models (Effectively) | Oct 3, 2024 | | CodeCode Available | 3 |
| Agent Security Bench (ASB): Formalizing and Benchmarking Attacks and Defenses in LLM-based Agents | Oct 3, 2024 | Autonomous DrivingBackdoor Attack | CodeCode Available | 3 |
| FAN: Fourier Analysis Networks | Oct 3, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding | Oct 2, 2024 | Image GenerationText to Image Generation | CodeCode Available | 3 |
| OmniGenBench: Automating Large-scale in-silico Benchmarking for Genomic Foundation Models | Oct 2, 2024 | Benchmarking | CodeCode Available | 3 |
| ImageFolder: Autoregressive Image Generation with Folded Tokens | Oct 2, 2024 | Image GenerationImage Reconstruction | CodeCode Available | 3 |
| Deep Learning Alternatives of the Kolmogorov Superposition Theorem | Oct 2, 2024 | Deep LearningKolmogorov-Arnold Networks | CodeCode Available | 3 |
| MMFNet: Multi-Scale Frequency Masking Neural Network for Multivariate Time Series Forecasting | Oct 2, 2024 | Multivariate Time Series ForecastingMultivariate Time Series Forecastingm | CodeCode Available | 3 |
| SonicSim: A customizable simulation platform for speech processing in moving sound source scenarios | Oct 2, 2024 | Speech EnhancementSpeech Separation | CodeCode Available | 3 |
| SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing Images | Oct 2, 2024 | Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation | CodeCode Available | 3 |
| MVGS: Multi-view-regulated Gaussian Splatting for Novel View Synthesis | Oct 2, 2024 | 3DGSNeRF | CodeCode Available | 3 |
| MixLinear: Extreme Low Resource Multivariate Time Series Forecasting with 0.1K Parameters | Oct 2, 2024 | Multivariate Time Series ForecastingTime Series | CodeCode Available | 3 |
| LayerKV: Optimizing Large Language Model Serving with Layer-wise KV Cache Management | Oct 1, 2024 | GPULanguage Modeling | CodeCode Available | 3 |
| Simple and Fast Distillation of Diffusion Models | Sep 29, 2024 | GPUImage Generation | CodeCode Available | 3 |
| PhysGen: Rigid-Body Physics-Grounded Image-to-Video Generation | Sep 27, 2024 | Image to Video GenerationVideo Generation | CodeCode Available | 3 |
| Emu3: Next-Token Prediction is All You Need | Sep 27, 2024 | All | CodeCode Available | 3 |
| DANA: Domain-Aware Neurosymbolic Agents for Consistency and Accuracy | Sep 27, 2024 | Financial Analysis | CodeCode Available | 3 |
| CycleNet: Enhancing Time Series Forecasting through Modeling Periodic Patterns | Sep 27, 2024 | Time SeriesTime Series Forecasting | CodeCode Available | 3 |
| Does End-to-End Autonomous Driving Really Need Perception Tasks? | Sep 26, 2024 | Autonomous Driving | CodeCode Available | 3 |
| The Elephant in the Room: Towards A Reliable Time-Series Anomaly Detection Benchmark | Sep 26, 2024 | Anomaly DetectionBenchmarking | CodeCode Available | 3 |
| Harmful Fine-tuning Attacks and Defenses for Large Language Models: A Survey | Sep 26, 2024 | Safety Alignment | CodeCode Available | 3 |
| Generative Modeling of Molecular Dynamics Trajectories | Sep 26, 2024 | | CodeCode Available | 3 |
| Cascade Prompt Learning for Vision-Language Model Adaptation | Sep 26, 2024 | General Knowledgeimage-classification | CodeCode Available | 3 |
| Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale | Sep 25, 2024 | Large Language Model | CodeCode Available | 3 |
| Text2CAD: Generating Sequential CAD Models from Beginner-to-Expert Level Text Prompts | Sep 25, 2024 | CAD ReconstructionText to 3D | CodeCode Available | 3 |
| Degradation-Guided One-Step Image Super-Resolution with Diffusion Priors | Sep 25, 2024 | Image Super-ResolutionSuper-Resolution | CodeCode Available | 3 |
| Results of the Big ANN: NeurIPS'23 competition | Sep 25, 2024 | Diversity | CodeCode Available | 3 |
| TCSinger: Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control | Sep 24, 2024 | ClusteringLanguage Modelling | CodeCode Available | 3 |
| Language-based Audio Moment Retrieval | Sep 24, 2024 | audio moment retrievalMoment Retrieval | CodeCode Available | 3 |
| WeSep: A Scalable and Flexible Toolkit Towards Generalizable Target Speaker Extraction | Sep 24, 2024 | Managementspeech-recognition | CodeCode Available | 3 |
| ESPnet-Codec: Comprehensive Training and Evaluation of Neural Codecs for Audio, Music, and Speech | Sep 24, 2024 | Audio Generation | CodeCode Available | 3 |
| MCTrack: A Unified 3D Multi-Object Tracking Framework for Autonomous Driving | Sep 23, 2024 | 3D Multi-Object TrackingAutonomous Driving | CodeCode Available | 3 |
| Addressing Emotion Bias in Music Emotion Recognition and Generation with Frechet Audio Distance | Sep 23, 2024 | Emotion RecognitionFAD | CodeCode Available | 3 |
| PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions | Sep 23, 2024 | Image GenerationImage Restoration | CodeCode Available | 3 |
| ReMEmbR: Building and Reasoning Over Long-Horizon Spatio-Temporal Memory for Robot Navigation | Sep 20, 2024 | DescriptiveQuestion Answering | CodeCode Available | 3 |
| Data Augmentation for Sequential Recommendation: A Survey | Sep 20, 2024 | Data AugmentationRecommendation Systems | CodeCode Available | 3 |
| Colorful Diffuse Intrinsic Image Decomposition in the Wild | Sep 20, 2024 | Color ConstancyIntrinsic Image Decomposition | CodeCode Available | 3 |
| GTSinger: A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks | Sep 20, 2024 | AllSinging Voice Synthesis | CodeCode Available | 3 |
| Fact, Fetch, and Reason: A Unified Evaluation of Retrieval-Augmented Generation | Sep 19, 2024 | RAGRetrieval | CodeCode Available | 3 |
| DrivingForward: Feed-forward 3D Gaussian Splatting for Driving Scene Reconstruction from Flexible Surround-view Input | Sep 19, 2024 | | CodeCode Available | 3 |
| Oryx MLLM: On-Demand Spatial-Temporal Understanding at Arbitrary Resolution | Sep 19, 2024 | document understandingVideo Question Answering | CodeCode Available | 3 |
| 3DGS-LM: Faster Gaussian-Splatting Optimization with Levenberg-Marquardt | Sep 19, 2024 | 3DGSGPU | CodeCode Available | 3 |
| WiLoR: End-to-end 3D Hand Localization and Reconstruction in-the-wild | Sep 18, 2024 | 3D Hand Pose EstimationHand Detection | CodeCode Available | 3 |
| SOAP: Improving and Stabilizing Shampoo using Adam | Sep 17, 2024 | Computational Efficiency | CodeCode Available | 3 |
| CORE-Bench: Fostering the Credibility of Published Research Through a Computational Reproducibility Agent Benchmark | Sep 17, 2024 | | CodeCode Available | 3 |
| Deep Graph Anomaly Detection: A Survey and New Perspectives | Sep 16, 2024 | Anomaly DetectionGraph Anomaly Detection | CodeCode Available | 3 |
| Playground v3: Improving Text-to-Image Alignment with Deep-Fusion Large Language Models | Sep 16, 2024 | DecoderDiversity | CodeCode Available | 3 |
| Towards Kinetic Manipulation of the Latent Space | Sep 15, 2024 | | CodeCode Available | 3 |