| GC4NC: A Benchmark Framework for Graph Condensation on Node Classification with New Insights | Jun 24, 2024 | DenoisingNeural Architecture Search | CodeCode Available | 2 | 5 |
| Dynamic Gaussian Marbles for Novel View Synthesis of Casual Monocular Videos | Jun 26, 2024 | Novel View SynthesisPoint Tracking | CodeCode Available | 2 | 5 |
| RegMix: Data Mixture as Regression for Language Model Pre-training | Jul 1, 2024 | Common Sense ReasoningLanguage Modeling | CodeCode Available | 2 | 5 |
| Pretraining End-to-End Keyword Search with Automatically Discovered Acoustic Units | Jul 5, 2024 | Acoustic Unit DiscoveryAutomatic Speech Recognition | CodeCode Available | 2 | 5 |
| Learning Formal Mathematics From Intrinsic Motivation | Jun 30, 2024 | Automated Theorem ProvingLanguage Modeling | CodeCode Available | 2 | 5 |
| Solving Motion Planning Tasks with a Scalable Generative Model | Jul 3, 2024 | Autonomous DrivingMotion Planning | CodeCode Available | 2 | 5 |
| Isomorphic Pruning for Vision Models | Jul 5, 2024 | | CodeCode Available | 2 | 5 |
| Benchmarking Complex Instruction-Following with Multiple Constraints Composition | Jul 4, 2024 | BenchmarkingInstruction Following | CodeCode Available | 2 | 5 |
| MiniGPT-Med: Large Language Model as a General Interface for Radiology Diagnosis | Jul 4, 2024 | DiagnosticLanguage Modeling | CodeCode Available | 2 | 5 |
| TASTE: Text-Aligned Speech Tokenization and Embedding for Spoken Language Modeling | Apr 9, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| Trainable Fractional Fourier Transform | Mar 4, 2024 | image-classificationImage Classification | CodeCode Available | 2 | 5 |
| IRSAM: Advancing Segment Anything Model for Infrared Small Target Detection | Jul 10, 2024 | DecoderImage Segmentation | CodeCode Available | 2 | 5 |
| ColorPeel: Color Prompt Learning with Diffusion Models via Color and Shape Disentanglement | Jul 9, 2024 | AttributeDisentanglement | CodeCode Available | 2 | 5 |
| AccDiffusion: An Accurate Method for Higher-Resolution Image Generation | Jul 15, 2024 | Image GenerationObject | CodeCode Available | 2 | 5 |
| PARE-Net: Position-Aware Rotation-Equivariant Networks for Robust Point Cloud Registration | Jul 14, 2024 | Inductive BiasPoint Cloud Registration | CodeCode Available | 2 | 5 |
| iHuman: Instant Animatable Digital Humans From Monocular Videos | Jul 15, 2024 | 3D geometry3D Reconstruction | CodeCode Available | 2 | 5 |
| From GaLore to WeLore: How Low-Rank Weights Non-uniformly Emerge from Low-Rank Gradients | Jul 15, 2024 | GPU | CodeCode Available | 2 | 5 |
| GroupMamba: Efficient Group-Based Visual State Space Model | Jul 18, 2024 | image-classificationImage Classification | CodeCode Available | 2 | 5 |
| AutoFlow: Automated Workflow Generation for Large Language Model Agents | Jul 1, 2024 | AI AgentLanguage Modeling | CodeCode Available | 2 | 5 |
| Discover-then-Name: Task-Agnostic Concept Bottlenecks via Automated Concept Discovery | Jul 19, 2024 | | CodeCode Available | 2 | 5 |
| RealViformer: Investigating Attention for Real-World Video Super-Resolution | Jul 19, 2024 | Image Super-ResolutionSuper-Resolution | CodeCode Available | 2 | 5 |
| MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models | Sep 21, 2023 | Arithmetic ReasoningGSM8K | CodeCode Available | 2 | 5 |
| LoRA-Pro: Are Low-Rank Adapters Properly Optimized? | Jul 25, 2024 | Code GenerationComputational Efficiency | CodeCode Available | 2 | 5 |
| NAVIX: Scaling MiniGrid Environments with JAX | Jul 28, 2024 | CPUDeep Reinforcement Learning | CodeCode Available | 2 | 5 |
| Cross-Layer Feature Pyramid Transformer for Small Object Detection in Aerial Images | Jul 29, 2024 | object-detectionObject Detection | CodeCode Available | 2 | 5 |
| MotionCraft: Crafting Whole-Body Motion with Plug-and-Play Multimodal Controls | Jul 30, 2024 | Gesture GenerationMotion Generation | CodeCode Available | 2 | 5 |
| WalkTheDog: Cross-Morphology Motion Alignment via Phase Manifolds | Jul 11, 2024 | Retrieval | CodeCode Available | 2 | 5 |
| Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks | Aug 7, 2024 | AttributeIn-Context Learning | CodeCode Available | 2 | 5 |
| CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications | Aug 7, 2024 | image-classificationImage Classification | CodeCode Available | 2 | 5 |
| SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models | Apr 10, 2025 | Reinforcement Learning (RL)Visual Reasoning | CodeCode Available | 2 | 5 |
| Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment | Aug 12, 2024 | Contrastive Learning | CodeCode Available | 2 | 5 |
| AgentCourt: Simulating Court with Adversarial Evolvable Lawyer Agents | Aug 15, 2024 | | CodeCode Available | 2 | 5 |
| Accelerating Giant Impact Simulations with Machine Learning | Aug 16, 2024 | | CodeCode Available | 2 | 5 |
| GR-MG: Leveraging Partially Annotated Data via Multi-Modal Goal-Conditioned Policy | Aug 26, 2024 | Few-Shot LearningImage Generation | CodeCode Available | 2 | 5 |
| UTrack: Multi-Object Tracking with Uncertain Detections | Aug 30, 2024 | Autonomous DrivingMulti-Object Tracking | CodeCode Available | 2 | 5 |
| AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction | Sep 3, 2024 | RelationRelation Extraction | CodeCode Available | 2 | 5 |
| PlantSeg: A Large-Scale In-the-wild Dataset for Plant Disease Segmentation | Sep 6, 2024 | Benchmarkingimage-classification | CodeCode Available | 2 | 5 |
| A Survey on Mixup Augmentations and Beyond | Sep 8, 2024 | Image ClassificationSelf-Supervised Learning | CodeCode Available | 2 | 5 |
| PiEEG-16 to Measure 16 EEG Channels with Raspberry Pi for Brain-Computer Interfaces and EEG devices | Sep 8, 2024 | Brain Computer InterfaceEEG | CodeCode Available | 2 | 5 |
| The CMA Evolution Strategy: A Tutorial | Apr 4, 2016 | | CodeCode Available | 2 | 5 |
| Scaling Smart: Accelerating Large Language Model Pre-training with Small Model Initialization | Sep 19, 2024 | GPULanguage Modeling | CodeCode Available | 2 | 5 |
| MOSS: Enabling Code-Driven Evolution and Context Management for AI Agents | Sep 24, 2024 | Code GenerationManagement | CodeCode Available | 2 | 5 |
| A Survey on the Honesty of Large Language Models | Sep 27, 2024 | Survey | CodeCode Available | 2 | 5 |
| Robot See Robot Do: Imitating Articulated Object Manipulation with Monocular 4D Reconstruction | Sep 26, 2024 | 4D reconstructionObject | CodeCode Available | 2 | 5 |
| Spiking Transformer with Spatial-Temporal Attention | Sep 29, 2024 | | CodeCode Available | 2 | 5 |
| Brain-JEPA: Brain Dynamics Foundation Model with Gradient Positioning and Spatiotemporal Masking | Sep 28, 2024 | Prognosis | CodeCode Available | 2 | 5 |
| Codev-Bench: How Do LLMs Understand Developer-Centric Code Completion? | Oct 2, 2024 | Code CompletionCode Generation | CodeCode Available | 2 | 5 |
| PointAD: Comprehending 3D Anomalies from Points and Pixels for Zero-shot 3D Anomaly Detection | Oct 1, 2024 | 3D Anomaly DetectionAnomaly Detection | CodeCode Available | 2 | 5 |
| End-to-end Piano Performance-MIDI to Score Conversion with Transformers | Sep 30, 2024 | | CodeCode Available | 2 | 5 |
| Mamba in Vision: A Comprehensive Survey of Techniques and Applications | Oct 4, 2024 | MambaState Space Models | CodeCode Available | 2 | 5 |