| Instruction-Tuning Llama-3-8B Excels in City-Scale Mobility Prediction | Oct 31, 2024 | Disaster Response, Language Modeling | Code Available | 1 |
| GHIL-Glue: Hierarchical Control with Filtered Subgoal Images | Oct 26, 2024 | Imitation Learning, Video Prediction | Unverified | 0 |
| Random Policy Enables In-Context Reinforcement Learning within Trust Horizons | Oct 25, 2024 | In-Context Learning, In-Context Reinforcement Learning | Unverified | 0 |
| Adversarial Environment Design via Regret-Guided Diffusion Models | Oct 25, 2024 | Deep Reinforcement Learning, Diversity | Unverified | 0 |
| BioMistral-NLU: Towards More Generalizable Medical Language Understanding through Instruction Tuning | Oct 24, 2024 | Instruction Following, Natural Language Understanding | Unverified | 0 |
| LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias | Oct 22, 2024 | 3DGS, Decoder | Unverified | 0 |
| DEL-Ranking: Ranking-Correction Denoising Framework for Elucidating Molecular Affinities in DNA-Encoded Libraries | Oct 19, 2024 | Denoising, Zero-shot Generalization | Unverified | 0 |
| BiGR: Harnessing Binary Latent Codes for Image Generation and Improved Visual Representation Capabilities | Oct 18, 2024 | Conditional Image Generation, Image Generation | Code Available | 2 |
| Meta-DT: Offline Meta-RL as Conditional Sequence Modeling with World Model Disentanglement | Oct 15, 2024 | Disentanglement, Inductive Bias | Code Available | 2 |
| MoTE: Reconciling Generalization with Specialization for Visual-Language to Video Knowledge Transfer | Oct 14, 2024 | Transfer Learning, Video Recognition | Code Available | 0 |
| Learning to Generate Diverse Pedestrian Movements from Web Videos with Noisy Labels | Oct 10, 2024 | Motion Forecasting, Zero-shot Generalization | Unverified | 0 |
| On the Evaluation of Generative Robotic Simulations | Oct 10, 2024 | Diversity, text similarity | Unverified | 0 |
| RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation | Oct 10, 2024 | Zero-shot Generalization | Code Available | 5 |
| Zero-Shot Generalization of Vision-Based RL Without Data Augmentation | Oct 9, 2024 | Data Augmentation, Disentanglement | Unverified | 0 |
| Zero-Shot Fact Verification via Natural Logic and Large Language Models | Oct 4, 2024 | Fact Verification, Zero-shot Generalization | Code Available | 0 |
| What Matters for Model Merging at Scale? | Oct 4, 2024 | model, Task Arithmetic | Unverified | 0 |
| Cross-Embodiment Dexterous Grasping with Reinforcement Learning | Oct 3, 2024 | reinforcement-learning, Reinforcement Learning | Unverified | 0 |
| Learning Diverse Bimanual Dexterous Manipulation Skills from Human Demonstrations | Oct 3, 2024 | Zero-shot Generalization | Unverified | 0 |
| MedViLaM: A multimodal large language model with advanced generalizability and explainability for medical data understanding and generation | Sep 29, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction | Sep 26, 2024 | 3D Reconstruction, Denoising | Code Available | 4 |
| A novel open-source ultrasound dataset with deep learning benchmarks for spinal cord injury localization and anatomical segmentation | Sep 24, 2024 | Anatomy, object-detection | Code Available | 0 |
| From Goal-Conditioned to Language-Conditioned Agents via Vision-Language Models | Sep 24, 2024 | Reinforcement Learning (RL), Zero-shot Generalization | Unverified | 0 |
| M^2PT: Multimodal Prompt Tuning for Zero-shot Instruction Learning | Sep 24, 2024 | Zero-shot Generalization | Code Available | 1 |
| Deep Generative Adversarial Network for Occlusion Removal from a Single Image | Sep 20, 2024 | Generative Adversarial Network, Segmentation | Unverified | 0 |
| Deep Learning based Optical Image Super-Resolution via Generative Diffusion Models for Layerwise in-situ LPBF Monitoring | Sep 20, 2024 | Image Super-Resolution, SSIM | Unverified | 0 |