Visible-Thermal Tiny Object Detection: A Benchmark Dataset and Baselines (Jun 20, 2024). Tags: Diversity, object-detection
Taiwan LLM: Bridging the Linguistic Divide with a Culturally Aligned Language Model (Nov 29, 2023). Tags: Diversity, Language Modeling
UniMLVG: Unified Framework for Multi-view Long Video Generation with Comprehensive Control Capabilities for Autonomous Driving (Dec 6, 2024). Tags: Autonomous Driving, Diversity
UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition (Apr 23, 2024). Tags: Decoder, Diversity
UniTraj: A Unified Framework for Scalable Vehicle Trajectory Prediction (Mar 22, 2024). Tags: Diversity, Prediction
Taming Diffusion Probabilistic Models for Character Control (Apr 23, 2024). Tags: Computational Efficiency, Diversity
SVIT: Scaling up Visual Instruction Tuning (Jul 9, 2023). Tags: Diversity, Image Captioning
ThemeStation: Generating Theme-Aware 3D Assets from Few Exemplars (Mar 22, 2024). Tags: 3D Generation, Diversity
Unlock Pose Diversity: Accurate and Efficient Implicit Keypoint-based Spatiotemporal Diffusion for Audio-driven Talking Portrait (Mar 17, 2025). Tags: Computational Efficiency, Diversity
Swin3D++: Effective Multi-Source Pretraining for 3D Indoor Scene Understanding (Feb 22, 2024). Tags: Diversity, Scene Understanding
Self-QA: Unsupervised Knowledge Guided Language Model Alignment (May 19, 2023). Tags: Diversity, Language Modeling
SA-Med2D-20M Dataset: Segment Anything in 2D Medical Imaging with 20 Million masks (Nov 20, 2023). Tags: Diversity, Image Segmentation
Sequence-Augmented SE(3)-Flow Matching For Conditional Protein Backbone Generation (May 30, 2024). Tags: Diversity, Drug Design
Results of the Big ANN: NeurIPS'23 competition (Sep 25, 2024). Tags: Diversity
DexGrasp Anything: Towards Universal Robotic Dexterous Grasping with Physics Awareness (Mar 11, 2025). Tags: Diversity
RT-1: Robotics Transformer for Real-World Control at Scale (Dec 13, 2022). Tags: Diversity, Robot Manipulation
Sequential Modeling Enables Scalable Learning for Large Vision Models (Dec 1, 2023). Tags: Diversity
OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis (Dec 27, 2024). Tags: Diversity, Synthetic Data Generation
CRITERIA: a New Benchmarking Paradigm for Evaluating Trajectory Prediction Models for Autonomous Driving (Oct 11, 2023). Tags: Autonomous Driving, Benchmarking
Playground v3: Improving Text-to-Image Alignment with Deep-Fusion Large Language Models (Sep 16, 2024). Tags: Decoder, Diversity
MNN: A Universal and Efficient Inference Engine (Feb 27, 2020). Tags: Deep Learning, Diversity
Generating Long Sequences with Sparse Transformers (Apr 23, 2019). Tags: Diversity, Image Generation
Objaverse-XL: A Universe of 10M+ 3D Objects (Jul 11, 2023). Tags: Diversity, Novel View Synthesis
Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision (May 4, 2023). Tags: Diversity, In-Context Learning
INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning (Jan 12, 2024). Tags: Diversity, document understanding
Improving Text Embeddings with Large Language Models (Dec 31, 2023). Tags: Decoder, Diversity
Zero-Shot Surgical Tool Segmentation in Monocular Video Using Segment Anything Model 2 (Aug 3, 2024). Tags: Diversity, Segmentation
Hierarchical Text-Conditional Image Generation with CLIP Latents (Apr 13, 2022). Tags: Conditional Image Generation, Decoder
Improved motif-scaffolding with SE(3) flow matching (Jan 8, 2024). Tags: Data Augmentation, Diversity
Improving Model Evaluation using SMART Filtering of Benchmark Datasets (Oct 26, 2024). Tags: Chatbot, Diversity
LongAlign: A Recipe for Long Context Alignment of Large Language Models (Jan 31, 2024). Tags: Diversity, Instruction Following
MiniViT: Compressing Vision Transformers with Weight Multiplexing (Apr 14, 2022). Tags: Diversity, Image Classification
FRACTAL: An Ultra-Large-Scale Aerial Lidar Dataset for 3D Semantic Segmentation of Diverse Landscapes (May 7, 2024). Tags: 3D Point Cloud Classification, 3D Semantic Segmentation
GenWarp: Single Image to Novel Views with Semantic-Preserving Generative Warping (May 27, 2024). Tags: Depth Estimation, Diversity
Anything-3D: Towards Single-view Anything Reconstruction in the Wild (Apr 19, 2023). Tags: 3D Reconstruction, Diversity
SkillMimic: Learning Basketball Interaction Skills from Demonstrations (Aug 12, 2024). Tags: Diversity, Human-Object Interaction Detection
EgoMimic: Scaling Imitation Learning via Egocentric Video (Oct 31, 2024). Tags: Diversity, Imitation Learning
Efficient Quality Diversity Optimization of 3D Buildings through 2D Pre-optimization (Mar 28, 2023). Tags: Diversity
Effective Data Augmentation With Diffusion Models (Feb 7, 2023). Tags: Data Augmentation, Diversity
AdaSociety: An Adaptive Environment with Social Structures for Multi-Agent Decision-Making (Nov 6, 2024). Tags: Decision Making, Diversity
EcomGPT: Instruction-tuning Large Language Models with Chain-of-Task Tasks for E-commerce (Aug 14, 2023). Tags: Diversity, Instruction Following
Dual Spoof Disentanglement Generation for Face Anti-spoofing with Depth Uncertainty Learning (Dec 1, 2021). Tags: Disentanglement, Diversity
DualAnoDiff: Dual-Interrelated Diffusion Model for Few-Shot Anomaly Image Generation (Aug 24, 2024). Tags: Anomaly Classification, Anomaly Detection
EasyPortrait -- Face Parsing and Portrait Segmentation Dataset (Apr 26, 2023). Tags: Diversity, Domain Generalization
EDGE: Editable Dance Generation From Music (Nov 19, 2022). Tags: Diversity, Motion Synthesis
Diverse Preference Optimization (Jan 30, 2025). Tags: Diversity
DiverGen: Improving Instance Segmentation by Learning Wider Data Distribution with More Diverse Generative Data (May 16, 2024). Tags: Data Augmentation, Diversity
DivPrune: Diversity-based Visual Token Pruning for Large Multimodal Models (Mar 4, 2025). Tags: Diversity, GPU
Diffusion Probabilistic Models beat GANs on Medical Images (Dec 14, 2022). Tags: Denoising, Diversity
DiffusionPen: Towards Controlling the Style of Handwritten Text Generation (Sep 9, 2024). Tags: Diversity, HTR