| GRID: A Platform for General Robot Intelligence Development | Oct 2, 2023 | | Code Available | 2 |
| PatchMixer: A Patch-Mixing Architecture for Long-Term Time Series Forecasting | Oct 1, 2023 | Time Series; Time Series Forecasting | Code Available | 2 |
| RoleLLM: Benchmarking, Eliciting, and Enhancing Role-Playing Abilities of Large Language Models | Oct 1, 2023 | Benchmarking | Code Available | 2 |
| Consistency Trajectory Models: Learning Probability Flow ODE Trajectory of Diffusion | Oct 1, 2023 | Denoising; Image Generation | Code Available | 2 |
| Reformulating Vision-Language Foundation Models and Datasets Towards Universal Multimodal Assistants | Oct 1, 2023 | Instruction Following | Code Available | 2 |
| InstructCV: Instruction-Tuned Text-to-Image Diffusion Models as Vision Generalists | Sep 30, 2023 | Depth Estimation; Image Generation | Code Available | 2 |
| Scalable Multi-Temporal Remote Sensing Change Data Generation via Simulating Stochastic Change Process | Sep 29, 2023 | Change Data Generation; Change Detection | Code Available | 2 |
| Alphazero-like Tree-Search can Guide Large Language Model Decoding and Training | Sep 29, 2023 | Decision Making; Language Modeling | Code Available | 2 |
| GAIA-1: A Generative World Model for Autonomous Driving | Sep 29, 2023 | Autonomous Driving | Code Available | 2 |
| Graph-based Neural Weather Prediction for Limited Area Modeling | Sep 29, 2023 | Weather Forecasting | Code Available | 2 |
| nnSAM: Plug-and-play Segment Anything Model Improves nnUNet Performance | Sep 29, 2023 | Few-Shot Learning; Heart Segmentation | Code Available | 2 |
| Fine-grained Late-interaction Multi-modal Retrieval for Retrieval Augmented Visual Question Answering | Sep 29, 2023 | Image to text; Passage Retrieval | Code Available | 2 |
| Directly Fine-Tuning Diffusion Models on Differentiable Rewards | Sep 29, 2023 | | Code Available | 2 |
| One for All: Towards Training One Graph Model for All Classification Tasks | Sep 29, 2023 | All; Graph Classification | Code Available | 2 |
| UXsim: An open source macroscopic and mesoscopic traffic simulator in Python -- a technical overview | Sep 29, 2023 | | Code Available | 2 |
| CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets | Sep 29, 2023 | Language Modelling; Mathematical Reasoning | Code Available | 2 |
| Denoising Diffusion Bridge Models | Sep 29, 2023 | Denoising; Image Generation | Code Available | 2 |
| Transformer-VQ: Linear-Time Transformers via Vector Quantization | Sep 28, 2023 | 8k; Decoder | Code Available | 2 |
| LawBench: Benchmarking Legal Knowledge of Large Language Models | Sep 28, 2023 | Articles; Benchmarking | Code Available | 2 |
| ModuLoRA: Finetuning 2-Bit LLMs on Consumer GPUs by Integrating with Modular Quantizers | Sep 28, 2023 | GPU; Instruction Following | Code Available | 2 |
| DiLu: A Knowledge-Driven Approach to Autonomous Driving with Large Language Models | Sep 28, 2023 | 10-shot image generation; 1 Image, 2*2 Stitchi | Code Available | 2 |
| MEM: Multi-Modal Elevation Mapping for Robotics and Learning | Sep 28, 2023 | Colorization; GPU | Code Available | 2 |
| GPT-Fathom: Benchmarking Large Language Models to Decipher the Evolutionary Path towards GPT-4 and Beyond | Sep 28, 2023 | Benchmarking | Code Available | 2 |
| Text-to-3D using Gaussian Splatting | Sep 28, 2023 | 3D Generation; Text to 3D | Code Available | 2 |
| RLLTE: Long-Term Evolution Project of Reinforcement Learning | Sep 28, 2023 | Language Modeling; Language Modelling | Code Available | 2 |
| Cross-Prediction-Powered Inference | Sep 28, 2023 | Decision Making; Missing Labels | Code Available | 2 |
| MHG-GNN: Combination of Molecular Hypergraph Grammar with Graph Neural Network | Sep 28, 2023 | Graph Neural Network; Prediction | Code Available | 2 |
| Deep Geometrized Cartoon Line Inbetweening | Sep 28, 2023 | | Code Available | 2 |
| OrthoPlanes: A Novel Representation for Better 3D-Awareness of GANs | Sep 27, 2023 | | Code Available | 2 |
| GeoCLIP: Clip-Inspired Alignment between Locations and Images for Effective Worldwide Geo-localization | Sep 27, 2023 | Contrastive Learning; geo-localization | Code Available | 2 |
| Navigate through Enigmatic Labyrinth A Survey of Chain of Thought Reasoning: Advances, Frontiers and Future | Sep 27, 2023 | Navigate | Code Available | 2 |
| NeuRBF: A Neural Fields Representation with Adaptive Radial Basis Functions | Sep 27, 2023 | | Code Available | 2 |
| Effective Long-Context Scaling of Foundation Models | Sep 27, 2023 | Continual Pretraining; Language Modeling | Code Available | 2 |
| A Content-Driven Micro-Video Recommendation Dataset at Scale | Sep 27, 2023 | Benchmarking; Recommendation Systems | Code Available | 2 |
| A Toolkit for Reliable Benchmarking and Research in Multi-Objective Reinforcement Learning | Sep 26, 2023 | Benchmarking; Multi-Objective Reinforcement Learning | Code Available | 2 |
| RankVicuna: Zero-Shot Listwise Document Reranking with Open-Source Large Language Models | Sep 26, 2023 | Information Retrieval; Reranking | Code Available | 2 |
| Event Stream-based Visual Object Tracking: A High-Resolution Benchmark Dataset and A Novel Baseline | Sep 26, 2023 | Knowledge Distillation; Object Tracking | Code Available | 2 |
| ProteinInvBench: Benchmarking Protein Inverse Folding on Diverse Tasks, Models, and Metrics | Sep 26, 2023 | | Code Available | 2 |
| M^4: A Unified XAI Benchmark for Faithfulness Evaluation of Feature Attribution Methods across Metrics, Modalities and Models | Sep 26, 2023 | | Code Available | 2 |
| PIXIU: A Comprehensive Benchmark, Instruction Dataset and Large Language Model for Finance | Sep 26, 2023 | | Code Available | 2 |
| ICML 2023 Topological Deep Learning Challenge : Design and Results | Sep 26, 2023 | Deep Learning | Code Available | 2 |
| ProteinGym: Large-Scale Benchmarks for Protein Fitness Prediction and Design | Sep 26, 2023 | Mutational/Variant Effect Prediction | Code Available | 2 |
| Joint Audio and Speech Understanding | Sep 25, 2023 | | Code Available | 2 |
| Detecting and Grounding Multi-Modal Media Manipulation and Beyond | Sep 25, 2023 | Binary Classification; Contrastive Learning | Code Available | 2 |
| OmniEvent: A Comprehensive, Fair, and Easy-to-Use Toolkit for Event Understanding | Sep 25, 2023 | Event Argument Extraction; Event Detection | Code Available | 2 |
| Traj-LO: In Defense of LiDAR-Only Odometry Using an Effective Continuous-Time Trajectory | Sep 25, 2023 | | Code Available | 2 |
| Q-Bench: A Benchmark for General-Purpose Foundation Models on Low-level Vision | Sep 25, 2023 | Image Quality Assessment | Code Available | 2 |
| VidChapters-7M: Video Chapters at Scale | Sep 25, 2023 | Dense Video Captioning; Navigate | Code Available | 2 |
| MentaLLaMA: Interpretable Mental Health Analysis on Social Media with Large Language Models | Sep 24, 2023 | Instruction Following | Code Available | 2 |
| P-Flow: A Fast and Data-Efficient Zero-Shot TTS through Speech Prompting | Sep 22, 2023 | Decoder; Speech Synthesis | Code Available | 2 |