| ReConcile: Round-Table Conference Improves Reasoning via Consensus among Diverse LLMs | Sep 22, 2023 | Math | CodeCode Available | 2 |
| AnglE-optimized Text Embeddings | Sep 22, 2023 | Language ModellingLarge Language Model | CodeCode Available | 2 |
| OmniDrones: An Efficient and Flexible Platform for Reinforcement Learning in Drone Control | Sep 22, 2023 | GPUreinforcement-learning | CodeCode Available | 2 |
| Detect Everything with Few Examples | Sep 22, 2023 | Binary ClassificationCross-Domain Few-Shot Object Detection | CodeCode Available | 2 |
| ICASSP 2023 Acoustic Echo Cancellation Challenge | Sep 22, 2023 | Acoustic echo cancellationSpeech Enhancement | CodeCode Available | 2 |
| The Reversal Curse: LLMs trained on "A is B" fail to learn "B is A" | Sep 21, 2023 | Data AugmentationSentence | CodeCode Available | 2 |
| Describe, Explain, Plan and Select: Interactive Planning with LLMs Enables Open-World Multi-Task Agents | Sep 21, 2023 | | CodeCode Available | 2 |
| Wasserstein Quantum Monte Carlo: A Novel Approach for Solving the Quantum Many-Body Schrödinger Equation | Sep 21, 2023 | | CodeCode Available | 2 |
| TART: A plug-and-play Transformer module for task-agnostic reasoning | Sep 21, 2023 | | CodeCode Available | 2 |
| LLM-Grounder: Open-Vocabulary 3D Visual Grounding with Large Language Model as an Agent | Sep 21, 2023 | 3D visual groundingLanguage Modeling | CodeCode Available | 2 |
| Random-Access Infinite Context Length for Transformers | Sep 21, 2023 | | CodeCode Available | 2 |
| Geometric Transformer with Interatomic Positional Encoding | Sep 21, 2023 | | CodeCode Available | 2 |
| RRHF: Rank Responses to Align Language Models with Human Feedback | Sep 21, 2023 | | CodeCode Available | 2 |
| BanditPAM++: Faster k-medoids Clustering | Sep 21, 2023 | | CodeCode Available | 2 |
| Parsel🐍: Algorithmic Reasoning with Language Models by Composing Decompositions | Sep 21, 2023 | | CodeCode Available | 2 |
| Blockwise Parallel Transformers for Large Context Models | Sep 21, 2023 | | CodeCode Available | 2 |
| On the Planning Abilities of Large Language Models - A Critical Investigation | Sep 21, 2023 | | CodeCode Available | 2 |
| One Fits All: Power General Time Series Analysis by Pretrained LM | Sep 21, 2023 | | CodeCode Available | 2 |
| H3T: Efficient Integration of Memory Optimization and Parallelism for Large-scale Transformer Training | Sep 21, 2023 | | CodeCode Available | 2 |
| Monitor-Guided Decoding of Code LMs with Static Analysis of Repository Context | Sep 21, 2023 | | CodeCode Available | 2 |
| MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models | Sep 21, 2023 | Arithmetic ReasoningGSM8K | CodeCode Available | 2 |
| H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models | Sep 21, 2023 | | CodeCode Available | 2 |
| PromptIR: Prompting for All-in-One Image Restoration | Sep 21, 2023 | | CodeCode Available | 2 |
| Achieving Cross Modal Generalization with Multimodal Unified Representation | Sep 21, 2023 | | CodeCode Available | 2 |
| RMT: Retentive Networks Meet Vision Transformers | Sep 20, 2023 | Instance Segmentationobject-detection | CodeCode Available | 2 |
| EPTQ: Enhanced Post-Training Quantization via Hessian-guided Network-wise Optimization | Sep 20, 2023 | Knowledge Distillationobject-detection | CodeCode Available | 2 |
| A Paradigm Shift in Machine Translation: Boosting Translation Performance of Large Language Models | Sep 20, 2023 | Language ModellingMachine Translation | CodeCode Available | 2 |
| StructChart: On the Schema, Metric, and Augmentation for Visual Chart Understanding | Sep 20, 2023 | Chart Question AnsweringChart Understanding | CodeCode Available | 2 |
| DISC-LawLLM: Fine-tuning Large Language Models for Intelligent Legal Services | Sep 20, 2023 | Language ModellingLarge Language Model | CodeCode Available | 2 |
| Text2Reward: Reward Shaping with Language Models for Reinforcement Learning | Sep 20, 2023 | MuJoCoreinforcement-learning | CodeCode Available | 2 |
| DreamLLM: Synergistic Multimodal Comprehension and Creation | Sep 20, 2023 | multimodal generationVisual Question Answering | CodeCode Available | 2 |
| You Only Look at Screens: Multimodal Chain-of-Action Agents | Sep 20, 2023 | Type prediction | CodeCode Available | 2 |
| GPTFUZZER: Red Teaming Large Language Models with Auto-Generated Jailbreak Prompts | Sep 19, 2023 | Red Teaming | CodeCode Available | 2 |
| Rethinking Imitation-based Planner for Autonomous Driving | Sep 19, 2023 | Autonomous DrivingData Augmentation | CodeCode Available | 2 |
| Forgedit: Text Guided Image Editing via Learning and Forgetting | Sep 19, 2023 | text-guided-image-editing | CodeCode Available | 2 |
| PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training | Sep 19, 2023 | 2kPosition | CodeCode Available | 2 |
| PanopticNeRF-360: Panoramic 3D-to-2D Label Transfer in Urban Scenes | Sep 19, 2023 | Self-Driving Cars | CodeCode Available | 2 |
| Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity | Sep 19, 2023 | GPU | CodeCode Available | 2 |
| PLVS: A SLAM System with Points, Lines, Volumetric Mapping, and 3D Incremental Segmentation | Sep 19, 2023 | 3D ReconstructionSegmentation | CodeCode Available | 2 |
| DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving | Sep 18, 2023 | Autonomous DrivingVideo Generation | CodeCode Available | 2 |
| DFormer: Rethinking RGBD Representation Learning for Semantic Segmentation | Sep 18, 2023 | 3D geometryDecoder | CodeCode Available | 2 |
| vSHARP: variable Splitting Half-quadratic Admm algorithm for Reconstruction of inverse-Problems | Sep 18, 2023 | compressed sensingMRI Reconstruction | CodeCode Available | 2 |
| RenderOcc: Vision-Centric 3D Occupancy Prediction with 2D Rendering Supervision | Sep 18, 2023 | Autonomous DrivingNeRF | CodeCode Available | 2 |
| RaTrack: Moving Object Detection and Tracking with 4D Radar Point Cloud | Sep 18, 2023 | Motion EstimationMotion Segmentation | CodeCode Available | 2 |
| Grasp-Anything: Large-scale Grasp Dataset from Foundation Models | Sep 18, 2023 | DiversityRobotic Grasping | CodeCode Available | 2 |
| HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform | Sep 18, 2023 | Speech Synthesis | CodeCode Available | 2 |
| OWL: A Large Language Model for IT Operations | Sep 17, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| RenderIH: A Large-scale Synthetic Dataset for 3D Interacting Hand Pose Estimation | Sep 17, 2023 | 3D Interacting Hand Pose EstimationDiversity | CodeCode Available | 2 |
| Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding | Sep 15, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Music Source Separation Based on a Lightweight Deep Learning Framework (DTTNET: DUAL-PATH TFC-TDF UNET) | Sep 15, 2023 | Music Source Separation | CodeCode Available | 2 |