| LayoutGPT: Compositional Visual Planning and Generation with Large Language Models | May 24, 2023 | Image GenerationIndoor Scene Synthesis | CodeCode Available | 2 |
| CoLaDa: A Collaborative Label Denoising Framework for Cross-lingual Named Entity Recognition | May 24, 2023 | DenoisingKnowledge Distillation | CodeCode Available | 2 |
| A Tale of Two Features: Stable Diffusion Complements DINO for Zero-Shot Semantic Correspondence | May 24, 2023 | Dense Pixel Correspondence EstimationRepresentation Learning | CodeCode Available | 2 |
| NuScenes-QA: A Multi-modal Visual Question Answering Benchmark for Autonomous Driving Scenario | May 24, 2023 | Autonomous DrivingQuestion Answering | CodeCode Available | 2 |
| Enabling Large Language Models to Generate Text with Citations | May 24, 2023 | HallucinationRetrieval | CodeCode Available | 2 |
| gRNAde: Geometric Deep Learning for 3D RNA inverse design | May 24, 2023 | 3D geometryDeep Learning | CodeCode Available | 2 |
| A New Era in Software Security: Towards Self-Healing Software via Large Language Models and Formal Verification | May 24, 2023 | C++ codeMathematical Proofs | CodeCode Available | 2 |
| torchgfn: A PyTorch GFlowNet library | May 24, 2023 | | CodeCode Available | 2 |
| ExpertPrompting: Instructing Large Language Models to be Distinguished Experts | May 24, 2023 | In-Context LearningInstruction Following | CodeCode Available | 2 |
| Adapting Language Models to Compress Contexts | May 24, 2023 | In-Context LearningLanguage Modeling | CodeCode Available | 2 |
| Lawyer LLaMA Technical Report | May 24, 2023 | ArticlesHallucination | CodeCode Available | 2 |
| Unpaired Image-to-Image Translation via Neural Schrödinger Bridge | May 24, 2023 | Image-to-Image TranslationTranslation | CodeCode Available | 2 |
| Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models | May 24, 2023 | ChatbotNatural Language Understanding | CodeCode Available | 2 |
| Sparse4D v2: Recurrent Temporal Fusion with Sparse Model | May 23, 2023 | | CodeCode Available | 2 |
| Improving Factuality and Reasoning in Language Models through Multiagent Debate | May 23, 2023 | Few-Shot LearningLanguage Modeling | CodeCode Available | 2 |
| Grammar-Constrained Decoding for Structured NLP Tasks without Finetuning | May 23, 2023 | Code GenerationConstituency Parsing | CodeCode Available | 2 |
| Link Prediction without Graph Neural Networks | May 23, 2023 | AttributeGraph Learning | CodeCode Available | 2 |
| SAD: Segment Any RGBD | May 23, 2023 | 3D Panoptic SegmentationOpen Vocabulary Semantic Segmentation | CodeCode Available | 2 |
| DetGPT: Detect What You Need via Reasoning | May 23, 2023 | Autonomous DrivingObject | CodeCode Available | 2 |
| LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models | May 23, 2023 | Common Sense ReasoningImage Generation | CodeCode Available | 2 |
| Enhancing Detail Preservation for Customized Text-to-Image Generation: A Regularization-Free Approach | May 23, 2023 | GPUImage Generation | CodeCode Available | 2 |
| Control-A-Video: Controllable Text-to-Video Diffusion Models with Motion Prior and Reward Feedback Learning | May 23, 2023 | Image GenerationOptical Flow Estimation | CodeCode Available | 2 |
| Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training | May 23, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Efficient Multi-Scale Attention Module with Cross-Spatial Learning | May 23, 2023 | Dimensionality Reductionimage-classification | CodeCode Available | 2 |
| The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning | May 23, 2023 | Common Sense ReasoningCommon Sense Reasoning (Zero-Shot) | CodeCode Available | 2 |
| FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation | May 23, 2023 | FormLanguage Modelling | CodeCode Available | 2 |
| REC-MV: REconstructing 3D Dynamic Cloth from Monocular Videos | May 23, 2023 | Garment ReconstructionNeural Rendering | CodeCode Available | 2 |
| MP-SENet: A Speech Enhancement Model with Parallel Denoising of Magnitude and Phase Spectra | May 23, 2023 | DecoderDenoising | CodeCode Available | 2 |
| ReWOO: Decoupling Reasoning from Observations for Efficient Augmented Language Models | May 23, 2023 | Retrieval | CodeCode Available | 2 |
| Perception Test: A Diagnostic Benchmark for Multimodal Video Models | May 23, 2023 | DiagnosticGrounded Video Question Answering | CodeCode Available | 2 |
| SMT 2.0: A Surrogate Modeling Toolbox with a focus on Hierarchical and Mixed Variables Gaussian Processes | May 23, 2023 | Gaussian Processes | CodeCode Available | 2 |
| MAGE: Machine-generated Text Detection in the Wild | May 22, 2023 | Binary text classificationFace Swapping | CodeCode Available | 2 |
| Hierarchical Integration Diffusion Model for Realistic Image Deblurring | May 22, 2023 | DeblurringImage Deblurring | CodeCode Available | 2 |
| Coswara: A respiratory sounds and symptoms dataset for remote screening of SARS-CoV-2 infection | May 22, 2023 | Fairness | CodeCode Available | 2 |
| Multimodal Automated Fact-Checking: A Survey | May 22, 2023 | Fact CheckingMisinformation | CodeCode Available | 2 |
| RenderMe-360: A Large Digital Asset Library and Benchmarks Towards High-fidelity Head Avatars | May 22, 2023 | 2kImage Matting | CodeCode Available | 2 |
| Training Diffusion Models with Reinforcement Learning | May 22, 2023 | Decision MakingDenoising | CodeCode Available | 2 |
| FurnitureBench: Reproducible Real-World Benchmark for Long-Horizon Complex Manipulation | May 22, 2023 | Imitation LearningMotion Planning | CodeCode Available | 2 |
| Matcher: Segment Anything with One Shot Using All-Purpose Feature Matching | May 22, 2023 | AllFew-Shot Semantic Segmentation | CodeCode Available | 2 |
| VDT: General-purpose Video Diffusion Transformers via Mask Modeling | May 22, 2023 | Autonomous DrivingVideo Generation | CodeCode Available | 2 |
| Boosting Knowledge Graph Generation from Tabular Data with RML Views | May 22, 2023 | Data IntegrationGraph Generation | CodeCode Available | 2 |
| LLMs for Knowledge Graph Construction and Reasoning: Recent Capabilities and Future Opportunities | May 22, 2023 | Event Extractiongraph construction | CodeCode Available | 2 |
| Mist: Towards Improved Adversarial Examples for Diffusion Models | May 22, 2023 | Adversarial Defense | CodeCode Available | 2 |
| LaDI-VTON: Latent Diffusion Textual-Inversion Enhanced Virtual Try-On | May 22, 2023 | Virtual Try-on | CodeCode Available | 2 |
| Lion: Adversarial Distillation of Proprietary Large Language Models | May 22, 2023 | Instruction FollowingKnowledge Distillation | CodeCode Available | 2 |
| ControlVideo: Training-free Controllable Text-to-Video Generation | May 22, 2023 | Image GenerationText-to-Video Generation | CodeCode Available | 2 |
| VanillaNet: the Power of Minimalism in Deep Learning | May 22, 2023 | Deep LearningPhilosophy | CodeCode Available | 2 |
| Evaluating the Performance of Large Language Models on GAOKAO Benchmark | May 21, 2023 | | CodeCode Available | 2 |
| Logic-LM: Empowering Large Language Models with Symbolic Solvers for Faithful Logical Reasoning | May 20, 2023 | Logical Reasoning | CodeCode Available | 2 |
| Knowledge-Design: Pushing the Limit of Protein Design via Knowledge Refinement | May 20, 2023 | Protein DesignRetrieval | CodeCode Available | 2 |