| Exposing flaws of generative model evaluation metrics and their unfair treatment of diffusion models | Jun 7, 2023 | Diversity, Image Generation | Code Available | 2 |
| ModuleFormer: Modularity Emerges from Mixture-of-Experts | Jun 7, 2023 | Language Modelling, Lightweight Deployment | Code Available | 2 |
| On the Reliability of Watermarks for Large Language Models | Jun 7, 2023 | | Code Available | 2 |
| INSTRUCTEVAL: Towards Holistic Evaluation of Instruction-Tuned Large Language Models | Jun 7, 2023 | | Code Available | 2 |
| RLtools: A Fast, Portable Deep Reinforcement Learning Library for Continuous Control | Jun 6, 2023 | continuous-control, Continuous Control | Code Available | 2 |
| LEACE: Perfect linear concept erasure in closed form | Jun 6, 2023 | Fairness, Form | Code Available | 2 |
| Adversarial attacks and defenses in explainable artificial intelligence: A survey | Jun 6, 2023 | Decision Making, Explainable artificial intelligence | Code Available | 2 |
| Spherical Fourier Neural Operators: Learning Stable Dynamics on the Sphere | Jun 6, 2023 | Operator learning | Code Available | 2 |
| Inference-Time Intervention: Eliciting Truthful Answers from a Language Model | Jun 6, 2023 | Language Modeling, Language Modelling | Code Available | 2 |
| Scalable telomere-to-telomere assembly for diploid and polyploid genomes with double graph | Jun 6, 2023 | | Code Available | 2 |
| MolFM: A Multimodal Molecular Foundation Model | Jun 6, 2023 | Cross-Modal Retrieval, Knowledge Graphs | Code Available | 2 |
| SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight Compression | Jun 5, 2023 | GPU, Language Modelling | Code Available | 2 |
| STAR Loss: Reducing Semantic Ambiguity in Facial Landmark Detection | Jun 5, 2023 | Face Alignment, Facial Landmark Detection | Code Available | 2 |
| LLM-Blender: Ensembling Large Language Models with Pairwise Ranking and Generative Fusion | Jun 5, 2023 | | Code Available | 2 |
| Calib-Anything: Zero-training LiDAR-Camera Extrinsic Calibration Method Using Segment Anything | Jun 5, 2023 | Camera Calibration | Code Available | 2 |
| LibAUC: A Deep Learning Library for X-Risk Optimization | Jun 5, 2023 | Benchmarking, Classification | Code Available | 2 |
| Scene as Occupancy | Jun 5, 2023 | Decoder, Motion Planning | Code Available | 2 |
| User Behavior Simulation with Large Language Model based Agents | Jun 5, 2023 | Language Modeling, Language Modelling | Code Available | 2 |
| SAM3D: Zero-Shot 3D Object Detection via Segment Anything Model | Jun 4, 2023 | 3D Object Detection, Image Segmentation | Code Available | 2 |
| Using Unreliable Pseudo-Labels for Label-Efficient Semantic Segmentation | Jun 4, 2023 | Semantic Segmentation | Code Available | 2 |
| DYffusion: A Dynamics-informed Diffusion Model for Spatiotemporal Forecasting | Jun 3, 2023 | Computational Efficiency, Inductive Bias | Code Available | 2 |
| VideoComposer: Compositional Video Synthesis with Motion Controllability | Jun 3, 2023 | Image Generation, Text-to-Video Generation | Code Available | 2 |
| Fine-Grained Human Feedback Gives Better Rewards for Language Model Training | Jun 2, 2023 | Language Modeling, Language Modelling | Code Available | 2 |
| TIES-Merging: Resolving Interference When Merging Models | Jun 2, 2023 | Transfer Learning | Code Available | 2 |
| Invisible Image Watermarks Are Provably Removable Using Generative AI | Jun 2, 2023 | Denoising, Image Denoising | Code Available | 2 |
| Example-based Motion Synthesis via Generative Motion Matching | Jun 1, 2023 | Motion Generation, Motion Synthesis | Code Available | 2 |
| StyleDrop: Text-to-Image Generation in Any Style | Jun 1, 2023 | Image Generation, Text to Image Generation | Code Available | 2 |
| Low-Light Image Enhancement with Wavelet-based Diffusion Models | Jun 1, 2023 | Denoising, Face Detection | Code Available | 2 |
| GRES: Generalized Referring Expression Segmentation | Jun 1, 2023 | Generalized Referring Expression Segmentation, Referring Expression | Code Available | 2 |
| STEVE-1: A Generative Model for Text-to-Behavior in Minecraft | Jun 1, 2023 | Decision Making, Image Generation | Code Available | 2 |
| Differential Diffusion: Giving Each Pixel Its Strength | Jun 1, 2023 | Image Generation, Text-based Image Editing | Code Available | 2 |
| Wuerstchen: An Efficient Architecture for Large-Scale Text-to-Image Diffusion Models | Jun 1, 2023 | GPU, Image Compression | Code Available | 2 |
| Thought Cloning: Learning to Think while Acting by Imitating Human Thinking | Jun 1, 2023 | Imitation Learning, Reinforcement Learning (RL) | Code Available | 2 |
| ViCo: Plug-and-play Visual Condition for Personalized Text-to-image Generation | Jun 1, 2023 | Image Generation, Text to Image Generation | Code Available | 2 |
| A Transformer-based representation-learning model with unified processing of multimodal input for clinical diagnostics | Jun 1, 2023 | Diagnostic, Representation Learning | Code Available | 2 |
| Inserting Anybody in Diffusion Models via Celeb Basis | Jun 1, 2023 | | Code Available | 2 |
| Intelligent Grimm -- Open-ended Visual Storytelling via Latent Diffusion Models | Jun 1, 2023 | Image Generation, Story Visualization | Code Available | 2 |
| DeepSolo++: Let Transformer Decoder with Explicit Points Solo for Multilingual Text Spotting | May 31, 2023 | Decoder, Scene Text Detection | Code Available | 2 |
| Tree-Ring Watermarks: Fingerprints for Diffusion Images that are Invisible and Robust | May 31, 2023 | Image Generation | Code Available | 2 |
| MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training | May 31, 2023 | Language Modelling, Quantization | Code Available | 2 |
| Neural Kernel Surface Reconstruction | May 31, 2023 | Surface Reconstruction | Code Available | 2 |
| Improving CLIP Training with Language Rewrites | May 31, 2023 | In-Context Learning, Sentence | Code Available | 2 |
| Harnessing Explanations: LLM-to-LM Interpreter for Enhanced Text-Attributed Graph Representation Learning | May 31, 2023 | Decision Making, General Knowledge | Code Available | 2 |
| A Geometric Perspective on Diffusion Models | May 31, 2023 | Denoising | Code Available | 2 |
| Multi-modal Queried Object Detection in the Wild | May 30, 2023 | Few-Shot Object Detection, Object | Code Available | 2 |
| Encouraging Divergent Thinking in Large Language Models through Multi-Agent Debate | May 30, 2023 | Arithmetic Reasoning, Machine Translation | Code Available | 2 |
| Voice Conversion With Just Nearest Neighbors | May 30, 2023 | Voice Conversion | Code Available | 2 |
| HiFA: High-fidelity Text-to-3D Generation with Advanced Diffusion Guidance | May 30, 2023 | 3D Generation, 3D geometry | Code Available | 2 |
| Cones 2: Customizable Image Synthesis with Multiple Subjects | May 30, 2023 | Image Generation | Code Available | 2 |
| Are Large Kernels Better Teachers than Transformers for ConvNets? | May 30, 2023 | Knowledge Distillation | Code Available | 2 |