| SOTOPIA: Interactive Evaluation for Social Intelligence in Language Agents | Oct 18, 2023 | | CodeCode Available | 2 |
| Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture | Oct 18, 2023 | 4kimage-classification | CodeCode Available | 2 |
| Iterative Methods for Vecchia-Laplace Approximations for Latent Gaussian Process Models | Oct 18, 2023 | | CodeCode Available | 2 |
| LLMs as Hackers: Autonomous Linux Privilege Escalation Attacks | Oct 17, 2023 | In-Context Learning | CodeCode Available | 2 |
| BitNet: Scaling 1-bit Transformers for Large Language Models | Oct 17, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| GenEval: An Object-Focused Framework for Evaluating Text-to-Image Alignment | Oct 17, 2023 | AttributeObject | CodeCode Available | 2 |
| Real-time Photorealistic Dynamic Scene Representation and Rendering with 4D Gaussian Splatting | Oct 16, 2023 | | CodeCode Available | 2 |
| ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models | Oct 16, 2023 | General Reinforcement LearningGPU | CodeCode Available | 2 |
| AdaLomo: Low-memory Optimization with Adaptive Learning Rate | Oct 16, 2023 | | CodeCode Available | 2 |
| LAMP: Learn A Motion Pattern for Few-Shot-Based Video Generation | Oct 16, 2023 | GPUImage Animation | CodeCode Available | 2 |
| IDRNet: Intervention-Driven Relation Network for Semantic Segmentation | Oct 16, 2023 | RelationRelation Network | CodeCode Available | 2 |
| FATE-LLM: A Industrial Grade Federated Learning Framework for Large Language Models | Oct 16, 2023 | Federated Learningparameter-efficient fine-tuning | CodeCode Available | 2 |
| HairCLIPv2: Unifying Hair Editing via Proxy Feature Blending | Oct 16, 2023 | Attribute | CodeCode Available | 2 |
| On Generative Agents in Recommendation | Oct 16, 2023 | Collaborative FilteringMovie Recommendation | CodeCode Available | 2 |
| Character-LLM: A Trainable Agent for Role-Playing | Oct 16, 2023 | | CodeCode Available | 2 |
| Few-Shot Learning Patterns in Financial Time-Series for Trend-Following Strategies | Oct 16, 2023 | Few-Shot LearningTime Series | CodeCode Available | 2 |
| The Calysto Scheme Project | Oct 16, 2023 | | CodeCode Available | 2 |
| Generative Adversarial Training for Text-to-Speech Synthesis Based on Raw Phonetic Input and Explicit Prosody Modelling | Oct 14, 2023 | Speech Synthesistext-to-speech | CodeCode Available | 2 |
| An Expression Tree Decoding Strategy for Mathematical Equation Generation | Oct 14, 2023 | MathMathematical Reasoning | CodeCode Available | 2 |
| Hawkeye: A PyTorch-based Library for Fine-Grained Image Recognition with Deep Learning | Oct 14, 2023 | Fine-Grained Image Recognition | CodeCode Available | 2 |
| A Setwise Approach for Effective and Highly Efficient Zero-shot Ranking with Large Language Models | Oct 14, 2023 | Document Ranking | CodeCode Available | 2 |
| From CLIP to DINO: Visual Encoders Shout in Multi-modal Large Language Models | Oct 13, 2023 | HallucinationImage Captioning | CodeCode Available | 2 |
| ChatKBQA: A Generate-then-Retrieve Framework for Knowledge Base Question Answering with Fine-tuned Large Language Models | Oct 13, 2023 | Knowledge Base Question AnsweringKnowledge Graphs | CodeCode Available | 2 |
| X-Pose: Detecting Any Keypoints | Oct 12, 2023 | 2D Human Pose Estimation2D Pose Estimation | CodeCode Available | 2 |
| PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm | Oct 12, 2023 | 3D Object Detection3D Reconstruction | CodeCode Available | 2 |
| GaussianDreamer: Fast Generation from Text to 3D Gaussians by Bridging 2D and 3D Diffusion Models | Oct 12, 2023 | GPUText to 3D | CodeCode Available | 2 |
| Jailbreaking Black Box Large Language Models in Twenty Queries | Oct 12, 2023 | | CodeCode Available | 2 |
| Learning to Act from Actionless Videos through Dense Correspondences | Oct 12, 2023 | | CodeCode Available | 2 |
| UniPAD: A Universal Pre-training Paradigm for Autonomous Driving | Oct 12, 2023 | 3D Object Detection3D Semantic Segmentation | CodeCode Available | 2 |
| DeltaSpace: A Semantic-aligned Feature Space for Flexible Text-guided Image Editing | Oct 12, 2023 | text-guided-image-editing | CodeCode Available | 2 |
| LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models | Oct 12, 2023 | Natural Language UnderstandingQuantization | CodeCode Available | 2 |
| OmniControl: Control Any Joint at Any Time for Human Motion Generation | Oct 12, 2023 | Motion Generation | CodeCode Available | 2 |
| Im4D: High-Fidelity and Real-Time Novel View Synthesis for Dynamic Scenes | Oct 12, 2023 | GPUNovel View Synthesis | CodeCode Available | 2 |
| Octopus: Embodied Vision-Language Programmer from Environmental Feedback | Oct 12, 2023 | BenchmarkingCode Generation | CodeCode Available | 2 |
| Survey on Factuality in Large Language Models: Knowledge, Retrieval and Domain-Specificity | Oct 11, 2023 | RetrievalSpecificity | CodeCode Available | 2 |
| ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models | Oct 11, 2023 | Image Generation | CodeCode Available | 2 |
| ProbTS: Benchmarking Point and Distributional Forecasting across Diverse Prediction Horizons | Oct 11, 2023 | BenchmarkingPosition | CodeCode Available | 2 |
| VeCLIP: Improving CLIP Training via Visual-enriched Captions | Oct 11, 2023 | Image-text RetrievalRetrieval | CodeCode Available | 2 |
| Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models | Oct 11, 2023 | Code GenerationImage Generation | CodeCode Available | 2 |
| LLark: A Multimodal Instruction-Following Language Model for Music | Oct 11, 2023 | Instruction FollowingLanguage Modeling | CodeCode Available | 2 |
| Large Language Models Are Zero-Shot Time Series Forecasters | Oct 11, 2023 | ImputationTime Series | CodeCode Available | 2 |
| DrivingDiffusion: Layout-Guided multi-view driving scene video generation with latent diffusion model | Oct 11, 2023 | Autonomous DrivingImage Generation | CodeCode Available | 2 |
| Whispering LLaMA: A Cross-Modal Generative Error Correction Framework for Speech Recognition | Oct 10, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 2 |
| Making Large Language Models Perform Better in Knowledge Graph Completion | Oct 10, 2023 | In-Context LearningKnowledge Graph Completion | CodeCode Available | 2 |
| TopoMLP: A Simple yet Strong Pipeline for Driving Topology Reasoning | Oct 10, 2023 | 3D Lane DetectionAutonomous Driving | CodeCode Available | 2 |
| A Semantic Invariant Robust Watermark for Large Language Models | Oct 10, 2023 | | CodeCode Available | 2 |
| Lemur: Harmonizing Natural Language and Code for Language Agents | Oct 10, 2023 | | CodeCode Available | 2 |
| Uni3D: Exploring Unified 3D Representation at Scale | Oct 10, 2023 | 3D Object ClassificationRetrieval | CodeCode Available | 2 |
| Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning | Oct 10, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Conformal Prediction for Deep Classifier via Label Ranking | Oct 10, 2023 | Conformal PredictionPrediction | CodeCode Available | 2 |