| LoRA+: Efficient Low Rank Adaptation of Large Models | Feb 19, 2024 | | Code Available | 3 |
| ALLaVA: Harnessing GPT4V-Synthesized Data for Lite Vision-Language Models | Feb 18, 2024 | Language Modelling, Question Answering | Code Available | 3 |
| 3D Diffuser Actor: Policy Diffusion with 3D Scene Representations | Feb 18, 2024 | Denoising, Robot Manipulation | Code Available | 3 |
| EventRL: Enhancing Event Extraction with Outcome Supervision for Large Language Models | Feb 18, 2024 | Event Extraction, Hallucination | Code Available | 3 |
| GenAD: Generative End-to-End Autonomous Driving | Feb 18, 2024 | Autonomous Driving, Bench2Drive | Code Available | 3 |
| OneBit: Towards Extremely Low-bit Large Language Models | Feb 17, 2024 | Quantization | Code Available | 3 |
| LLMDFA: Analyzing Dataflow in Code with Large Language Models | Feb 16, 2024 | Hallucination | Code Available | 3 |
| 3D Diffuser Actor: Policy Diffusion with 3D Scene Representations | Feb 16, 2024 | Denoising, Robot Manipulation | Code Available | 3 |
| Discovering and exploring cases of educational source code plagiarism with Dolos | Feb 16, 2024 | | Code Available | 3 |
| BitDelta: Your Fine-Tune May Only Be Worth One Bit | Feb 15, 2024 | GPU | Code Available | 3 |
| Spike-driven Transformer V2: Meta Spiking Neural Network Architecture Inspiring the Design of Next-generation Neuromorphic Chips | Feb 15, 2024 | | Code Available | 3 |
| QuRating: Selecting High-Quality Data for Training Language Models | Feb 15, 2024 | In-Context Learning | Code Available | 3 |
| Data Engineering for Scaling Language Models to 128K Context | Feb 15, 2024 | 4k, Continual Pretraining | Code Available | 3 |
| OptiMUS: Scalable Optimization Modeling with (MI)LP Solvers and Large Language Models | Feb 15, 2024 | Language Modeling, Language Modelling | Code Available | 3 |
| Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning | Feb 15, 2024 | Data Augmentation, Instruction Following | Code Available | 3 |
| GES: Generalized Exponential Splatting for Efficient Radiance Field Rendering | Feb 15, 2024 | 3D Reconstruction, Novel View Synthesis | Code Available | 3 |
| Traj-LIO: A Resilient Multi-LiDAR Multi-IMU State Estimator Through Sparse Gaussian Process | Feb 14, 2024 | | Code Available | 3 |
| Magic-Me: Identity-Specific Video Customized Diffusion | Feb 14, 2024 | Image Generation, Text to Image Generation | Code Available | 3 |
| PreFLMR: Scaling Up Fine-Grained Late-Interaction Multi-modal Retrievers | Feb 13, 2024 | Question Answering, Retrieval | Code Available | 3 |
| VerMCTS: Synthesizing Multi-Step Programs using a Verifier, a Large Language Model, and Tree Search | Feb 13, 2024 | Language Modeling, Language Modelling | Code Available | 3 |
| SPO: Sequential Monte Carlo Policy Optimisation | Feb 12, 2024 | Decision Making, Model-based Reinforcement Learning | Code Available | 3 |
| PoisonedRAG: Knowledge Corruption Attacks to Retrieval-Augmented Generation of Large Language Models | Feb 12, 2024 | Answer Generation, Hallucination | Code Available | 3 |
| Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models | Feb 12, 2024 | Language Modeling, Language Modelling | Code Available | 3 |
| Scaling Laws for Fine-Grained Mixture of Experts | Feb 12, 2024 | Mixture-of-Experts | Code Available | 3 |
| Q-Bench+: A Benchmark for Multi-modal Foundation Models on Low-level Vision from Single Images to Pairs | Feb 11, 2024 | Image Quality Assessment, Question Answering | Code Available | 3 |
| X-LoRA: Mixture of Low-Rank Adapter Experts, a Flexible Framework for Large Language Models with Applications in Protein Mechanics and Molecular Design | Feb 11, 2024 | graph construction, Knowledge Graphs | Code Available | 3 |
| OpenFedLLM: Training Large Language Models on Decentralized Private Data via Federated Learning | Feb 10, 2024 | Federated Learning, Instruction Following | Code Available | 3 |
| Fiddler: CPU-GPU Orchestration for Fast Inference of Mixture-of-Experts Models | Feb 10, 2024 | CPU, GPU | Code Available | 3 |
| ResumeFlow: An LLM-facilitated Pipeline for Personalized Resume Generation and Refinement | Feb 9, 2024 | Hallucination, Language Modelling | Code Available | 3 |
| FNSPID: A Comprehensive Financial News Dataset in Time Series | Feb 9, 2024 | Financial Analysis, Time Series | Code Available | 3 |
| ForestColl: Throughput-Optimal Collective Communications on Heterogeneous Network Fabrics | Feb 9, 2024 | | Code Available | 3 |
| HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting | Feb 9, 2024 | | Code Available | 3 |
| The boundary of neural network trainability is fractal | Feb 9, 2024 | | Code Available | 3 |
| Noise Contrastive Alignment of Language Models with Explicit Rewards | Feb 8, 2024 | Language Modelling, Math | Code Available | 3 |
| Knowledge Graphs Meet Multi-Modal Learning: A Comprehensive Survey | Feb 8, 2024 | Articles, Entity Alignment | Code Available | 3 |
| Editable Scene Simulation for Autonomous Driving via Collaborative LLM-Agents | Feb 8, 2024 | Autonomous Driving, Language Modeling | Code Available | 3 |
| Generative Flows on Discrete State-Spaces: Enabling Multimodal Flows with Applications to Protein Co-Design | Feb 7, 2024 | | Code Available | 3 |
| Anatomically-Controllable Medical Image Generation with Segmentation-Guided Diffusion Models | Feb 7, 2024 | counterfactual, Image Generation | Code Available | 3 |
| MEMORYLLM: Towards Self-Updatable Large Language Models | Feb 7, 2024 | Model Editing | Code Available | 3 |
| InfLLM: Training-Free Long-Context Extrapolation for LLMs with an Efficient Context Memory | Feb 7, 2024 | | Code Available | 3 |
| Temporal Graph Analysis with TGX | Feb 6, 2024 | | Code Available | 3 |
| ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation | Feb 6, 2024 | Image to Video Generation, Video Generation | Code Available | 3 |
| Does confidence calibration improve conformal prediction? | Feb 6, 2024 | Conformal Prediction, Prediction | Code Available | 3 |
| OASim: an Open and Adaptive Simulator based on Neural Rendering for Autonomous Driving | Feb 6, 2024 | Autonomous Driving, Neural Rendering | Code Available | 3 |
| CogCoM: Train Large Vision-Language Models Diving into Details through Chain of Manipulations | Feb 6, 2024 | Visual Reasoning | Code Available | 3 |
| AnyTool: Self-Reflective, Hierarchical Agents for Large-Scale API Calls | Feb 6, 2024 | Language Modeling, Language Modelling | Code Available | 3 |
| DistiLLM: Towards Streamlined Distillation for Large Language Models | Feb 6, 2024 | Instruction Following, Knowledge Distillation | Code Available | 3 |
| The Hedgehog & the Porcupine: Expressive Linear Attentions with Softmax Mimicry | Feb 6, 2024 | | Code Available | 3 |
| BiLLM: Pushing the Limit of Post-Training Quantization for LLMs | Feb 6, 2024 | Binarization, GPU | Code Available | 3 |
| Deep Learning for Multivariate Time Series Imputation: A Survey | Feb 6, 2024 | Deep Learning, Imputation | Code Available | 3 |