| BakedAvatar: Baking Neural Fields for Real-Time Head Avatar Synthesis | Nov 9, 2023 | Face ReenactmentNeRF | CodeCode Available | 2 |
| On the Road with GPT-4V(ision): Early Explorations of Visual-Language Model on Autonomous Driving | Nov 9, 2023 | Autonomous DrivingCommon Sense Reasoning | CodeCode Available | 2 |
| LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents | Nov 9, 2023 | Instruction FollowingLLM real-life tasks | CodeCode Available | 2 |
| A Survey on Hallucination in Large Language Models: Principles, Taxonomy, Challenges, and Open Questions | Nov 9, 2023 | HallucinationInformation Retrieval | CodeCode Available | 2 |
| Agent Lumos: Unified and Modular Training for Open-Source Language Agents | Nov 9, 2023 | MathQuestion Answering | CodeCode Available | 2 |
| A differentiable brain simulator bridging brain simulation and brain-inspired computing | Nov 9, 2023 | | CodeCode Available | 2 |
| BeLLM: Backward Dependency Enhanced Large Language Model for Sentence Embeddings | Nov 9, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| High-Performance Transformers for Table Structure Recognition Need Early Convolutions | Nov 9, 2023 | DecoderRepresentation Learning | CodeCode Available | 2 |
| CellPhoneDB v5: inferring cell-cell communication from single-cell multiomics data | Nov 8, 2023 | | CodeCode Available | 2 |
| Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Prior for Zero-shot Speaker Adaptation | Nov 8, 2023 | Style TransferVoice Conversion | CodeCode Available | 2 |
| Euclidean, Projective, Conformal: Choosing a Geometric Algebra for Equivariant Transformers | Nov 8, 2023 | | CodeCode Available | 2 |
| NExT-Chat: An LMM for Chat, Detection and Segmentation | Nov 8, 2023 | Referring ExpressionReferring Expression Segmentation | CodeCode Available | 2 |
| Rethinking Benchmark and Contamination for Language Models with Rephrased Samples | Nov 8, 2023 | HumanEvalMMLU | CodeCode Available | 2 |
| Neuro-GPT: Towards A Foundation Model for EEG | Nov 7, 2023 | Brain Computer InterfaceEEG | CodeCode Available | 2 |
| Black-Box Prompt Optimization: Aligning Large Language Models without Model Training | Nov 7, 2023 | GPU | CodeCode Available | 2 |
| A Survey of Large Language Models Attribution | Nov 7, 2023 | Survey | CodeCode Available | 2 |
| Towards Garment Sewing Pattern Reconstruction from a Single Image | Nov 7, 2023 | Garment ReconstructionTexture Synthesis | CodeCode Available | 2 |
| A Foundation Model for Music Informatics | Nov 6, 2023 | Information Retrievalmodel | CodeCode Available | 2 |
| Language Models are Super Mario: Absorbing Abilities from Homologous Models as a Free Lunch | Nov 6, 2023 | DecoderGSM8K | CodeCode Available | 2 |
| PhoGPT: Generative Pre-training for Vietnamese | Nov 6, 2023 | Instruction Following | CodeCode Available | 2 |
| Can LLMs Follow Simple Rules? | Nov 6, 2023 | | CodeCode Available | 2 |
| GLaMM: Pixel Grounding Large Multimodal Model | Nov 6, 2023 | Conversational Question AnsweringImage Captioning | CodeCode Available | 2 |
| QECO: A QoE-Oriented Computation Offloading Algorithm based on Deep Reinforcement Learning for Mobile Edge Computing | Nov 4, 2023 | Deep Reinforcement LearningEdge-computing | CodeCode Available | 2 |
| MFTCoder: Boosting Code LLMs with Multitask Fine-Tuning | Nov 4, 2023 | Multi-Task Learning | CodeCode Available | 2 |
| Simplifying Transformer Blocks | Nov 3, 2023 | Decoder | CodeCode Available | 2 |
| EmerNeRF: Emergent Spatial-Temporal Scene Decomposition via Self-Supervision | Nov 3, 2023 | Optical Flow EstimationSemantic Segmentation | CodeCode Available | 2 |
| Medical Image Segmentation with Domain Adaptation: A Survey | Nov 3, 2023 | Domain AdaptationImage Segmentation | CodeCode Available | 2 |
| Large Language Models Illuminate a Progressive Pathway to Artificial Healthcare Assistant: A Review | Nov 3, 2023 | Diagnostic | CodeCode Available | 2 |
| PPI++: Efficient Prediction-Powered Inference | Nov 2, 2023 | Prediction | CodeCode Available | 2 |
| Diffusion Models for Reinforcement Learning: A Survey | Nov 2, 2023 | reinforcement-learningReinforcement Learning | CodeCode Available | 2 |
| Adapting Frechet Audio Distance for Generative Music Evaluation | Nov 2, 2023 | FAD | CodeCode Available | 2 |
| ProAgent: From Robotic Process Automation to Agentic Process Automation | Nov 2, 2023 | Decision Making | CodeCode Available | 2 |
| TopicGPT: A Prompt-based Topic Modeling Framework | Nov 2, 2023 | SpecificityTopic Models | CodeCode Available | 2 |
| Instruction Distillation Makes Large Language Models Efficient Zero-shot Rankers | Nov 2, 2023 | Prompt Engineering | CodeCode Available | 2 |
| JADE: A Linguistics-based Safety Evaluation Platform for Large Language Models | Nov 1, 2023 | Natural Questions | CodeCode Available | 2 |
| OpenForest: A data catalogue for machine learning in forest monitoring | Nov 1, 2023 | | CodeCode Available | 2 |
| SoulChat: Improving LLMs' Empathy, Listening, and Comfort Abilities through Fine-tuning with Multi-turn Empathy Conversations | Nov 1, 2023 | | CodeCode Available | 2 |
| Efficient LLM Inference on CPUs | Nov 1, 2023 | Quantization | CodeCode Available | 2 |
| Low-latency Real-time Voice Conversion on CPU | Nov 1, 2023 | CPUKnowledge Distillation | CodeCode Available | 2 |
| What's In My Big Data? | Oct 31, 2023 | Benchmarking | CodeCode Available | 2 |
| SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction | Oct 31, 2023 | PredictionSemantic Similarity | CodeCode Available | 2 |
| CapsFusion: Rethinking Image-Text Data at Scale | Oct 31, 2023 | World Knowledge | CodeCode Available | 2 |
| ZoomNeXt: A Unified Collaborative Pyramid Network for Camouflaged Object Detection | Oct 31, 2023 | Camouflaged Object Segmentation | CodeCode Available | 2 |
| Mathematical Introduction to Deep Learning: Methods, Implementations, and Theory | Oct 31, 2023 | Deep Learning | CodeCode Available | 2 |
| Modular Boundaries in Recurrent Neural Networks | Oct 31, 2023 | Community DetectionDimensionality Reduction | CodeCode Available | 2 |
| TransXNet: Learning Both Global and Local Dynamics with a Dual Dynamic Token Mixer for Visual Recognition | Oct 30, 2023 | Image ClassificationObject Detection | CodeCode Available | 2 |
| Evaluating Large Language Models: A Comprehensive Survey | Oct 30, 2023 | Survey | CodeCode Available | 2 |
| Large Trajectory Models are Scalable Motion Predictors and Planners | Oct 30, 2023 | Autonomous DrivingLanguage Modeling | CodeCode Available | 2 |
| Battle of the Backbones: A Large-Scale Comparison of Pretrained Models across Computer Vision Tasks | Oct 30, 2023 | Benchmarkingobject-detection | CodeCode Available | 2 |
| Atom: Low-bit Quantization for Efficient and Accurate LLM Serving | Oct 29, 2023 | GPUQuantization | CodeCode Available | 2 |