| Long Context Alignment with Short Instructions and Synthesized Positions | May 7, 2024 | 16kInstruction Following | —Unverified | 0 |
| SnapKV: LLM Knows What You are Looking for Before Generation | Apr 22, 2024 | 16kGPU | CodeCode Available | 3 |
| FPT: Feature Prompt Tuning for Few-shot Readability Assessment | Apr 3, 2024 | 16kFew-Shot Text Classification | CodeCode Available | 0 |
| Long-form factuality in large language models | Mar 27, 2024 | 16kForm | CodeCode Available | 4 |
| RU22Fact: Optimizing Evidence for Multilingual Explainable Fact-Checking on Russia-Ukraine Conflict | Mar 25, 2024 | 16kClaim Verification | CodeCode Available | 0 |
| An AI-Assisted Skincare Routine Recommendation System in XR | Mar 20, 2024 | 16k | —Unverified | 0 |
| Human Evaluation of English--Irish Transformer-Based NMT | Mar 4, 2024 | 16kMachine Translation | —Unverified | 0 |
| Transformers for Low-Resource Languages:Is Féidir Linn! | Mar 4, 2024 | 16kHyperparameter Optimization | —Unverified | 0 |
| NoMAD-Attention: Efficient LLM Inference on CPUs Through Multiply-add-free Attention | Mar 2, 2024 | 16kCPU | CodeCode Available | 1 |
| Training-Free Long-Context Scaling of Large Language Models | Feb 27, 2024 | 16k | CodeCode Available | 3 |
| Divide-Conquer-and-Merge: Memory- and Time-Efficient Holographic Displays | Feb 25, 2024 | 16k8k | —Unverified | 0 |
| Hydragen: High-Throughput LLM Inference with Shared Prefixes | Feb 7, 2024 | 16kChatbot | CodeCode Available | 1 |
| LV-Eval: A Balanced Long-Context Benchmark with 5 Length Levels Up to 256K | Feb 6, 2024 | 16kBenchmarking | CodeCode Available | 2 |
| Analyzing the Effectiveness of Large Language Models on Text-to-SQL Synthesis | Jan 22, 2024 | 16kProgram Synthesis | CodeCode Available | 1 |
| Calpric: Inclusive and Fine-grain Labeling of Privacy Policies with Crowdsourcing and Active Learning | Jan 16, 2024 | 16kActive Learning | CodeCode Available | 0 |
| Detours for Navigating Instructional Videos | Jan 3, 2024 | 16kQuestion Answering | —Unverified | 0 |
| Compositional Zero-Shot Learning for Attribute-Based Object Reference in Human-Robot Interaction | Dec 21, 2023 | 16kAttribute | —Unverified | 0 |
| Beyond Accuracy: Statistical Measures and Benchmark for Evaluation of Representation from Self-Supervised Learning | Dec 2, 2023 | 16kDiversity | —Unverified | 0 |
| Factored Verification: Detecting and Reducing Hallucination in Summaries of Academic Papers | Oct 16, 2023 | 16kHallucination | CodeCode Available | 1 |
| Improved prompting and process for writing user personas with LLMs, using qualitative interviews: Capturing behaviour and personality traits of users | Oct 10, 2023 | 16k | —Unverified | 0 |
| Scaling Laws of RoPE-based Extrapolation | Oct 8, 2023 | 16k | CodeCode Available | 1 |
| Retrieval meets Long Context Large Language Models | Oct 4, 2023 | 16k4k | —Unverified | 0 |
| Home Electricity Data Generator (HEDGE): An open-access tool for the generation of electric vehicle, residential demand, and PV generation profiles | Oct 2, 2023 | 16k | CodeCode Available | 1 |
| Recursively Summarizing Enables Long-Term Dialogue Memory in Large Language Models | Aug 29, 2023 | 16k8k | —Unverified | 0 |
| LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding | Aug 28, 2023 | 16kCode Completion | CodeCode Available | 3 |
| Code Llama: Open Foundation Models for Code | Aug 24, 2023 | 16kCode Generation | CodeCode Available | 6 |
| Giraffe: Adventures in Expanding Context Lengths in LLMs | Aug 21, 2023 | 16k4k | CodeCode Available | 2 |
| Hadiths Classification Using a Novel Author-Based Hadith Classification Dataset (ABCD) | Aug 14, 2023 | 16kClassification | CodeCode Available | 0 |
| Detecting and Preventing Hallucinations in Large Vision Language Models | Aug 11, 2023 | 16kHallucination | CodeCode Available | 1 |
| LLaVAR: Enhanced Visual Instruction Tuning for Text-Rich Image Understanding | Jun 29, 2023 | 16kImage Captioning | CodeCode Available | 2 |
| The Expressive Leaky Memory Neuron: an Efficient and Expressive Phenomenological Neuron Model Can Solve Long-Horizon Tasks | Jun 14, 2023 | 16kClassification | CodeCode Available | 1 |
| Faster Causal Attention Over Large Sequences Through Sparse Flash Attention | Jun 1, 2023 | 16k8k | CodeCode Available | 1 |
| BertRLFuzzer: A BERT and Reinforcement Learning Based Fuzzer | May 21, 2023 | 16kreinforcement-learning | CodeCode Available | 0 |
| AI-assisted Code Authoring at Scale: Fine-tuning, deploying, and mixed methods evaluation | May 20, 2023 | 16k | —Unverified | 0 |
| Vcc: Scaling Transformers to 128K Tokens or More by Prioritizing Important Tokens | May 7, 2023 | 16k4k | —Unverified | 0 |
| Understanding Social Media Cross-Modality Discourse in Linguistic Space | Feb 26, 2023 | 16k | CodeCode Available | 0 |
| In-Context Learning with Many Demonstration Examples | Feb 9, 2023 | 16k8k | CodeCode Available | 1 |
| Leveraging Summary Guidance on Medical Report Summarization | Feb 8, 2023 | 16kAbstractive Text Summarization | —Unverified | 0 |
| An In-Depth Exploration of Person Re-Identification and Gait Recognition in Cloth-Changing Conditions | Jan 1, 2023 | 16kGait Recognition | CodeCode Available | 1 |
| Spectrograms Are Sequences of Patches | Oct 28, 2022 | 16kSelf-Supervised Learning | CodeCode Available | 0 |
| COLING 2022 Shared Task: LED Finteuning and Recursive Summary Generation for Automatic Summarization of Chapters from Novels | Oct 1, 2022 | 16k | —Unverified | 0 |
| CIRCLe: Color Invariant Representation Learning for Unbiased Classification of Skin Lesions | Aug 29, 2022 | 16kFairness | CodeCode Available | 1 |
| Investigating Efficiently Extending Transformers for Long Input Summarization | Aug 8, 2022 | 16kLong-range modeling | CodeCode Available | 3 |
| 0/1 Deep Neural Networks via Block Coordinate Descent | Jun 19, 2022 | 10-shot image generation | —Unverified | 0 |
| Improved two-stage hate speech classification for twitter based on Deep Neural Networks | Jun 8, 2022 | 16kAbusive Language | —Unverified | 0 |
| Introducing RezoJDM16k: a French KnowledgeGraph DataSet for Link Prediction | Jun 1, 2022 | 16kBenchmarking | —Unverified | 0 |
| FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness | May 27, 2022 | 16k4k | CodeCode Available | 6 |
| There’s a Time and Place for Reasoning Beyond the Image | May 1, 2022 | 16kArticles | CodeCode Available | 1 |
| Hierarchical Nearest Neighbor Graph Embedding for Efficient Dimensionality Reduction | Mar 24, 2022 | 16kData Augmentation | CodeCode Available | 1 |
| There is a Time and Place for Reasoning Beyond the Image | Mar 1, 2022 | 16kArticles | CodeCode Available | 1 |