| Zero-shot Translation of Attention Patterns in VQA Models to Natural Language | Nov 8, 2023 | Image CaptioningLanguage Modeling | CodeCode Available | 0 |
| AI for All: Operationalising Diversity and Inclusion Requirements for AI Systems | Nov 7, 2023 | AllDecision Making | —Unverified | 0 |
| Multilingual Mathematical Autoformalization | Nov 7, 2023 | Few-Shot LearningLanguage Acquisition | CodeCode Available | 1 |
| Conversations in Galician: a Large Language Model for an Underrepresented Language | Nov 7, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Uncovering Intermediate Variables in Transformers using Circuit Probing | Nov 7, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Large Language Model based Long-tail Query Rewriting in Taobao Search | Nov 7, 2023 | Contrastive LearningLanguage Modeling | CodeCode Available | 3 |
| Formal Aspects of Language Modeling | Nov 7, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| mPLUG-Owl2: Revolutionizing Multi-modal Large Language Model with Modality Collaboration | Nov 7, 2023 | 1 Image, 2*2 StitchingDecoder | CodeCode Available | 4 |
| OLaLa: Ontology Matching with Large Language Models | Nov 7, 2023 | Graph MatchingLanguage Modeling | —Unverified | 0 |
| Evaluating the Effectiveness of Retrieval-Augmented Large Language Models in Scientific Document Reasoning | Nov 7, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Unified Low-Resource Sequence Labeling by Sample-Aware Dynamic Sparse Finetuning | Nov 7, 2023 | In-Context LearningLanguage Modeling | CodeCode Available | 0 |
| Meta-Adapter: An Online Few-shot Learner for Vision-Language Model | Nov 7, 2023 | Few-Shot Learningimage-classification | CodeCode Available | 1 |
| Aspects of human memory and Large Language Models | Nov 7, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Leveraging High-Level Synthesis and Large Language Models to Generate, Simulate, and Deploy a Uniform Random Number Generator Hardware Design | Nov 6, 2023 | High-Level SynthesisLanguage Modeling | —Unverified | 0 |
| ProPath: Disease-Specific Protein Language Model for Variant Pathogenicity | Nov 6, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CogVLM: Visual Expert for Pretrained Language Models | Nov 6, 2023 | 1 Image, 2*2 StitchingFS-MEVQA | CodeCode Available | 5 |
| Scalable and Transferable Black-Box Jailbreaks for Language Models via Persona Modulation | Nov 6, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| An Efficient Self-Supervised Cross-View Training For Sentence Embedding | Nov 6, 2023 | Contrastive LearningLanguage Modeling | CodeCode Available | 1 |
| DeepInception: Hypnotize Large Language Model to Be Jailbreaker | Nov 6, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| DAIL: Data Augmentation for In-Context Learning via Self-Paraphrase | Nov 6, 2023 | Data AugmentationIn-Context Learning | —Unverified | 0 |
| ALYMPICS: LLM Agents Meet Game Theory -- Exploring Strategic Decision-Making with AI Agents | Nov 6, 2023 | Decision MakingLanguage Modeling | CodeCode Available | 1 |
| Large language models implicitly learn to straighten neural sentence trajectories to construct a predictive representation of natural language | Nov 5, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LLM-enhanced Self-training for Cross-domain Constituency Parsing | Nov 5, 2023 | Constituency ParsingLanguage Modeling | CodeCode Available | 0 |
| CIRCLE: Multi-Turn Query Clarifications with Reinforcement Learning | Nov 5, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Assessing the Promise and Pitfalls of ChatGPT for Automated Code Generation | Nov 5, 2023 | Code GenerationLanguage Modeling | CodeCode Available | 0 |
| Understanding the Natural Language of DNA using Encoder-Decoder Foundation Models with Byte-level Precision | Nov 4, 2023 | DecoderLanguage Modeling | —Unverified | 0 |
| Can Chat GPT solve a Linguistics Exam? | Nov 4, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| COSMIC: Data Efficient Instruction-tuning For Speech In-Context Learning | Nov 3, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Efficient Black-Box Adversarial Attacks on Neural Text Detectors | Nov 3, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Too Much Information: Keeping Training Simple for BabyLMs | Nov 3, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Data-Free Distillation of Language Model by Text-to-Text Transfer | Nov 3, 2023 | Data-free Knowledge DistillationDiversity | —Unverified | 0 |
| GateLoop: Fully Data-Controlled Linear Recurrence for Sequence Modeling | Nov 3, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Supermind Ideator: Exploring generative AI to support creative problem-solving | Nov 3, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| EmojiLM: Modeling the New Emoji Language | Nov 3, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Effective Human-AI Teams via Learned Natural Language Rules and Onboarding | Nov 2, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Recommendations by Concise User Profiles from Review Text | Nov 2, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Predicting Question-Answering Performance of Large Language Models through Semantic Consistency | Nov 2, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Self-Influence Guided Data Reweighting for Language Model Pre-training | Nov 2, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Collaborative Large Language Model for Recommender Systems | Nov 2, 2023 | HallucinationLanguage Modeling | CodeCode Available | 1 |
| FlashDecoding++: Faster Large Language Model Inference on GPUs | Nov 2, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Continual Learning Under Language Shift | Nov 2, 2023 | Continual LearningLanguage Modeling | —Unverified | 0 |
| Expressive TTS Driven by Natural Language Prompts Using Few Human Annotations | Nov 2, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Mukh-Oboyob: Stable Diffusion and BanglaBERT enhanced Bangla Text-to-Face Synthesis | Nov 1, 2023 | Face GenerationImage Generation | CodeCode Available | 0 |
| An Improved Transformer-based Model for Detecting Phishing, Spam, and Ham: A Large Language Model Approach | Nov 1, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Language Model Training Paradigms for Clinical Feature Embeddings | Nov 1, 2023 | Clinical KnowledgeDimensionality Reduction | CodeCode Available | 0 |
| Plug-and-Play Policy Planner for Large Language Model Powered Dialogue Agents | Nov 1, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| ZEETAD: Adapting Pretrained Vision-Language Model for Zero-Shot End-to-End Temporal Action Detection | Nov 1, 2023 | Action DetectionClassification | —Unverified | 0 |
| Unleashing the Creative Mind: Language Model As Hierarchical Policy For Improved Exploration on Challenging Problem Solving | Nov 1, 2023 | In-Context LearningLanguage Modeling | CodeCode Available | 0 |
| Attention Alignment and Flexible Positional Embeddings Improve Transformer Length Extrapolation | Nov 1, 2023 | Code CompletionLanguage Modeling | —Unverified | 0 |
| Comparing Optimization Targets for Contrast-Consistent Search | Nov 1, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |