| Subjective-Aligned Dataset and Metric for Text-to-Video Quality Assessment | Mar 18, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Let's Focus on Neuron: Neuron-Level Supervised Fine-tuning for Large Language Model | Mar 18, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| FlexCap: Describe Anything in Images in Controllable Detail | Mar 18, 2024 | AttributeDense Captioning | —Unverified | 0 |
| Revisiting The Classics: A Study on Identifying and Rectifying Gender Stereotypes in Rhymes and Poems | Mar 18, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Embedded Named Entity Recognition using Probing Classifiers | Mar 18, 2024 | DecoderFact Checking | CodeCode Available | 0 |
| Scene-LLM: Extending Language Model for 3D Visual Understanding and Reasoning | Mar 18, 2024 | 3D Question Answering (3D-QA)Dense Captioning | —Unverified | 0 |
| Linguacodus: A Synergistic Framework for Transformative Code Generation in Machine Learning Pipelines | Mar 18, 2024 | Code GenerationLanguage Modeling | —Unverified | 0 |
| LLM3:Large Language Model-based Task and Motion Planning with Motion Failure Reasoning | Mar 18, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Zero-shot Compound Expression Recognition with Visual Language Model at the 6th ABAW Challenge | Mar 18, 2024 | Facial Expression RecognitionLanguage Modeling | —Unverified | 0 |
| Can LLM-Augmented autonomous agents cooperate?, An evaluation of their cooperative capabilities through Melting Pot | Mar 18, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Reasoning in Transformers - Mitigating Spurious Correlations and Reasoning Shortcuts | Mar 17, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Tokensome: Towards a Genetic Vision-Language GPT for Explainable and Cognitive Karyotyping | Mar 17, 2024 | Anomaly DetectionDecision Making | —Unverified | 0 |
| Training A Small Emotional Vision Language Model for Visual Art Comprehension | Mar 17, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Correcting misinformation on social media with a large language model | Mar 17, 2024 | Fact CheckingLanguage Modeling | CodeCode Available | 0 |
| Large language model-powered chatbots for internationalizing student support in higher education | Mar 16, 2024 | ChatbotLanguage Modeling | —Unverified | 0 |
| Integrating Wearable Sensor Data and Self-reported Diaries for Personalized Affect Forecasting | Mar 16, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Initial Decoding with Minimally Augmented Language Model for Improved Lattice Rescoring in Low Resource ASR | Mar 16, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Detecting Bias in Large Language Models: Fine-tuned KcBERT | Mar 16, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Towards Robustness and Diversity: Continual Learning in Dialog Generation with Text-Mixup and Batch Nuclear-Norm Maximization | Mar 16, 2024 | Continual LearningData Augmentation | —Unverified | 0 |
| Energy-Based Models with Applications to Speech and Language Processing | Mar 16, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| GAgent: An Adaptive Rigid-Soft Gripping Agent with Vision Language Models for Complex Lighting Environments | Mar 16, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Exploring Chinese Humor Generation: A Study on Two-Part Allegorical Sayings | Mar 16, 2024 | Contrastive LearningLanguage Modeling | —Unverified | 0 |
| SelfIE: Self-Interpretation of Large Language Model Embeddings | Mar 16, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| ChatPattern: Layout Pattern Customization via Natural Language | Mar 15, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Leveraging vision-language models for fair facial attribute classification | Mar 15, 2024 | AttributeFacial Attribute Classification | —Unverified | 0 |
| Ignore Me But Don't Replace Me: Utilizing Non-Linguistic Elements for Pretraining on the Cybersecurity Domain | Mar 15, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| TextBlockV2: Towards Precise-Detection-Free Scene Text Spotting with Pre-trained Language Model | Mar 15, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MYTE: Morphology-Driven Byte Encoding for Better and Fairer Multilingual Language Modeling | Mar 15, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Generative Region-Language Pretraining for Open-Ended Object Detection | Mar 15, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Using an LLM to Turn Sign Spottings into Spoken Language Sentences | Mar 15, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Large Language Model-informed ECG Dual Attention Network for Heart Failure Risk Prediction | Mar 15, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Improving Medical Multi-modal Contrastive Learning with Expert Annotations | Mar 15, 2024 | Contrastive LearningCross-Modal Retrieval | CodeCode Available | 0 |
| VideoAgent: Long-form Video Understanding with Large Language Model as Agent | Mar 15, 2024 | EgoSchemaForm | CodeCode Available | 2 |
| RAFT: Adapting Language Model to Domain Specific RAG | Mar 15, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| CDGP: Automatic Cloze Distractor Generation based on Pre-trained Language Model | Mar 15, 2024 | Cloze TestDistractor Generation | CodeCode Available | 1 |
| EfficientVMamba: Atrous Selective Scan for Light Weight Visual Mamba | Mar 15, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Right Place, Right Time! Dynamizing Topological Graphs for Embodied Navigation | Mar 14, 2024 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Fisher Mask Nodes for Language Model Merging | Mar 14, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| What Was Your Prompt? A Remote Keylogging Attack on AI Assistants | Mar 14, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| LAMP: A Language Model on the Map | Mar 14, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models | Mar 14, 2024 | Decoderimage-classification | CodeCode Available | 1 |
| Anomaly Detection by Adapting a pre-trained Vision Language Model | Mar 14, 2024 | Anomaly DetectionLanguage Modeling | —Unverified | 0 |
| B-AVIBench: Towards Evaluating the Robustness of Large Vision-Language Model on Black-box Adversarial Visual-Instructions | Mar 14, 2024 | FairnessLanguage Modeling | CodeCode Available | 0 |
| VisionGPT: Vision-Language Understanding Agent Using Generalized Multimodal Framework | Mar 14, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| GiT: Towards Generalist Vision Transformer through Universal Language Interface | Mar 14, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Semiparametric Token-Sequence Co-Supervision | Mar 14, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Logical Discrete Graphical Models Must Supplement Large Language Models for Information Synthesis | Mar 14, 2024 | Information RetrievalLanguage Modeling | —Unverified | 0 |
| ProSwitch: Knowledge-Guided Instruction Tuning to Switch Between Professional and Non-Professional Responses | Mar 14, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Do Large Language Models Solve ARC Visual Analogies Like People Do? | Mar 13, 2024 | ARCLanguage Modeling | CodeCode Available | 0 |
| Bifurcated Attention: Accelerating Massively Parallel Decoding with Shared Prefixes in LLMs | Mar 13, 2024 | 8kAnswer Generation | —Unverified | 0 |