| Subjective-Aligned Dataset and Metric for Text-to-Video Quality Assessment | Mar 18, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Meta-Prompting for Automating Zero-shot Visual Recognition with LLMs | Mar 18, 2024 | Language ModellingLarge Language Model | CodeCode Available | 1 |
| Large Language Model-informed ECG Dual Attention Network for Heart Failure Risk Prediction | Mar 15, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Can We Talk Models Into Seeing the World Differently? | Mar 14, 2024 | Image CaptioningImage Classification | CodeCode Available | 1 |
| Emergence of Social Norms in Generative Agent Societies: Principles and Architecture | Mar 13, 2024 | Language ModellingLarge Language Model | CodeCode Available | 1 |
| ConspEmoLLM: Conspiracy Theory Detection Using an Emotion-Based Large Language Model | Mar 11, 2024 | Binary ClassificationLanguage Modeling | CodeCode Available | 1 |
| TRAD: Enhancing LLM Agents with Step-Wise Thought Retrieval and Aligned Decision | Mar 10, 2024 | Language ModellingLarge Language Model | CodeCode Available | 1 |
| Tapilot-Crossing: Benchmarking and Evolving LLMs Towards Interactive Data Analysis Agents | Mar 8, 2024 | BenchmarkingDecision Making | CodeCode Available | 1 |
| Generative News Recommendation | Mar 6, 2024 | ArticlesLanguage Modelling | CodeCode Available | 1 |
| Multi-modal Instruction Tuned LLMs with Fine-grained Visual Perception | Mar 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| KnowPhish: Large Language Models Meet Multimodal Knowledge Graphs for Enhancing Reference-Based Phishing Detection | Mar 4, 2024 | Knowledge GraphsLanguage Modelling | CodeCode Available | 1 |
| NoMAD-Attention: Efficient LLM Inference on CPUs Through Multiply-add-free Attention | Mar 2, 2024 | 16kCPU | CodeCode Available | 1 |
| DMoERM: Recipes of Mixture-of-Experts for Effective Reward Modeling | Mar 2, 2024 | Language ModellingLarge Language Model | CodeCode Available | 1 |
| A Cross-Modal Approach to Silent Speech with LLM-Enhanced Recognition | Mar 2, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| Generalizable Whole Slide Image Classification with Fine-Grained Visual-Semantic Interaction | Feb 29, 2024 | image-classificationImage Classification | CodeCode Available | 1 |
| Analyzing and Reducing Catastrophic Forgetting in Parameter Efficient Tuning | Feb 29, 2024 | Continual LearningLanguage Modelling | CodeCode Available | 1 |
| Large Language Models are Learnable Planners for Long-Term Recommendation | Feb 29, 2024 | Decision MakingLanguage Modelling | CodeCode Available | 1 |
| Characterizing Truthfulness in Large Language Model Generations with Local Intrinsic Dimension | Feb 28, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Grounding Language Models for Visual Entity Recognition | Feb 28, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| CogBench: a large language model walks into a psychology lab | Feb 28, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Cross-Modal Projection in Multimodal LLMs Doesn't Really Project Visual Attributes to Textual Space | Feb 26, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| MoZIP: A Multilingual Benchmark to Evaluate Large Language Models in Intellectual Property | Feb 26, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Empowering Large Language Model Agents through Action Learning | Feb 24, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| TV-SAM: Increasing Zero-Shot Segmentation Performance on Multimodal Medical Images Using GPT-4 Generated Descriptive Prompts Without Human Annotation | Feb 24, 2024 | DescriptiveLanguage Modeling | CodeCode Available | 1 |
| Self-Retrieval: End-to-End Information Retrieval with One Large Language Model | Feb 23, 2024 | Information RetrievalLanguage Modeling | CodeCode Available | 1 |