Paper | Date | Tasks | Code | Count
Hierarchical Prompting Taxonomy: A Universal Evaluation Framework for Large Language Models Aligned with Human Cognitive Principles | Jun 18, 2024 | Arithmetic Reasoning, Code Generation | Code Available | 1
P-TA: Using Proximal Policy Optimization to Enhance Tabular Data Augmentation via Large Language Models | Jun 17, 2024 | Common Sense Reasoning, Data Augmentation | Unverified | 0
Mixture-of-Subspaces in Low-Rank Adaptation | Jun 16, 2024 | Common Sense Reasoning, Image Generation | Code Available | 0
A Survey of Video Datasets for Grounded Event Understanding | Jun 14, 2024 | Common Sense Reasoning, Event Extraction | Code Available | 0
LLM-Driven Robots Risk Enacting Discrimination, Violence, and Unlawful Actions | Jun 13, 2024 | Common Sense Reasoning | Unverified | 0
DomainRAG: A Chinese Benchmark for Evaluating Domain-specific Retrieval-Augmented Generation | Jun 9, 2024 | Common Sense Reasoning, Denoising | Code Available | 1
BAMO at SemEval-2024 Task 9: BRAINTEASER: A Novel Task Defying Common Sense | Jun 7, 2024 | Common Sense Reasoning, Sentence | Code Available | 0
Think out Loud: Emotion Deducing Explanation in Dialogues | Jun 7, 2024 | Common Sense Reasoning, Emotion Cause Extraction | Unverified | 0
RoboMamba: Efficient Vision-Language-Action Model for Robotic Reasoning and Manipulation | Jun 6, 2024 | Common Sense Reasoning, Mamba | Unverified | 0
Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models | Jun 6, 2024 | Arithmetic Reasoning, Code Generation | Code Available | 0
Generative AI-in-the-loop: Integrating LLMs and GPTs into the Next Generation Networks | Jun 6, 2024 | Common Sense Reasoning, Intrusion Detection | Unverified | 0
mCSQA: Multilingual Commonsense Reasoning Dataset with Unified Creation Strategy by Language Models and Humans | Jun 6, 2024 | Common Sense Reasoning, Natural Language Understanding | Unverified | 0
Every Answer Matters: Evaluating Commonsense with Probabilistic Measures | Jun 6, 2024 | Common Sense Reasoning, Language Modeling | Code Available | 0
Do Language Models Understand Morality? Towards a Robust Detection of Moral Content | Jun 6, 2024 | Common Sense Reasoning, Natural Language Inference | Code Available | 0
Large Language Models as Evaluators for Recommendation Explanations | Jun 5, 2024 | Common Sense Reasoning, Instruction Following | Code Available | 1
RAG-based Crowdsourcing Task Decomposition via Masked Contrastive Learning with Prompts | Jun 4, 2024 | Common Sense Reasoning, Contrastive Learning | Unverified | 0
ACCORD: Closing the Commonsense Measurability Gap | Jun 4, 2024 | Benchmarking, Common Sense Reasoning | Code Available | 0
Extended Mind Transformers | Jun 4, 2024 | Common Sense Reasoning, counterfactual | Code Available | 2
Alice in Wonderland: Simple Tasks Showing Complete Reasoning Breakdown in State-Of-the-Art Large Language Models | Jun 4, 2024 | Common Sense Reasoning | Code Available | 4
Easy Problems That LLMs Get Wrong | May 30, 2024 | Common Sense Reasoning, Logical Reasoning | Code Available | 2
Can We Trust Embodied Agents? Exploring Backdoor Attacks against Embodied LLM-based Decision-Making Systems | May 27, 2024 | Autonomous Driving, Common Sense Reasoning | Unverified | 0
Synthesizing Programmatic Reinforcement Learning Policies with Large Language Model Guided Search | May 26, 2024 | Common Sense Reasoning, Language Modeling | Unverified | 0
Picturing Ambiguity: A Visual Twist on the Winograd Schema Challenge | May 25, 2024 | Common Sense Reasoning | Unverified | 0
iREL at SemEval-2024 Task 9: Improving Conventional Prompting Methods for Brain Teasers | May 25, 2024 | Common Sense Reasoning, Multiple-choice | Code Available | 0
Meteor: Mamba-based Traversal of Rationale for Large Language and Vision Models | May 24, 2024 | Common Sense Reasoning, Language Modelling | Code Available | 2
Regressor-free Molecule Generation to Support Drug Response Prediction | May 23, 2024 | Common Sense Reasoning, Drug Discovery | Unverified | 0
Large Language Models are Effective Priors for Causal Graph Discovery | May 22, 2024 | Common Sense Reasoning | Unverified | 0
FiDeLiS: Faithful Reasoning in Large Language Model for Knowledge Graph Question Answering | May 22, 2024 | Common Sense Reasoning, Graph Question Answering | Unverified | 0
DaVinci at SemEval-2024 Task 9: Few-shot prompting GPT-3.5 for Unconventional Reasoning | May 19, 2024 | Common Sense Reasoning, Sentence | Unverified | 0
Meta-Control: Automatic Model-based Control Synthesis for Heterogeneous Robot Skills | May 18, 2024 | Collision Avoidance, Common Sense Reasoning | Unverified | 0
OpenBA-V2: Reaching 77.3% High Compression Ratio with Fast Multi-Stage Pruning | May 9, 2024 | Common Sense Reasoning, named-entity-recognition | Code Available | 1
Soft Label PU Learning | May 3, 2024 | Common Sense Reasoning | Unverified | 0
The Power of Question Translation Training in Multilingual Reasoning: Broadened Scope and Deepened Insights | May 2, 2024 | Common Sense Reasoning, Translation | Unverified | 0
Artificial General Intelligence (AGI)-Native Wireless Systems: A Journey Beyond 6G | Apr 29, 2024 | Common Sense Reasoning | Unverified | 0
FoundaBench: Evaluating Chinese Fundamental Knowledge Capabilities of Large Language Models | Apr 29, 2024 | Common Sense Reasoning, Multiple-choice | Unverified | 0
Student Data Paradox and Curious Case of Single Student-Tutor Model: Regressive Side Effects of Training LLMs for Personalized Learning | Apr 23, 2024 | ARC, Common Sense Reasoning | Unverified | 0
SemEval-2024 Task 9: BRAINTEASER: A Novel Task Defying Common Sense | Apr 22, 2024 | Common Sense Reasoning | Unverified | 0
MixLoRA: Enhancing Large Language Models Fine-Tuning with LoRA-based Mixture of Experts | Apr 22, 2024 | Common Sense Reasoning, GPU | Code Available | 3
Exploring AIGC Video Quality: A Focus on Visual Harmony, Video-Text Consistency and Domain Distribution Gap | Apr 21, 2024 | Common Sense Reasoning | Code Available | 1
Concept Induction using LLMs: a user experiment for assessment | Apr 18, 2024 | Common Sense Reasoning, Explainable artificial intelligence | Unverified | 0
CorrespondentDream: Enhancing 3D Fidelity of Text-to-3D using Cross-View Correspondences | Apr 16, 2024 | Common Sense Reasoning, NeRF | Unverified | 0
Memory Sharing for Large Language Model based Agents | Apr 15, 2024 | Common Sense Reasoning, Diversity | Code Available | 1
VLLMs Provide Better Context for Emotion Understanding Through Common Sense Reasoning | Apr 10, 2024 | Common Sense Reasoning, Emotion Classification | Code Available | 1
Deep Reinforcement Learning-Based Approach for a Single Vehicle Persistent Surveillance Problem with Fuel Constraints | Apr 9, 2024 | Common Sense Reasoning, Deep Reinforcement Learning | Unverified | 0
DELTA: Decomposed Efficient Long-Term Robot Task Planning using Large Language Models | Apr 4, 2024 | Common Sense Reasoning, Computational Efficiency | Unverified | 0
Unveiling LLMs: The Evolution of Latent Representations in a Dynamic Knowledge Graph | Apr 4, 2024 | Claim Verification, Common Sense Reasoning | Code Available | 0
Stereotype Detection in LLMs: A Multiclass, Explainable, and Benchmark-Driven Approach | Apr 2, 2024 | Benchmarking, Common Sense Reasoning | Unverified | 0
Detect2Interact: Localizing Object Key Field in Visual Question Answering (VQA) with LLMs | Apr 1, 2024 | Common Sense Reasoning, Object | Unverified | 0
AILS-NTUA at SemEval-2024 Task 9: Cracking Brain Teasers: Transformer Models for Lateral Thinking Puzzles | Apr 1, 2024 | Common Sense Reasoning, Multiple-choice | Code Available | 0
ITCMA: A Generative Agent Based on a Computational Consciousness Structure | Mar 29, 2024 | Action Generation, Common Sense Reasoning | Unverified | 0