Comparing Apples to Oranges: A Dataset & Analysis of LLM Humour Understanding from Traditional Puns to Topical Jokes Jul 17, 2025 Common Sense Reasoning World Knowledge
— Unverified 0LoSiA: Efficient High-Rank Fine-Tuning via Subnet Localization and Optimization Jul 6, 2025 Common Sense Reasoning parameter-efficient fine-tuning
Code Code Available 0CheckManual: A New Challenge and Benchmark for Manual-based Appliance Manipulation Jun 11, 2025 Common Sense Reasoning Question Answering
— Unverified 0EditInspector: A Benchmark for Evaluation of Text-Guided Image Edits Jun 11, 2025 Artifact Detection Caption Generation
— Unverified 0Prime the search: Using large language models for guiding geometric task and motion planning by warm-starting tree search Jun 8, 2025 Common Sense Reasoning Motion Planning
Code Code Available 0AmbiK: Dataset of Ambiguous Tasks in Kitchen Environment Jun 4, 2025 Common Sense Reasoning
Code Code Available 0ATLAS: Learning to Optimally Memorize the Context at Test Time May 29, 2025 Common Sense Reasoning Language Modeling
— Unverified 0Spatial Knowledge Graph-Guided Multimodal Synthesis May 28, 2025 Common Sense Reasoning Knowledge Graphs
— Unverified 0CaseEdit: Enhancing Localized Commonsense Reasoning via Null-Space Constrained Knowledge Editing in Small Parameter Language Models May 26, 2025 Common Sense Reasoning Computational Efficiency
— Unverified 0SOLVE: Synergy of Language-Vision and End-to-End Networks for Autonomous Driving May 22, 2025 Autonomous Driving Common Sense Reasoning
— Unverified 0Align-GRAG: Reasoning-Guided Dual Alignment for Graph Retrieval-Augmented Generation May 22, 2025 Common Sense Reasoning Information Retrieval
— Unverified 0OSoRA: Output-Dimension and Singular-Value Initialized Low-Rank Adaptation May 20, 2025 Common Sense Reasoning Mathematical Reasoning
— Unverified 03D Visual Illusion Depth Estimation May 19, 2025 Common Sense Reasoning Depth Estimation
Code Code Available 1Empirically evaluating commonsense intelligence in large language models with large-scale human judgments May 15, 2025 Common Sense Reasoning
— Unverified 0ProdRev: A DNN framework for empowering customers using generative pre-trained transformers May 14, 2025 Abstractive Text Summarization Common Sense Reasoning
— Unverified 0Through the Looking Glass: Common Sense Consistency Evaluation of Weird Images May 12, 2025 Common Sense Reasoning
— Unverified 0AgentSGEN: Multi-Agent LLM in the Loop for Semantic Collaboration and GENeration of Synthetic Data May 7, 2025 Common Sense Reasoning
— Unverified 0Scenethesis: A Language and Vision Agentic Framework for 3D Scene Generation May 5, 2025 Common Sense Reasoning Scene Generation
— Unverified 0VideoHallu: Evaluating and Mitigating Multi-modal Hallucinations on Synthetic Video Understanding May 2, 2025 Anomaly Detection Common Sense Reasoning
Code Code Available 1UAV-VLN: End-to-End Vision Language guided Navigation for UAVs Apr 30, 2025 Common Sense Reasoning Instruction Following
— Unverified 0ScanEdit: Hierarchically-Guided Functional 3D Scan Editing Apr 21, 2025 3D scene Editing Common Sense Reasoning
— Unverified 0CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning Apr 18, 2025 Common Sense Reasoning image-classification
Code Code Available 1Creating 'Full-Stack' Hybrid Reasoning Systems that Prioritize and Enhance Human Intelligence Apr 18, 2025 Common Sense Reasoning Ingenuity
— Unverified 0FLIP Reasoning Challenge Apr 16, 2025 Common Sense Reasoning image-classification
Code Code Available 0Shrinkage Initialization for Smooth Learning of Neural Networks Apr 12, 2025 Common Sense Reasoning
— Unverified 0JEPA4Rec: Learning Effective Language Representations for Sequential Recommendation via Joint Embedding Predictive Architecture Apr 10, 2025 Common Sense Reasoning Descriptive
— Unverified 0What the HellaSwag? On the Validity of Common-Sense Reasoning Benchmarks Apr 10, 2025 Common Sense Reasoning HellaSwag
Code Code Available 0InstructionBench: An Instructional Video Understanding Benchmark Apr 7, 2025 Common Sense Reasoning Multiple-choice
— Unverified 0Proposition of Affordance-Driven Environment Recognition Framework Using Symbol Networks in Large Language Models Apr 2, 2025 Common Sense Reasoning
— Unverified 0DynMoLE: Boosting Mixture of LoRA Experts Fine-Tuning with a Hybrid Routing Mechanism Apr 1, 2025 Common Sense Reasoning Computational Efficiency
Code Code Available 0WinoWhat: A Parallel Corpus of Paraphrased WinoGrande Sentences with Common Sense Categorization Mar 31, 2025 Common Sense Reasoning Memorization
— Unverified 0Information Gain Is Not All You Need Mar 28, 2025 All Common Sense Reasoning
Code Code Available 0GLRD: Global-Local Collaborative Reason and Debate with PSL for 3D Open-Vocabulary Detection Mar 26, 2025 Common Sense Reasoning Object
— Unverified 0Global-Local Tree Search in VLMs for 3D Indoor Scene Generation Mar 24, 2025 Common Sense Reasoning Object
Code Code Available 1Unbiasing through Textual Descriptions: Mitigating Representation Bias in Video Benchmarks Mar 24, 2025 Common Sense Reasoning Prediction
— Unverified 0A Study on Neuro-Symbolic Artificial Intelligence: Healthcare Perspectives Mar 23, 2025 Benchmarking Common Sense Reasoning
— Unverified 0Improving Preference Extraction In LLMs By Identifying Latent Knowledge Through Classifying Probes Mar 22, 2025 Common Sense Reasoning
— Unverified 0Don't Fight Hallucinations, Use Them: Estimating Image Realism using NLI over Atomic Facts Mar 20, 2025 Common Sense Reasoning Natural Language Inference
Code Code Available 0Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning Mar 18, 2025 3D Face Animation Common Sense Reasoning
Code Code Available 4Do I look like a `cat.n.01` to you? A Taxonomy Image Generation Benchmark Mar 13, 2025 Common Sense Reasoning Image Generation
— Unverified 0HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model Mar 13, 2025 Common Sense Reasoning Denoising
— Unverified 0AlphaDrive: Unleashing the Power of VLMs in Autonomous Driving via Reinforcement Learning and Reasoning Mar 10, 2025 Autonomous Driving Common Sense Reasoning
Code Code Available 3WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation Mar 10, 2025 Common Sense Reasoning Image Generation
Code Code Available 4LVLM-Compress-Bench: Benchmarking the Broader Impact of Large Vision-Language Model Compression Mar 6, 2025 Benchmarking Common Sense Reasoning
Code Code Available 0The Box is in the Pen: Evaluating Commonsense Reasoning in Neural Machine Translation Mar 5, 2025 Common Sense Reasoning Machine Translation
Code Code Available 0LLM-Advisor: An LLM Benchmark for Cost-efficient Path Planning across Multiple Terrains Mar 3, 2025 Common Sense Reasoning Hallucination
— Unverified 0Code-as-Symbolic-Planner: Foundation Model-Based Robot Planning via Symbolic Code Generation Mar 3, 2025 Code Generation Common Sense Reasoning
— Unverified 0Personalized Causal Graph Reasoning for LLMs: A Case Study on Dietary Recommendations Feb 28, 2025 Common Sense Reasoning counterfactual
— Unverified 0FRIDA to the Rescue! Analyzing Synthetic Data Effectiveness in Object-Based Common Sense Reasoning for Disaster Response Feb 25, 2025 Common Sense Reasoning Disaster Response
— Unverified 0The Lottery LLM Hypothesis, Rethinking What Abilities Should LLM Compression Preserve? Feb 24, 2025 Arithmetic Reasoning Common Sense Reasoning
— Unverified 0