FLIP Reasoning Challenge Apr 16, 2025 Common Sense Reasoning image-classification
Code Code Available 0Shrinkage Initialization for Smooth Learning of Neural Networks Apr 12, 2025 Common Sense Reasoning
— Unverified 0What the HellaSwag? On the Validity of Common-Sense Reasoning Benchmarks Apr 10, 2025 Common Sense Reasoning HellaSwag
Code Code Available 0JEPA4Rec: Learning Effective Language Representations for Sequential Recommendation via Joint Embedding Predictive Architecture Apr 10, 2025 Common Sense Reasoning Descriptive
— Unverified 0InstructionBench: An Instructional Video Understanding Benchmark Apr 7, 2025 Common Sense Reasoning Multiple-choice
— Unverified 0Proposition of Affordance-Driven Environment Recognition Framework Using Symbol Networks in Large Language Models Apr 2, 2025 Common Sense Reasoning
— Unverified 0DynMoLE: Boosting Mixture of LoRA Experts Fine-Tuning with a Hybrid Routing Mechanism Apr 1, 2025 Common Sense Reasoning Computational Efficiency
Code Code Available 0WinoWhat: A Parallel Corpus of Paraphrased WinoGrande Sentences with Common Sense Categorization Mar 31, 2025 Common Sense Reasoning Memorization
— Unverified 0Information Gain Is Not All You Need Mar 28, 2025 All Common Sense Reasoning
Code Code Available 0GLRD: Global-Local Collaborative Reason and Debate with PSL for 3D Open-Vocabulary Detection Mar 26, 2025 Common Sense Reasoning Object
— Unverified 0Unbiasing through Textual Descriptions: Mitigating Representation Bias in Video Benchmarks Mar 24, 2025 Common Sense Reasoning Prediction
— Unverified 0A Study on Neuro-Symbolic Artificial Intelligence: Healthcare Perspectives Mar 23, 2025 Benchmarking Common Sense Reasoning
— Unverified 0Improving Preference Extraction In LLMs By Identifying Latent Knowledge Through Classifying Probes Mar 22, 2025 Common Sense Reasoning
— Unverified 0Don't Fight Hallucinations, Use Them: Estimating Image Realism using NLI over Atomic Facts Mar 20, 2025 Common Sense Reasoning Natural Language Inference
Code Code Available 0HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model Mar 13, 2025 Common Sense Reasoning Denoising
— Unverified 0Do I look like a `cat.n.01` to you? A Taxonomy Image Generation Benchmark Mar 13, 2025 Common Sense Reasoning Image Generation
— Unverified 0LVLM-Compress-Bench: Benchmarking the Broader Impact of Large Vision-Language Model Compression Mar 6, 2025 Benchmarking Common Sense Reasoning
Code Code Available 0The Box is in the Pen: Evaluating Commonsense Reasoning in Neural Machine Translation Mar 5, 2025 Common Sense Reasoning Machine Translation
Code Code Available 0LLM-Advisor: An LLM Benchmark for Cost-efficient Path Planning across Multiple Terrains Mar 3, 2025 Common Sense Reasoning Hallucination
— Unverified 0Code-as-Symbolic-Planner: Foundation Model-Based Robot Planning via Symbolic Code Generation Mar 3, 2025 Code Generation Common Sense Reasoning
— Unverified 0Personalized Causal Graph Reasoning for LLMs: A Case Study on Dietary Recommendations Feb 28, 2025 Common Sense Reasoning counterfactual
— Unverified 0FRIDA to the Rescue! Analyzing Synthetic Data Effectiveness in Object-Based Common Sense Reasoning for Disaster Response Feb 25, 2025 Common Sense Reasoning Disaster Response
— Unverified 0The Lottery LLM Hypothesis, Rethinking What Abilities Should LLM Compression Preserve? Feb 24, 2025 Arithmetic Reasoning Common Sense Reasoning
— Unverified 0KnowZRel: Common Sense Knowledge-based Zero-Shot Relationship Retrieval for Generalised Scene Graph Generation Feb 21, 2025 Common Sense Reasoning Graph Generation
Code Code Available 0PredictaBoard: Benchmarking LLM Score Predictability Feb 20, 2025 Benchmarking Common Sense Reasoning
Code Code Available 0Navigating Semantic Relations: Challenges for Language Models in Abstract Common-Sense Reasoning Feb 19, 2025 Common Sense Reasoning Mathematical Problem-Solving
— Unverified 0Tell Me Why: Incentivizing Explanations Feb 19, 2025 Common Sense Reasoning
— Unverified 0Vision-Based Generic Potential Function for Policy Alignment in Multi-Agent Reinforcement Learning Feb 19, 2025 Common Sense Reasoning Language Modeling
— Unverified 0Inference-Time Computations for LLM Reasoning and Planning: A Benchmark and Insights Feb 18, 2025 Arithmetic Reasoning Common Sense Reasoning
— Unverified 0Plant in Cupboard, Orange on Rably, Inat Aphone. Benchmarking Incremental Learning of Situation and Language Model using a Text-Simulated Situated Environment Feb 17, 2025 Benchmarking Common Sense Reasoning
— Unverified 0ViRAC: A Vision-Reasoning Agent Head Movement Control Framework in Arbitrary Virtual Environments Feb 14, 2025 Common Sense Reasoning
— Unverified 0Elucidation of the Concept of Consciousness from the Theory of Non-Human Communication Agents Feb 5, 2025 Common Sense Reasoning Philosophy
— Unverified 0Large Language Models as Common-Sense Heuristics Jan 31, 2025 Common Sense Reasoning
— Unverified 0MACI: Multi-Agent Collaborative Intelligence for Adaptive Reasoning and Temporal Planning Jan 28, 2025 Common Sense Reasoning Management
— Unverified 0PhysBench: Benchmarking and Enhancing Vision-Language Models for Physical World Understanding Jan 27, 2025 Benchmarking Common Sense Reasoning
— Unverified 0Large Language Models as Theory of Mind Aware Generative Agents with Counterfactual Reflection Jan 26, 2025 Common Sense Reasoning counterfactual
— Unverified 0Towards A Litmus Test for Common Sense Jan 17, 2025 ARC Common Sense Reasoning
— Unverified 0A note on bequest preferences in utility maximisation for modern tontines Jan 15, 2025 Common Sense Reasoning
— Unverified 0The Quest for Visual Understanding: A Journey Through the Evolution of Visual Question Answering Jan 13, 2025 Common Sense Reasoning Question Answering
— Unverified 0Common Sense Is All You Need Jan 11, 2025 All ARC
— Unverified 0MSWA: Refining Local Attention with Multi-ScaleWindow Attention Jan 2, 2025 Common Sense Reasoning Language Modeling
— Unverified 0DiSciPLE: Learning Interpretable Programs for Scientific Visual Discovery Jan 1, 2025 Common Sense Reasoning Density Estimation
— Unverified 0FirePlace: Geometric Refinements of LLM Common Sense Reasoning for 3D Object Placement Jan 1, 2025 3D geometry Common Sense Reasoning
— Unverified 0Titans: Learning to Memorize at Test Time Dec 31, 2024 Common Sense Reasoning Language Modeling
Code Code Available 0KnowRA: Knowledge Retrieval Augmented Method for Document-level Relation Extraction with Comprehensive Reasoning Abilities Dec 31, 2024 Common Sense Reasoning Document-level Relation Extraction
— Unverified 0Embodied Image Quality Assessment for Robotic Intelligence Dec 25, 2024 Common Sense Reasoning Image Quality Assessment
Code Code Available 0VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics Manipulation with Long-Horizon Reasoning Tasks Dec 24, 2024 Common Sense Reasoning Transfer Learning
— Unverified 0QUENCH: Measuring the gap between Indic and Non-Indic Contextual General Reasoning in LLMs Dec 16, 2024 Benchmarking Common Sense Reasoning
Code Code Available 0A Multimodal Social Agent Dec 11, 2024 Common Sense Reasoning Decision Making
— Unverified 0The Rosetta Paradox: Domain-Specific Performance Inversions in Large Language Models Dec 9, 2024 Common Sense Reasoning Specificity
— Unverified 0