Qwen2.5 Technical Report Dec 19, 2024 Common Sense Reasoning
Code Code Available 13LLaMA: Open and Efficient Foundation Language Models Feb 27, 2023 Arithmetic Reasoning Code Generation
Code Code Available 7Mamba: Linear-Time Sequence Modeling with Selective State Spaces Dec 1, 2023 2D Pose Estimation Common Sense Reasoning
Code Code Available 6Mistral 7B Oct 10, 2023 answerability prediction Arithmetic Reasoning
Code Code Available 6AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration Jun 1, 2023 Autonomous Driving Cloud Computing
Code Code Available 6Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling Apr 3, 2023 Common Sense Reasoning Coreference Resolution
Code Code Available 6GPT-4 Technical Report Mar 15, 2023 answerability prediction Arithmetic Reasoning
Code Code Available 6Training Compute-Optimal Large Language Models Mar 29, 2022 Anachronisms Analogical Similarity
Code Code Available 6Chain-of-Thought Prompting Elicits Reasoning in Large Language Models Jan 28, 2022 Common Sense Reasoning GSM8K
Code Code Available 6Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning Mar 18, 2025 3D Face Animation Common Sense Reasoning
Code Code Available 4WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation Mar 10, 2025 Common Sense Reasoning Image Generation
Code Code Available 4Gated Delta Networks: Improving Mamba2 with Delta Rule Dec 9, 2024 Common Sense Reasoning Language Modeling
Code Code Available 4Alice in Wonderland: Simple Tasks Showing Complete Reasoning Breakdown in State-Of-the-Art Large Language Models Jun 4, 2024 Common Sense Reasoning
Code Code Available 4G-Retriever: Retrieval-Augmented Generation for Textual Graph Understanding and Question Answering Feb 12, 2024 Common Sense Reasoning Graph Classification
Code Code Available 4Knowledge Fusion of Large Language Models Jan 19, 2024 Code Generation Common Sense Reasoning
Code Code Available 4Mixtral of Experts Jan 8, 2024 Code Generation Common Sense Reasoning
Code Code Available 4SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot Jan 2, 2023 Common Sense Reasoning Language Modelling
Code Code Available 4Galactica: A Large Language Model for Science Nov 16, 2022 Anachronisms Bias Detection
Code Code Available 4N-Grammer: Augmenting Transformers with latent n-grams Jul 13, 2022 Common Sense Reasoning Coreference Resolution
Code Code Available 4Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models Jun 9, 2022 Common Sense Reasoning Math
Code Code Available 4AlphaDrive: Unleashing the Power of VLMs in Autonomous Driving via Reinforcement Learning and Reasoning Mar 10, 2025 Autonomous Driving Common Sense Reasoning
Code Code Available 3CityWalker: Learning Embodied Urban Navigation from Web-Scale Videos Nov 26, 2024 Common Sense Reasoning Imitation Learning
Code Code Available 3MixLoRA: Enhancing Large Language Models Fine-Tuning with LoRA-based Mixture of Experts Apr 22, 2024 Common Sense Reasoning GPU
Code Code Available 3Common Sense Reasoning for Deepfake Detection Jan 31, 2024 Binary Classification Common Sense Reasoning
Code Code Available 3Generative agent-based modeling with actions grounded in physical, social, or digital space using Concordia Dec 6, 2023 Common Sense Reasoning
Code Code Available 3Reasoning with Language Model Prompting: A Survey Dec 19, 2022 Arithmetic Reasoning Common Sense Reasoning
Code Code Available 3ST-MoE: Designing Stable and Transferable Sparse Expert Models Feb 17, 2022 ARC Common Sense Reasoning
Code Code Available 3Finetuned Language Models Are Zero-Shot Learners Sep 3, 2021 ARC Common Sense Reasoning
Code Code Available 3Language Models are Few-Shot Learners May 28, 2020 answerability prediction Articles
Code Code Available 3BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding Oct 11, 2018 Citation Intent Classification Common Sense Reasoning
Code Code Available 3PrefixQuant: Eliminating Outliers by Prefixed Tokens for Large Language Models Quantization Oct 7, 2024 Common Sense Reasoning Quantization
Code Code Available 2RegMix: Data Mixture as Regression for Language Model Pre-training Jul 1, 2024 Common Sense Reasoning Language Modeling
Code Code Available 2Extended Mind Transformers Jun 4, 2024 Common Sense Reasoning counterfactual
Code Code Available 2Easy Problems That LLMs Get Wrong May 30, 2024 Common Sense Reasoning Logical Reasoning
Code Code Available 2Meteor: Mamba-based Traversal of Rationale for Large Language and Vision Models May 24, 2024 Common Sense Reasoning Language Modelling
Code Code Available 2OpenFMNav: Towards Open-Set Zero-Shot Object Navigation via Vision-Language Foundation Models Feb 16, 2024 Common Sense Reasoning Navigate
Code Code Available 2Parameter-Efficient Sparsity Crafting from Dense to Mixture-of-Experts for Instruction Tuning on General Tasks Jan 5, 2024 Arithmetic Reasoning Code Generation
Code Code Available 2Holodeck: Language Guided Generation of 3D Embodied AI Environments Dec 14, 2023 Common Sense Reasoning Language Modelling
Code Code Available 2On the Road with GPT-4V(ision): Early Explorations of Visual-Language Model on Autonomous Driving Nov 9, 2023 Autonomous Driving Common Sense Reasoning
Code Code Available 2LLM-FP4: 4-Bit Floating-Point Quantized Transformers Oct 25, 2023 Common Sense Reasoning Quantization
Code Code Available 2DiLu: A Knowledge-Driven Approach to Autonomous Driving with Large Language Models Sep 28, 2023 10-shot image generation 1 Image, 2*2 Stitchi
Code Code Available 2PointLLM: Empowering Large Language Models to Understand Point Clouds Aug 31, 2023 3D Object Captioning 3D Object Classification
Code Code Available 2OmniQuant: Omnidirectionally Calibrated Quantization for Large Language Models Aug 25, 2023 Common Sense Reasoning Computational Efficiency
Code Code Available 2Drive Like a Human: Rethinking Autonomous Driving with Large Language Models Jul 14, 2023 Autonomous Driving Common Sense Reasoning
Code Code Available 2GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest Jul 7, 2023 Attribute Common Sense Reasoning
Code Code Available 2Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memory May 25, 2023 Common Sense Reasoning CPU
Code Code Available 2LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models May 23, 2023 Common Sense Reasoning Image Generation
Code Code Available 2The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning May 23, 2023 Common Sense Reasoning Common Sense Reasoning (Zero-Shot)
Code Code Available 2Causal Reasoning and Large Language Models: Opening a New Frontier for Causality Apr 28, 2023 Causal Discovery Common Sense Reasoning
Code Code Available 2LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions Apr 27, 2023 Common Sense Reasoning Coreference Resolution
Code Code Available 2