| Large Language Bayes | Apr 18, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| PV-VLM: A Multimodal Vision-Language Approach Incorporating Sky Images for Intra-Hour Photovoltaic Power Forecasting | Apr 18, 2025 | energy managementLanguage Modeling | —Unverified | 0 |
| Scaling sparse feature circuit finding for in-context learning | Apr 18, 2025 | In-Context LearningLarge Language Model | —Unverified | 0 |
| Towards a Multi-Agent Vision-Language System for Zero-Shot Novel Hazardous Object Detection for Autonomous Driving Safety | Apr 18, 2025 | Anomaly DetectionAutonomous Driving | CodeCode Available | 0 |
| Zero-Shot Industrial Anomaly Segmentation with Image-Aware Prompt Generation | Apr 18, 2025 | Anomaly SegmentationLanguage Modeling | CodeCode Available | 0 |
| RAG Without the Lag: Interactive Debugging for Retrieval-Augmented Generation Pipelines | Apr 18, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Chain-of-Thought Textual Reasoning for Few-shot Temporal Action Localization | Apr 18, 2025 | Action LocalizationAnomaly Detection | —Unverified | 0 |
| Retrieval-Augmented Generation with Conflicting Evidence | Apr 17, 2025 | Large Language ModelMisinformation | CodeCode Available | 1 |
| Are Retrials All You Need? Enhancing Large Language Model Reasoning Without Verbalized Feedback | Apr 17, 2025 | AllLanguage Modeling | —Unverified | 0 |
| Can LLMs reason over extended multilingual contexts? Towards long-context evaluation beyond retrieval and haystacks | Apr 17, 2025 | Epistemic ReasoningLarge Language Model | CodeCode Available | 0 |
| Pandora: A Code-Driven Large Language Model Agent for Unified Reasoning Across Diverse Structured Knowledge | Apr 17, 2025 | Knowledge GraphsLanguage Modeling | —Unverified | 0 |
| DIDS: Domain Impact-aware Data Sampling for Large Language Model Training | Apr 17, 2025 | Dimensionality ReductionLanguage Modeling | —Unverified | 0 |
| Causal-Copilot: An Autonomous Causal Analysis Agent | Apr 17, 2025 | Causal DiscoveryCausal Inference | —Unverified | 0 |
| ChatEXAONEPath: An Expert-level Multimodal Large Language Model for Histopathology Using Whole Slide Images | Apr 17, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| EarthGPT-X: Enabling MLLMs to Flexibly and Comprehensively Understand Multi-Source Remote Sensing Imagery | Apr 17, 2025 | Large Language ModelMulti-Task Learning | —Unverified | 0 |
| Enhancing the Geometric Problem-Solving Ability of Multimodal LLMs via Symbolic-Neural Integration | Apr 17, 2025 | Geometry Problem SolvingLarge Language Model | CodeCode Available | 1 |
| SmartFreeEdit: Mask-Free Spatial-Aware Image Editing with Complex Instruction Understanding | Apr 17, 2025 | Image GenerationLarge Language Model | CodeCode Available | 1 |
| Uncertainty-Aware Trajectory Prediction via Rule-Regularized Heteroscedastic Deep Classification | Apr 17, 2025 | DiversityGaussian Processes | CodeCode Available | 0 |
| SkyReels-V2: Infinite-length Film Generative Model | Apr 17, 2025 | Large Language Modelmodel | CodeCode Available | 9 |
| Mixer Metaphors: audio interfaces for non-musical applications | Apr 16, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| BitNet b1.58 2B4T Technical Report | Apr 16, 2025 | Computational EfficiencyCPU | —Unverified | 0 |
| Modular-Cam: Modular Dynamic Camera-view Video Generation with LLM | Apr 16, 2025 | Large Language ModelText-to-Video Generation | —Unverified | 0 |
| Generative Recommendation with Continuous-Token Diffusion | Apr 16, 2025 | DenoisingLanguage Modeling | —Unverified | 0 |
| AnomalyR1: A GRPO-based End-to-end MLLM for Industrial Anomaly Detection | Apr 16, 2025 | Anomaly DetectionLarge Language Model | CodeCode Available | 1 |
| Trusting CHATGPT: how minor tweaks in the prompts lead to major differences in sentiment classification | Apr 16, 2025 | Large Language ModelSentiment Analysis | —Unverified | 0 |