| CFBenchmark-MM: Chinese Financial Assistant Benchmark for Multimodal Large Language Model | Jun 16, 2025 | Decision Making, Financial Analysis | Unverified | 0 |
| EDINET-Bench: Evaluating LLMs on Complex Financial Tasks using Japanese Financial Statements | Jun 10, 2025 | Binary Classification, Financial Analysis | Code Available | 1 |
| QuantMCP: Grounding Large Language Models in Verifiable Financial Reality | Jun 7, 2025 | Decision Making, Financial Analysis | Unverified | 0 |
| How Explanations Leak the Decision Logic: Stealing Graph Neural Networks via Explanation Alignment | Jun 3, 2025 | Data Augmentation, Drug Discovery | Code Available | 0 |
| VISTA: Vision-Language Inference for Training-Free Stock Time-Series Analysis | May 24, 2025 | Financial Analysis, Stock Price Prediction | Unverified | 0 |
| Towards Competent AI for Fundamental Analysis in Finance: A Benchmark Dataset and Evaluation | May 22, 2025 | Financial Analysis, Logical Reasoning | Unverified | 0 |
| A Survey of Attacks on Large Language Models | May 18, 2025 | Autonomous Driving, Financial Analysis | Unverified | 0 |
| Non-Stationary Time Series Forecasting Based on Fourier Analysis and Cross Attention Mechanism | May 11, 2025 | Financial Analysis, Time Series | Code Available | 1 |
| MiMIC: Multi-Modal Indian Earnings Calls Dataset to Predict Stock Prices | Apr 12, 2025 | Financial Analysis | Code Available | 0 |
| SECQUE: A Benchmark for Evaluating Real-World Financial Analysis Capabilities | Apr 6, 2025 | Financial Analysis | Unverified | 0 |