| Should I Trust You? Detecting Deception in Negotiations using Counterfactual RL | Feb 18, 2025 | counterfactualDeception Detection | —Unverified | 0 |
| OCCULT: Evaluating Large Language Models for Offensive Cyber Operation Capabilities | Feb 18, 2025 | Large Language ModelMultiple-choice | —Unverified | 0 |
| MCTS-Judge: Test-Time Scaling in LLM-as-a-Judge for Code Correctness Evaluation | Feb 18, 2025 | global-optimizationLarge Language Model | —Unverified | 0 |
| Towards an automated workflow in materials science for combining multi-modal simulative and experimental information using data mining and large language models | Feb 18, 2025 | Information RetrievalLarge Language Model | —Unverified | 0 |
| Investigating and Extending Homans' Social Exchange Theory with Large Language Model based Agents | Feb 18, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| MSE-Adapter: A Lightweight Plugin Endowing LLMs with the Capability to Perform Multimodal Sentiment Analysis and Emotion Recognition | Feb 18, 2025 | Emotion RecognitionLarge Language Model | CodeCode Available | 1 |
| Benchmarking Automatic Speech Recognition coupled LLM Modules for Medical Diagnostics | Feb 18, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| STEER-ME: Assessing the Microeconomic Reasoning of Large Language Models | Feb 18, 2025 | BenchmarkingLarge Language Model | —Unverified | 0 |
| G-Refer: Graph Retrieval-Augmented Large Language Model for Explainable Recommendation | Feb 18, 2025 | Collaborative FilteringExplainable Recommendation | CodeCode Available | 1 |
| Towards Text-Image Interleaved Retrieval | Feb 18, 2025 | Information RetrievalLanguage Modeling | CodeCode Available | 1 |