Calibrated SVM for Probabilistic Classification of In-Vehicle Voices into Vehicle Commands via Voice-to-Text LLM Transformation
Mobina Moeini, Rouhollah Ahmadian, Mehdi Ghatee
Code Available — Be the first to reproduce this paper.
ReproduceCode
Abstract
With the rapid advancement of artificial intelligence technologies in human life, particularly within the automotive industry, the popularity of smart vehicles has increased. Designing an intelligent car cabin capable of seamless interactions with both the driver and the vehicle emerges as a crucial solution to address human-vehicle interaction challenges. This project aims to implement a digital voice assistant that recognizes vehicle commands. It utilizes three main techniques: speech-to-text conversion, text classification, and calibrating the classifier in order to detect out-of-distribution (OOD) sentences. Using Vosk, an LLM model, voices in the vehicle environment are converted into text format. Then, after pre-processing the text, an SVM classifies them. Using Platt scaling the SVM classifier outputs become calibrated, which makes them probabilistic. The experimental result shows that the proposed model achieves command recognition with 96.43% accuracy, 96.83% precision, 96.43% recall, 96.39% F1-score, and 0.1437 cross-entropy loss. Furthermore, the optimal threshold for OOD sentences is 0.4.