Speech Onset Detection Using sEEG | CPSC 554X Course Project
The goal of this course project was to develop a deep learning model for detecting speech onset, providing potential applications for brain-computer interfaces and assisting individuals with speaking disabilities.
We used a public dataset of stereotactic electroencephalography (sEEG) signals and the corresponding audio from 10 patients speaking prompted Dutch words. (Link: https://www.nature.com/articles/s41597-022-01542-9)
For feature extraction, we processed the audio and manually labeled the timestamps for each speech onset (when the patient would begin speaking).

We also extracted frequency-based features from the sEEG signals and clustered channels using spacial similarity via Bisecting K-Means.
We developed a CNN-LSTM-based model which achieved an AUC of 0.89, outperforming classical ML models such and Random Forest.
