The ACM Multimedia 2022 Computational Paralinguistics Challenge: Vocalisations, Stuttering, Activity, Mosquitoes
Schuller B., Batliner A., Amiriparian S., Bergler C., Gerczuk M., Holz N., Larrouy-Maestri P., Bayerl S., Riedhammer K., Mallol-Ragolta A., Pateraki M., Coppock H., Kiskin I., Sinka M., Roberts S.
The ACM Multimedia 2022 Computational Paralinguistics Challenge addresses four different problems for the first time in a research competition under well-defined conditions: in the Vocalisations and Stuttering Sub-Challenges, human non-verbal vocalisations and speech have to be classified; the Activity Sub-Challenge aims at beyond-audio human activity recognition from smartwatch sensor data; and in the Mosquitoes Sub-Challenge, mosquitoes need to be detected. We describe the Sub-Challenges, baseline feature extraction, and classifiers based on the 'usual' ComParE and BoAW features, the auDeep toolkit, and deep feature extraction from pre-trained CNNs using the DeepSpectrum toolkit; in addition, we add end-to-end sequential modelling and a log-mel-128-BNN.