参考文献/References:
[1] MESAROS A,HEITTOLA T,BENETOS E,et al.Detection and classification of acoustic scenes and events: Outcome of the DCASE 2016 challenge[J].IEEE/ACM Transactions on Audio Speech and Language Processing,2017,26(2):379-393.DOI:10.1109/TASLP.2017.2778423.
[2] VIRTANEN T,MESAROS A,HEITTOLA T,et al.Proceedings of the detection and classification of acoustic scenes and events 2017 workshop(DCASE2017)[R/OL].(2017-09-15)[2019-12-10] .https://www.researchgate.net/publication/320409431_Deep_Sequential_Image_Features_on_Acoustic_Scene_Classification.
[3] PICZAK K J.Environmental sound classification with convolutional neural networks[C]//IEEE 25th International Workshop on Machine Learning for Signal Processing.Boston: IEEE Press,2015:1-6.DOI:10.1109/MLSP.2015.7324337.
[4] MESAROS A,HEITTOLA T,ERONEN A,et al.Acoustic event detection in real life recordings[C]//18th European Signal Processing Conference.Aalborg: IEEE Press,2010:1267-1271.
[5] YUN S,KIM S,MOOM S,et al.Discriminative training of GMM parameters for audio scene classification[R/OL].(2016-03-02)[2019-12-10] .https://www.aminer.cn/pub/5f44d5c49e795ee83b76546b/discriminative-training-of-gmm-parameters-for-audio-scene-classification-and-audio-tagging.
[6] RAKOTOMAMONJY A,GASSO G.Histogram of gradients of time-frequency representations for audio scene detection[J].IEEE/ACM Transactions on Audio Speech and Language Processing,2015,23(1):142-153.DOI:10.1109/TASLP.2014.2375575.
[7] CAI Rui,LU Lie,ZHANG Hongjiang,et al.A flexible framework for key audio effects detection and auditory context inference[J].IEEE Transactions on Audio, Speech, and Language Processing,2006,14(3):1026-1039.DOI:10.1109/TSA.2005.857575.
[8] HEITTOLA T,MESAROS A,ERONEN A,et al.Context-dependent sound event detection[J].EURASIP Journal on Audio, Speech, and Music Processing,2013(1):1-13.DOI:10.1186/1687-4722-2013-1.
[9] MESAROS A,DIMENT A,ELIZALDE B,et al.Sound event detection in the DCASE 2017 challenge[J].IEEE/ACM Transactions on Audio Speech and Language Processing,2019,27(6):992-1006.DOI:10.1109/TASLP.2019.2907016.
[10] IMOTO K,KYOCHI S.Sound event detection using graph Laplacian regularization based on event co-occurrence[C]//IEEE International Conference on Acoustics, Speech and Signal Processing.Brighton: IEEE Press,2019:1-5.DOI:10.1109/ICASSP.2019.8683708.
[11] CHAUDHURI S,RAJ B.Unsupervised hierarchical structure induction for deeper semantic analysis of audio[C]//IEEE International Conference on Acoustics, Speech and Signal Processing.Vancouver: IEEE Press,2013:833-837.DOI:10.1109/ICASSP.2013.6637765.
[12] TONAMI N,IMOTO K,NITTSUMA M,et al.Joint analysis of acoustic events and scenes based on multitask learning[C]//IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.New York:IEEE Press,2019:338-342.DOI:10.1109/WASPAA.2019.8937196.
[13] BEAR H L,NOLASCO I,BENETOS E.Towards joint sound scene and polyphonic sound event recognition[C]//Interspeech.Graz: [s.n.],2019:4594-4598.DOI:10.21437/Interspeech.2019-2169.
[14] WANG Wei,SERAJ F,MERATNIA N,et al.Privacy-aware environmental sound classification for indoor human activity recognition[C]//Proceedings of the 12th ACM International Conference on Pervasive Technologies Related to Assistive Environments.New York: Association for Computing Machinery,2019:36-44.DOI:10.1145/3316782.3321521.
[15] SHOLOKHOV A,SAHIDULLAH M,KINNUNEN T.Semi-supervised speech activity detection with an application to automatic speaker verification[J].Computer Speech and Language,2018,47(1):132-156.DOI:10.1016/j.csl.2017.07.005.
[16] HORI Y,ANDO T,FUKUDA A.Personal identification methods using footsteps of one step[C]//International Conference on Artificial Intelligence in Information and Communication.Tianjin: [s.n.],2020:73-78.DOI:10.1109/ICAIIC48513.2020.9065230.
[17] CHU S,NARAYANAN S,KUO C C J.Environmental sound recognition with time-frequency audio features[J].IEEE Transactions on Audio, Speech, and Language Processing,2009,17(6):1142-1158.DOI:10.1109/TASL.2009.2017438.
[18] WARCAKESTUDIOS.Fast steps on wet stones: Recorded with a T-Bonemicro[EB/OL].(2013-01-10)[2019-12-17] .https://freesound.org/people/WarcakeStudios/sounds/173596/.
[19] INSPECTORJ.Raw audio of running on thin, cracked ice on top of a gravel driveway with trainer shoes[EB/OL].(2018-01-30)[2019-12-17] .https://freesound.org/people/InspectorJ/sounds/416967/.
[20] AUDIONINJA001.Recorded with zoom h5 and sennheizer mk600[EB/OL].(2018-12-27)[2019-12-17] .https://freesound.org/people/audioninja001/packs/25644/.