차량 내부 소음 예측을 위한 멀티 모달 자기 지도 학습 네트워크
Multimodal Self-Supervised Learning Networks for In-vehicle Noise Prediction
고은성(고려대학교); 고은지(고려대학교 산업경영공학과); 남규환(현대모비스); 김상욱(현대모비스); 박경환(현대모비스); 김성범(고려대학교)
39권 4호, 35~46쪽
초록
Predicting in-vehicle noise levels is an important issue in automobile industry. In most previous studies, various supervised learning methods that use both the input and output(labeled) variables are used to predict automobile noise levels. However, collecting labeled data for in-vehicle noise prediction is time consuming and expensive. In this study, we propose a multimodal self-supervised learning framework that can predict in-vehicle noise levels with only a small amount of labeled data, so that resources required to collect labeled data can be saved. In our framework, both original acceleration signals and spectrograms converted from the original data are used as the input to predict in-vehicle noise levels. In the first stage, we pretrain the features of the input data based on the relationship between the signal and spectrogram data using only unlabeled data, which is much easier to collect than labeled data. In the second stage, we use a small amount of data to construct the in-vehicle noise prediction model with the pretrained feature extractor. The effectiveness and applicability of the proposed framework are demonstrated using the actual acceleration signal data collected from various locations of electric power steering vehicle noise levels. The proposed framework outperforms the existing supervised learning method especially when a few labeled data are available.
Abstract
Predicting in-vehicle noise levels is an important issue in automobile industry. In most previous studies, various supervised learning methods that use both the input and output(labeled) variables are used to predict automobile noise levels. However, collecting labeled data for in-vehicle noise prediction is time consuming and expensive. In this study, we propose a multimodal self-supervised learning framework that can predict in-vehicle noise levels with only a small amount of labeled data, so that resources required to collect labeled data can be saved. In our framework, both original acceleration signals and spectrograms converted from the original data are used as the input to predict in-vehicle noise levels. In the first stage, we pretrain the features of the input data based on the relationship between the signal and spectrogram data using only unlabeled data, which is much easier to collect than labeled data. In the second stage, we use a small amount of data to construct the in-vehicle noise prediction model with the pretrained feature extractor. The effectiveness and applicability of the proposed framework are demonstrated using the actual acceleration signal data collected from various locations of electric power steering vehicle noise levels. The proposed framework outperforms the existing supervised learning method especially when a few labeled data are available.
- 발행기관:
- 한국경영과학회
- 분류:
- 경영학