Development of a Named Entity Recognition-based Text Masking System for Preventing Personal Information Exposure in Shipping Labels
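김하은 (Korea University); 김수용 (Seoul National University); 김명섭 (Korea University); 김상호 (Hankuk University of Foreign Studies); 조예진 (Dongguk University); 최은선 (Korea University); 유길상 (Korea University)
Vol. 22, No. 11, pp. 181-196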
Abstract
With the widespread use of social media and video platforms, sensitive personal information, such as the details printed on shipping labels, is frequently and unintentionally exposed in shared images and videos. Masking entire shipping labels is not only time-consuming and labor-intensive but can also distort the video. This paper proposes a system that combines named entity recognition and text recognition to automatically detect and mask personal information within shipping labels. In experiments, the system achieved an accuracy of 81.2% in recognizing and masking personal information. Because it masks only the relevant personal information, the proposed approach minimizes video distortion and can be applied to social media platforms and video production environments where the risk of exposing sensitive information is high.
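The selective-masking idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the regular expressions below are a hypothetical stand-in for a trained named entity recognition model, the input string stands in for text already extracted by a text recognition step, and masking is applied to characters rather than to image regions. All names (`Entity`, `PATTERNS`, `find_entities`, `mask_text`) are assumptions made for illustration.

```python
import re
from dataclasses import dataclass


@dataclass
class Entity:
    label: str  # entity type, e.g. "NAME" or "PHONE"
    start: int  # start offset in the recognized text
    end: int    # end offset (exclusive)


# Assumption: simple patterns stand in for a trained NER model.
PATTERNS = {
    "PHONE": re.compile(r"\b\d{2,3}-\d{3,4}-\d{4}\b"),
    "NAME": re.compile(r"(?<=Recipient: )[A-Z][a-z]+(?: [A-Z][a-z]+)*"),
}


def find_entities(text: str) -> list[Entity]:
    """Return personal-information spans found in the recognized text."""
    entities = []
    for label, pattern in PATTERNS.items():
        for match in pattern.finditer(text):
            entities.append(Entity(label, match.start(), match.end()))
    return sorted(entities, key=lambda e: e.start)


def mask_text(text: str, entities: list[Entity], mask_char: str = "*") -> str:
    """Mask only the detected spans, leaving the rest of the label readable."""
    chars = list(text)
    for entity in entities:
        for i in range(entity.start, entity.end):
            if not chars[i].isspace():
                chars[i] = mask_char
    return "".join(chars)


# Hypothetical text as it might come out of a text recognition step.
label_text = "Recipient: Jane Doe\nPhone: 010-1234-5678\nItem: Books"
masked = mask_text(label_text, find_entities(label_text))
print(masked)
```

In an image or video setting, each detected span would presumably be mapped back to the bounding box of the recognized text and only that region blurred or covered, which is what keeps the rest of the frame undistorted.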
- Publisher: Korean Institute of Information Technology (한국정보기술학회)
- Classification: Other general engineering