Academic paper · 한국부패학회보 (Korean Corruption Studies Review) · Published December 2025

개인정보가 포함된 인공지능 학습데이터의 처리 근거 - 공개된 개인정보, 이미 보유한 이용자 개인정보를 중심으로 -

Legal Grounds for Processing Artificial Intelligence Training Data Containing Personal Information - Focusing on Publicly Available Personal Information and User Personal Information Already Held -

임용현 (Personal Information Protection Commission)

Vol. 30, No. 4, pp. 197-222

Abstract

Since the release of ChatGPT in 2022, generative artificial intelligence has rapidly changed industry and daily life. Because model performance depends on the quality and scale of training data, companies are building large-scale training datasets from publicly available personal information and from personal information of their users that they already hold. It remains unclear, however, whether processing training data that contains personal information is lawful under the Personal Information Protection Act, and this legal uncertainty persists. This paper therefore examines the concept, characteristics, and privacy risks of artificial intelligence training, then reviews the interpretive approaches to processing such training data, focusing on publicly available personal information and user personal information already held, and analyzes their limitations. To balance personal information protection with the development of the artificial intelligence industry, it is urgent to clarify the lawful basis for processing training data that contains personal information and to establish legal standards that guarantee data subjects' rights. Doing so would resolve legal uncertainty for businesses, protect the right to informational self-determination, and support trust-based development of the artificial intelligence industry.

Publisher: 한국부패학회 (Korean Association for Corruption Studies)
DOI: http://dx.doi.org/10.52663/kcsr.2025.30.4.197
Classification: Public Administration
