학술논문시민사회와 NGO2025.05 발행

AI 학습데이터와 개인정보 권리의 경계 : 에이닷(A.) 사례를 통해 본 통제와 거버넌스의 과제

The Boundary Between AI Training Data and Personal Information Rights: Governance and Control Challenges in the Case of “A.”, an AI Assistant by SK Telecom

유성희(서울시 어르신돌봄종사자종합지원센터); 서효중(카톨릭대학교)

23권 1호, 195~245쪽

AI이 논문 주제로 AI 상담 원문 보기 (KCI)

초록

생성형 인공지능(Generative AI)의 확산은 방대한 양의 고품질 학습 데이터를 필수 자원으로 만들고 있으며, 이에 따라 개인정보 수집·활용을 둘러싼 법적·윤리적 문제는 더욱 복잡하고 구조화된 형태로 전개되고 있다. 특히 통신 기반 AI 서비스는 실시간으로 민감 데이터를 수집할 수 있는 구조를 갖추었으나, 이러한 데이터가 인공지능 학습에 활용되는 과정에서 그 정당성과 책임 구조는 여전히 불투명하다. 본 연구는 SK텔레콤의 AI 비서 서비스 ‘에이닷(A.)’ 사례를 통해 통화 데이터, 대화 기록, 제3자 연동 정보 등 고위험 개인정보 활용 방식과 이에 따른 사용자 통제권 침해, 제3자 권리 미보장, 알고리즘 불투명성의 문제를 분석하였다. 특히 통화 데이터의 처리 과정은 개인정보보호법(PIPA)상 목적 제한성 원칙 및 사전 동의 체계와 충돌하며, 기존 법제도가 AI 학습 데이터의 복합성과 재사용 가능성을 충분히 포섭하지 못하고 있음을 실증적으로 드러낸다. 또한 Stack Overflow 사례를 통해, 공개된 데이터라도 정보 주체의 권리 고지, 활용 목적의 명확성, 저작권 보호 등 최소한의 규범 요건이 충족되지 않으면 법적·윤리적 위반으로 전환될 수 있음을 밝혔다. 이러한 분석을 바탕으로 GNU GPL 라이선스의 핵심 원칙 - ‘공개’, ‘책임 공유’, ‘권리 연속성’ - 을 AI 데이터 거버넌스 구조에 적용할 수 있는 가능성을 탐색하였다. 결론적으로 본 연구는 기술·법·윤리 통합적 관점에서 새로운 데이터 규범 설계 필요성을 제시하며, AI 생태계의 투명성과 책임성, 디지털 시민사회의 정보주권 강화를 위한 기반을 제공하고자 한다.

Abstract

The rapid expansion of generative AI has significantly increased the demand for large-scale, high-quality training data, raising critical legal and ethical concerns regarding the use of personal information. Telecommunication-based AI services, such as SK Telecom’s “A.” assistant, have structural access to sensitive data, including call logs and voice content. This study examines how such data is utilized for AI training, highlighting challenges related to user control, third-party rights, and algorithmic transparency. Through a case analysis of A., and a comparison with the Stack Overflow incident, this study highlights how even publicly available datasets can cause harm when proper consent, attribution, and legal compliance are absent. Existing legal frameworks, such as Korea’s Personal Information Protection Act (PIPA), are found to be inadequate in addressing AI-specific risks, particularly concerning high-risk data types. As a normative response, this paper explores the applicability of governance principles derived from the GNU General Public License (GPL), including openness, shared responsibility, and continuity of rights. The findings indicate a need for hybrid governance models that integrate legal, technical, and ethical mechanisms to ensure transparency, accountability, and data sovereignty in the era of artificial intelligence(AI).

발행기관:: 제3섹터연구소
DOI:: http://dx.doi.org/10.35981/ngo.2025.23.01.195
분류:: 기타사회과학일반

AI 법률 상담

이 논문의 주제에 대해 더 알고 싶으신가요?

460만+ 법률 자료에서 관련 판례·법령·해석례를 찾아 답변합니다

AI 상담 시작