학술논문인권과 정의2020.12 발행KCI 피인용 19

인공지능 학습용 데이터세트에 대한 저작권법과 부정경쟁방지법상 보호와 그 한계

How to Protect Data Sets to Train Artificial Intelligence and to Make Limitations toTheir Protection under Copyright and Unfair Competition Laws

이규호(중앙대학교)

494호, 90~112쪽

AI이 논문 주제로 AI 상담 원문 보기 (KCI)

초록

학습가능한 양질의 데이터세트(학습용 데이터세트)는 인공지능 기술의 경쟁력을 좌우한다. 즉, 인공지능의 성능은 인공지능 프로그램의 정확성뿐만 아니라 학습용 데이터세트의 내용에 크게 의존한다. 인공지능산업에서 인공지능 학습용 데이터세트는 인공지능 학습용 데이터세트 생성단계, 인공지능 학습용 모델 작성단계, 인공지능 학습 완료 모델 이용단계 등에서 매우 중요한 역할을 수행한다. 인공지능 학습용 데이터세트를 제작하는 자는 우선 원시 데이터로부터 추출 및/또는 복제하는 과정에서 저작권침해를 하지 말아야 하는데, 이 단계에서는 저작권법상 저작재산권 제한사유의 확대가 인공지능 학습용 데이터세트를 제작하는 자에게 유리하게 작용할 수 있다. 적법하게 인공지능 학습용 데이터세트를 제작한 자의 권리 보호도 매우 중요한데, 이와 관련하여 저작권법 및 부정경쟁방지법의 적용을 고려할 수 있다. 이 쟁점들과 관련하여 이 논문에서 인공지능 학습용 데이터세트와 관련하여 인공지능을 학습시키기 위한 텍스트와 데이터 마이닝에 대한 저작권법적 보호 및 그 제한사유를 살펴본다. 이 쟁점과 관련하여 외국의 입법례를 고찰한다. 그런 다음, 인공지능 학습용 데이터세트와 관련하여 부정경쟁방지법(영업비밀보호법제 포함)상 쟁점(특히, 일본에서 논의되는 한정제공 데이터에 대한 부정경쟁방지법상 보호방안을 중심으로)을 고찰한다. 이 단계에서는 학습용 데이터세트 자체가 부정경쟁방지법상 보호가능한지 여부를 중심으로 살펴본다. 인공지능 학습용 데이터세트는 인공지능을 학습시켜 인공지능 학습 완료 모델을 만들어가는 중간단계이므로 실제적으로는 타인의 저작권침해에 대하여 입증 곤란, 데이터세트의 전 세계적인 이동성 등을 참작하면 저작권침해의 소를 제기하는 것이 수월하지 않을 수 있다. 이는 부정경쟁방지법 제2조 제1호 카목상 타인성과도용행위를 적용하는 경우에도 마찬가지의 문제가 발생하리라 생각한다. 그렇다면 인공지능 학습용 데이터세트 생성장치, 인공지능 학습용 데이터세트 생성방법, 인공지능 학습용 데이터세트 구조가 저장된 기록매체에 대한 발명을 특허출원하는 것도 고려할 만하다. 이 논문에서는 이 쟁점과 관련하여 건설적인 제언을 제시한다.

Abstract

The competitiveness of AI technology is dependant upon good quality of trainable data sets. In other words, the quality of AI heavily relies on the acuteness of AI program and the contents of data sets to train AI. In AI-related industry, data sets to train AI plays a very important role in several stages such as creation of data sets to train AI, creation of complete model trained by AI and usage of AI-trained model. A producer of data sets to train AI should not infringe another’s copyright when he/she extracts and/or reproduce raw data. In this stage, text and data mining exceptions to copyright protection will be in favor of the producer of the data sets. In this regard, the right of the producer of the data sets needs to be protected under copyright and/or unfair competition laws. In this context, this Paper explores copyright protection and its limitations in terms of text and/or data mining in connection with data sets to train AI. Then, the Paper discuss unfair competition law issues, in particular, in terms of data offered only for specified persons under the Japanese unfair competition law. In concluding, this Article proposes how to efficiently protect the producer of data sets to train AI and to harmonize the interests and rights between the producer of date sets and the right holders of raw data.

발행기관:: 대한변호사협회
DOI:: http://dx.doi.org/10.22999/hraj..494.202012.005
분류:: 법학

AI 법률 상담

이 논문의 주제에 대해 더 알고 싶으신가요?

460만+ 법률 자료에서 관련 판례·법령·해석례를 찾아 답변합니다

AI 상담 시작