학술논문전자공학회논문지2021.09 발행KCI 피인용 3

특허문서 자동분류를 위한 딥러닝 개별 모델 분류기와 앙상블 분류기의 성능비교

Performance Comparison of Deep Learning Individual Model Classifier and Ensemble Classifier for Automatic Classification of Patent Documents

김성훈(한성대학교 스마트융합컨설팅); 김승천(한성대학교 스마트융합컨설팅학과)

58권 9호, 34~41쪽

AI이 논문 주제로 AI 상담 원문 보기 (KCI)

초록

기술 혁신의 급속한 증가와 이에 따른 출원의 증가로 인해 특허문서 자동분류기는 특허를 분류할 때 개별 발명가와 변리사 모두에게 매우 유용하다. 본 연구에서는 특허전문가의 관점과 유사한 관점을 통해 특허문서 분류를 수행하는 모델을 선택하였다. 주요 키워드의 존재 여부 및 존재 빈도를 나타낼 수 있는 모델인 MLP, 근접거리에 에 존재하는 키워드들의 존재 및 존재 빈도를 나타낼 수 있는 모델인 CNN 그리고 문장의 구성, 키워드의 순서를 나타낼 수 있는 모델인 LSTM, Attention, Transformer을 선택하여 최대한 특허전문가의 분류관점과 유사한 관점으로 분류할 수 있는 모델을 선택하였다. 본 연구에서는 3가지 앙상블 방법을 사용한다. 앙상블 방법은 각 분류기의 결과를 독립적으로 처리하는 배깅[8]을 사용하여 각각 voting, summation 그리고 weighting 의 3가지 방법을 사용하여 각각의 성능을 측정하였다. 본 연구에서 사용된 3개의 데이터셋중에 2개의 데이터셋(데이터셋 #2, 데이터셋 #3)은 앙상블의 정확도가 높았고 1개의 데이터셋 (데이터셋 #1)은 개별모델인 Attention 모델의 정확도가 높았다. 이러한 결과는 각 데이터셋의 분류 관점(키워드에 의한 분류, 문장 구성에 의한 분류)에 기인한 것으로 판단할 수 있다. 사용자 특허분류에 따른 모델의 생성이 아니라 기존 모델을 활용한 앙상블 모델을 사용한다면 데이터셋별 최적의 정확도를 내거나 최적의 정확도를 내는 단일모델에 근접한 정확도를 낼 수 있을 것으로 예상되며 이러한 앙상블 모델 방식이 특허문서 분류기를 상품화 하는데 적합할 것이다.

Abstract

Due to the rapid increase in technological innovation and the corresponding increase in applications, the automatic patent document classifier is very useful for both individual inventors and patent attorneys when classifying patents .In this study, a model for classifying patent documents was selected from a viewpoint similar to that of a patent expert.MLP, which is a model that can indicate the existence and frequency of existence of major keywords, CNN, which is a model that can indicate the existence and frequency of existence of keywords in close proximity, and LSTM, which is a model that can indicate the structure of sentences and order of keyword , Attention, and Transformer were selected to select a model that can be classified as similar to that of patent experts as much as possible. In this study, three ensemble methods are used. The ensemble method used bagging [8], which independently processes the results of each classifier, and measured each performance using three methods: voting, summation, and weighting. Among the three datasets used in this study, two datasets (Dataset #2, Dataset #3) had high ensemble accuracy, and one dataset (Dataset #1) had high accuracy of the attention model, an individual model. was high. This result can be judged to be due to the classification viewpoint (classification by keyword, classification by sentence structure) of each dataset If an ensemble model using an existing model is used instead of the generation of a model according to the user patent classification, it is expected that the optimal accuracy for each data set or accuracy close to that of a single model with optimal accuracy can be achieved. It would be suitable for commercializing a document classifier.

발행기관:: 대한전자공학회
분류:: 전자/정보통신공학

AI 법률 상담

이 논문의 주제에 대해 더 알고 싶으신가요?

460만+ 법률 자료에서 관련 판례·법령·해석례를 찾아 답변합니다

AI 상담 시작