애스크로AIPublic Preview
← 학술논문 검색
학술논문Journal of Information and Communication Convergence Engineering2021.12 발행KCI 피인용 1

Generating and Validating Synthetic Training Data for Predicting Bankruptcy of Individual Businesses

Generating and Validating Synthetic Training Data for Predicting Bankruptcy of Individual Businesses

홍동숙(한국신용정보원); 백철(한국신용정보원)

19권 4호, 228~233쪽

초록

In this study, we analyze the credit information (loan, delinquency information, etc.) of individual business owners to generatevoluminous training data to establish a bankruptcy prediction model through a partial synthetic training technique. Furthermore,we evaluate the prediction performance of the newly generated data compared to the actual data. When using conditional tabulargenerative adversarial networks (CTGAN)-based training data generated by the experimental results (a logistic regression task),the recall is improved by 1.75 times compared to that obtained using the actual data. The probability that both the actual andgenerated data are sampled over an identical distribution is verified to be much higher than 80%. Providing artificial intelligencetraining data through data synthesis in the fields of credit rating and default risk prediction of individual businesses, which havenot been relatively active in research, promotes further in-depth research efforts focused on utilizing such methods

Abstract

In this study, we analyze the credit information (loan, delinquency information, etc.) of individual business owners to generatevoluminous training data to establish a bankruptcy prediction model through a partial synthetic training technique. Furthermore,we evaluate the prediction performance of the newly generated data compared to the actual data. When using conditional tabulargenerative adversarial networks (CTGAN)-based training data generated by the experimental results (a logistic regression task),the recall is improved by 1.75 times compared to that obtained using the actual data. The probability that both the actual andgenerated data are sampled over an identical distribution is verified to be much higher than 80%. Providing artificial intelligencetraining data through data synthesis in the fields of credit rating and default risk prediction of individual businesses, which havenot been relatively active in research, promotes further in-depth research efforts focused on utilizing such methods

발행기관:
한국정보통신학회
DOI:
http://dx.doi.org/10.6109/jicce.2021.19.4.228
분류:
전자/정보통신공학

AI 법률 상담

이 논문의 주제에 대해 더 알고 싶으신가요?

460만+ 법률 자료에서 관련 판례·법령·해석례를 찾아 답변합니다

AI 상담 시작
Generating and Validating Synthetic Training Data for Predicting Bankruptcy of Individual Businesses | Journal of Information and Communication Convergence Engineering 2021 | AskLaw | 애스크로 AI