애스크로AIPublic Preview
← 학술논문 검색
학술논문인공지능연구 논문지2025.12 발행

Enhancing Korean–Chinese Legal Translation in Low-Resource Scenarios Using Back Translation and Transfer Learning

Enhancing Korean–Chinese Legal Translation in Low-Resource Scenarios Using Back Translation and Transfer Learning

장아남(영산대학교 컴퓨터정보공학과); 소길자(영산대학교)

6권 3호, 132~144쪽

초록

Legal translation between Korean and Chinese faces significant challenges due to complex legal terminology, distinct linguistic structures, and the scarcity of high-quality bilingual corpora. This study proposes an approach to improve neural legal translation in low-resource scenarios by integrating back translation-based data augmentation with transfer learning. Specifically, the multilingual pre-trained mBART model is fine-tuned in two stages: initial fine-tuning with authentic Korean–Chinese legal parallel data, followed by enhanced fine-tuning using pseudo-parallel data generated through back translation and enriched with legal terminology annotations. Experiments on domain-specific datasets demonstrate substantial improvements over baseline Transformer and fine-tuned mBART models, achieving a BLEU score of 34.5 and a TER of 0.42. Human evaluation by bilingual legal experts further confirms enhanced fluency, adequacy, and legal consistency. This work not only advances Korean–Chinese legal neural machine translation in low-resource contexts but also discusses legal implications, including accountability, compliance, and the potential of blockchain for translation traceability. The proposed framework provides a practical foundation for developing reliable AI-assisted legal translation systems.

Abstract

Legal translation between Korean and Chinese faces significant challenges due to complex legal terminology, distinct linguistic structures, and the scarcity of high-quality bilingual corpora. This study proposes an approach to improve neural legal translation in low-resource scenarios by integrating back translation-based data augmentation with transfer learning. Specifically, the multilingual pre-trained mBART model is fine-tuned in two stages: initial fine-tuning with authentic Korean–Chinese legal parallel data, followed by enhanced fine-tuning using pseudo-parallel data generated through back translation and enriched with legal terminology annotations. Experiments on domain-specific datasets demonstrate substantial improvements over baseline Transformer and fine-tuned mBART models, achieving a BLEU score of 34.5 and a TER of 0.42. Human evaluation by bilingual legal experts further confirms enhanced fluency, adequacy, and legal consistency. This work not only advances Korean–Chinese legal neural machine translation in low-resource contexts but also discusses legal implications, including accountability, compliance, and the potential of blockchain for translation traceability. The proposed framework provides a practical foundation for developing reliable AI-assisted legal translation systems.

발행기관:
한국인공지능교육학회
분류:
교육학

AI 법률 상담

이 논문의 주제에 대해 더 알고 싶으신가요?

460만+ 법률 자료에서 관련 판례·법령·해석례를 찾아 답변합니다

AI 상담 시작
Enhancing Korean–Chinese Legal Translation in Low-Resource Scenarios Using Back Translation and Transfer Learning | 인공지능연구 논문지 2025 | AskLaw | 애스크로 AI