애스크로AIPublic Preview
← 학술논문 검색
학술논문정보보호학회논문지2025.12 발행

A Study on Privacy-Preserving Synthetic Text Generation via Transformer-Based GANs

A Study on Privacy-Preserving Synthetic Text Generation via Transformer-Based GANs

우타리예바 아쎔(부산대학교); 박혜경(부산대학교); 최윤호(부산대학교)

35권 6호, 1541~1554쪽

초록

Controlled generation of high-quality synthetic data is essential in the modern data-driven world, particularly when addressing human-centered privacy concerns. In this study, we propose a privacy-preserving synthetic text generation framework based on transformer-based generative adversarial networks (GANs). The generator produces fluent, domain-aligned text guided by structured prompts that incorporate human-defined privacy preferences. A multi-task discriminator evaluates each generated sample in three ways: realism, domain appropriateness, and presence of sensitive information. To further enhance the generation process, we introduce a non-parametric feedback loop that iteratively refines the input prompt based on discriminator feedback. Experimental results demonstrate that our method achieves high text quality and strong privacy preservation, enabling on-demand generation of synthetic datasets suitable for fine-tuning large language models in privacy-sensitive domains

Abstract

Controlled generation of high-quality synthetic data is essential in the modern data-driven world, particularly when addressing human-centered privacy concerns. In this study, we propose a privacy-preserving synthetic text generation framework based on transformer-based generative adversarial networks (GANs). The generator produces fluent, domain-aligned text guided by structured prompts that incorporate human-defined privacy preferences. A multi-task discriminator evaluates each generated sample in three ways: realism, domain appropriateness, and presence of sensitive information. To further enhance the generation process, we introduce a non-parametric feedback loop that iteratively refines the input prompt based on discriminator feedback. Experimental results demonstrate that our method achieves high text quality and strong privacy preservation, enabling on-demand generation of synthetic datasets suitable for fine-tuning large language models in privacy-sensitive domains

발행기관:
한국정보보호학회
DOI:
http://dx.doi.org/10.13089/JKIISC.2025.35.6.1541
분류:
컴퓨터학

AI 법률 상담

이 논문의 주제에 대해 더 알고 싶으신가요?

460만+ 법률 자료에서 관련 판례·법령·해석례를 찾아 답변합니다

AI 상담 시작
A Study on Privacy-Preserving Synthetic Text Generation via Transformer-Based GANs | 정보보호학회논문지 2025 | AskLaw | 애스크로 AI