애스크로AIPublic Preview
← 학술논문 검색
학술논문Journal of Information Technology Applications & Management2025.12 발행

계층형 이미지-텍스트 멀티모달 모델을 활용한 국내 스타트업-해외 바이어간 B2B 추천 성능 향상 연구

A Study on Improving B2B Recommendation Performance between Korean Startups and Overseas Buyers using a Hierarchical Image-Text Multimodal Model

오소진(한남대학교); 김재경(한남대학교)

32권 6호, 107~117쪽

초록

This study aims to solve the 'semantic interference' problem that occurs in B2B recommender systems when processing over 17,000 product categories within a single embedding space. To address this, we propose a two-stage Hierarchical Multimodal Recommendation Model (Hierarchical Model). The proposed model first predicts one of 238 mid-level categories in the first stage, and then performs detailed item recommendation on a refined candidate pool within that category in the second stage. Comparative experiments against a single model (Baseline Model) using the same CLIP-based architecture showed that the Hierarchical Model achieved a consistent and significant performance improvement of 4.5-5.0%p on average across all key metrics, including Precision@K, MAP@K, and NDCG@K. This paper empirically demonstrates that this performance enhancement is not merely due to search space reduction, but is based on clear theoretical justifications: (1) 'semantic disambiguation' through the specialization of embedding spaces, (2) the efficiency of a 'learned cascade' structure, and (3) the mitigation of data sparsity via the 'information bottleneck' principle.

Abstract

This study aims to solve the 'semantic interference' problem that occurs in B2B recommender systems when processing over 17,000 product categories within a single embedding space. To address this, we propose a two-stage Hierarchical Multimodal Recommendation Model (Hierarchical Model). The proposed model first predicts one of 238 mid-level categories in the first stage, and then performs detailed item recommendation on a refined candidate pool within that category in the second stage. Comparative experiments against a single model (Baseline Model) using the same CLIP-based architecture showed that the Hierarchical Model achieved a consistent and significant performance improvement of 4.5-5.0%p on average across all key metrics, including Precision@K, MAP@K, and NDCG@K. This paper empirically demonstrates that this performance enhancement is not merely due to search space reduction, but is based on clear theoretical justifications: (1) 'semantic disambiguation' through the specialization of embedding spaces, (2) the efficiency of a 'learned cascade' structure, and (3) the mitigation of data sparsity via the 'information bottleneck' principle.

발행기관:
한국데이터전략학회
분류:
경영학

AI 법률 상담

이 논문의 주제에 대해 더 알고 싶으신가요?

460만+ 법률 자료에서 관련 판례·법령·해석례를 찾아 답변합니다

AI 상담 시작
계층형 이미지-텍스트 멀티모달 모델을 활용한 국내 스타트업-해외 바이어간 B2B 추천 성능 향상 연구 | Journal of Information Technology Applications & Management 2025 | AskLaw | 애스크로 AI