• ๋Œ€ํ•œ์ „๊ธฐํ•™ํšŒ
Mobile QR Code QR CODE : The Transactions of the Korean Institute of Electrical Engineers
  • COPE
  • kcse
  • ํ•œ๊ตญ๊ณผํ•™๊ธฐ์ˆ ๋‹จ์ฒด์ด์—ฐํ•ฉํšŒ
  • ํ•œ๊ตญํ•™์ˆ ์ง€์ธ์šฉ์ƒ‰์ธ
  • Scopus
  • crossref
  • orcid




AI, anomaly detection, abnormal behavior, ssh, rdp, GAN, LLM, synthetic dataset

1. ์„œ ๋ก 

๋„คํŠธ์›Œํฌ ํŠธ๋ž˜ํ”ฝ์˜ ์•”ํ˜ธํ™” ํ‘œ์ค€ํ™”๋Š” ํ†ต์‹  ๊ธฐ๋ฐ€์„ฑ์„ ํš๊ธฐ์ ์œผ๋กœ ํ–ฅ์ƒ์‹œ์ผฐ์œผ๋‚˜, ์—ญ์„ค์ ์œผ๋กœ ๋ณด์•ˆ ๊ด€์ œ ์ธก๋ฉด์—์„œ ์นจํ•ด ๋ฐ ์ •๋ณด์œ ์ถœ ํƒ์ง€๋ฅผ ์ €ํ•ดํ•˜๋Š” ์š”์†Œ๋กœ ์ž‘์šฉํ•œ๋‹ค. ๊ณต๊ฒฉ์ž๋“ค์€ SSH (Secure Shell), RDP (Remote Desktop Protocol)๊ณผ ๊ฐ™์€ ์›๊ฒฉ ์ ‘์† ํ”„๋กœํ† ์ฝœ์„ ์šฐํšŒ ์ ‘์†ํ•˜๋Š” ๋ฐฉ์‹์œผ๋กœ ์•…์„ฑ์ฝ”๋“œ ์œ ์ž…, ๋‚ด๋ถ€ ์‹œ์Šคํ…œ ์›๊ฒฉ ์ ‘์†, ๋‚ด๋ถ€ ๋ฐ์ดํ„ฐ ์œ ์ถœ ๋“ฑ ๊ณ ๋„ํ™”๋œ ์‚ฌ์ด๋ฒ„ ๊ณต๊ฒฉ์„ ์ง€์†์ ์œผ๋กœ ์‹œ๋„ํ•˜๊ณ  ์žˆ๋‹ค. ๋”์šฑ์ด ์ž˜ ์•Œ๋ ค์ง„ ํ‘œ์ค€ ํฌํŠธ ๋ฒˆํ˜ธ๋ฅผ ์ž„์˜๋กœ ๋ณ€๊ฒฝํ•˜๋Š” ์šฐํšŒ ๊ธฐ๋ฒ•์ด ์ผ๋ฐ˜ํ™”๋จ์— ๋”ฐ๋ผ, ํฌํŠธ ์ •๋ณด๋กœ ํ† ๋Œ€๋กœ ์˜์‹ฌ๋˜๋Š” ์„œ๋น„์Šค๋ฅผ ํƒ์ง€ํ•˜๋Š” ๊ธฐ์กด์˜ ์‹œ๊ทธ๋‹ˆ์ฒ˜ ๋ฐฉ์‹์€ ๊ทธ ์‹คํšจ์„ฑ์ด ํ˜„์ €ํžˆ ๊ฐ์†Œํ•˜๊ณ  ์žˆ๋‹ค[1]. Salt Typhoon๊ณผ ๊ฐ™์€ ๊ณต๊ฒฉ ๊ทธ๋ฃน์€ ํƒ์ง€ ์‹œ์Šคํ…œ์„ ์šฐํšŒํ•˜๊ธฐ ์œ„ํ•ด ๋น„ํ‘œ์ค€ TCP 57722ํฌํŠธ์„ ํ†ตํ•ด SSH ์„œ๋น„์Šค๋ฅผ ํ™œ์„ฑํ™”ํ•˜์—ฌ ์žฅ๊ธฐ์ ์ธ ์ ‘๊ทผ ๊ถŒํ•œ์„ ์œ ์ง€ํ•˜๊ธฐ๋„ ํ•˜์˜€์œผ๋ฉฐ[2], Symbiote์™€ ๊ฐ™์€ ์•…์„ฑ์ฝ”๋“œ๋Š” ํŠน์ • ํฌํŠธ ๋ฆฌ์ŠคํŠธ๋ฅผ ๋™์ ์œผ๋กœ ์ˆœํ™˜ํ•˜๋Š” ํฌํŠธ ํ˜ธํ•‘ ๊ธฐ๋ฒ•์„ ์‚ฌ์šฉํ•˜์—ฌ ๋ณด์•ˆ ๋ชจ๋‹ˆํ„ฐ๋ง์„ ๋ฌด๋ ฅํ™”ํ•˜๊ธฐ๋„ ํ•œ๋‹ค[3].

์ด์— ๋”ฐ๋ผ ํ•™๊ณ„์™€ ์‚ฐ์—…๊ณ„๋Š” ํ†ต๊ณ„์  ํŠน์„ฑ๊ณผ ํ–‰์œ„์˜ ํŒจํ„ด์„ ํ•™์Šตํ•˜๋Š” AI ๊ธฐ๋ฐ˜ ์ด์ƒ ํƒ์ง€ ๊ธฐ์ˆ ๋กœ ์ œํ•œ๋œ ์ด์ƒํ–‰์œ„ ํƒ์ง€๋ฅผ ์ง€์†์ ์œผ๋กœ ๊ทน๋ณตํ•ด ๋‚˜๊ฐ€๊ณ  ์žˆ๋‹ค. ํ•˜์ง€๋งŒ AI ๊ธฐ๋ฐ˜ ํƒ์ง€ ์—ญ์‹œ ๊ณ ํ’ˆ์งˆ ํ•™์Šต ๋ฐ์ดํ„ฐ ํ™•๋ณด๊ฐ€ ์ œํ•œ๋จ์— ๋”ฐ๋ผ ํƒ์ง€์˜ ์ •ํ™•๋„๋ฅผ ๊ฐœ์„ ํ•˜๋Š”๋ฐ ๋งŽ์€ ์–ด๋ ค์›€์„ ๊ฒช๊ณ  ์žˆ๋‹ค[4, 5]. ์ตœ๊ทผ์—๋Š” ์ด๋Ÿฌํ•œ ์ œ์•ฝ ์‚ฌํ•ญ๋“ค์„ AI ์ƒ์„ฑ ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ์ด์šฉํ•˜์—ฌ ํ•™์Šต ๋ฐ์ดํ„ฐ์˜ ๊ณต๋ฐฑ์„ ๊ฐœ์„ ํ•˜๋Š” ์—ฐ๊ตฌ๊ฐ€ ์ง€์†์ ์œผ๋กœ ์ง„ํ–‰ ์ค‘์ด๋‹ค.

๋ณธ ๋…ผ๋ฌธ์€ ์ด์ƒ ํŠธ๋ž˜ํ”ฝ ํƒ์ง€ ๋ชจ๋ธ์˜ ๋ฌธ์ œ์ ๊ณผ ๊ธฐ์ˆ  ๋ฐœ์ „ ์ถ”์ด, ๊ด€๋ จ ์—ฐ๊ตฌ, ์†”๋ฃจ์…˜์„ ๊ธฐ์ˆ ํ•˜๋ฉด์„œ ํ•œ๋‹ค. ๋”๋ถˆ์–ด ๊ธฐ์กด ์—ฐ๊ตฌ ์„ฑ๊ณผ์™€ ํ–ฅํ›„ ์—ฐ๊ตฌ ๋ฐฉํ–ฅ์„ ์„ค๋ช…ํ•˜๋ฉด์„œ ๊ด€๋ จ ์—ฐ๊ตฌ๋ฅผ ์—ฐ๊ตฌ์ž๋“ค์—๊ฒŒ ๊ฐ€์ด๋“œ๋ฅผ ์ œ๊ณตํ•œ๋‹ค. ๋ณธ ๋…ผ๋ฌธ์˜ ๊ธฐ์—ฌ๋„๋Š” ๋‹ค์Œ๊ณผ ๊ฐ™๋‹ค. ๋จผ์ € ์ด์ƒํ–‰์œ„ ํƒ์ง€ ๋‹จ๊ณ„์™€ ๊ธฐ์ˆ  ๋ฐœ์ „ ์ถ”์ด๋ฅผ ์„ธ๋Œ€๋ณ„๋กœ ๊ตฌ๋ถ„ํ•˜์—ฌ ๊ด€๋ จ๋œ ์—ฐ๊ตฌ๋ฅผ ์„ค๋ช…ํ•œ๋‹ค. ์ด์–ด์„œ AI ๊ธฐ์ˆ ์„ ์ด์šฉํ•œ 2์„ธ๋Œ€, 3์„ธ๋Œ€ ์ด์ƒํ–‰์œ„ ํƒ์ง€์™€ ๊ด€๋ จํ•˜์—ฌ ๊ธฐ์กด ์œ ๊ด€ ์—ฐ๊ตฌ๋ฅผ ์„ค๋ช…ํ•˜์—ฌ ํ–ฅํ›„ ์ด์–ด์งˆ ์—ฐ๊ตฌ์— ๋Œ€ํ•œ ์ธ์‚ฌ์ดํŠธ๋ฅผ ์ œ๊ณตํ•˜๋ฉฐ, ๋งˆ์ง€๋ง‰์œผ๋กœ 4์„ธ๋Œ€ ์ด์ƒํ–‰์œ„ ํƒ์ง€ ๋ถ€๋ถ„์—์„œ LLM ๊ธฐ๋ฐ˜ ํŠธ๋ž˜ํ”ฝ ํ•ฉ์„ฑ์˜ ๊ฐœ๋…์„ ์‹คํ—˜์„ ํ†ตํ•ด ๊ฒ€์ฆํ•จ์œผ๋กœ์จ ๊ด€๋ จ ์—ฐ๊ตฌ์˜ ๊ฐ€๋Šฅ์„ฑ์„ ์ œ์‹œํ•œ๋‹ค.

๋ณธ ๋…ผ๋ฌธ์€ 2์žฅ์—์„œ ์•”ํ˜ธํ™” ํŠธ๋ž˜ํ”ฝ ๋ถ„์„์˜ ์—ฐ๊ตฌ ๋™ํ–ฅ๊ณผ ์š”์†Œ ๊ธฐ์ˆ ์„ ๊ณ ์ฐฐํ•˜๊ณ , ์‹ฌ์ธต ํŒจํ‚ท ๊ฒ€์‚ฌ(Deep Packet Inspection, DPI) ๊ธฐ๋ฐ˜ ํƒ์ง€์˜ ํ•œ๊ณ„์—์„œ ์‹œ์ž‘ํ•˜์—ฌ ์‹ฌ์ธต ํ•™์Šต, ๋น„์ง€๋„ยท์ž๊ธฐ ์ง€๋„ ํ•™์Šต, ์ƒ์„ฑํ˜• AI, LLM ๊ธฐ๋ฐ˜ ์œ„ํ˜‘ ์˜ˆ์ธก์œผ๋กœ ์ด์–ด์ง€๋Š” ๊ธฐ์ˆ  ํ๋ฆ„์„ ์„ค๋ช…ํ•œ๋‹ค. 3์žฅ์—์„œ๋Š” ์ด์ƒ ํŠธ๋ž˜ํ”ฝ ํƒ์ง€ ํ”Œ๋žซํผ์˜ ์„ค๊ณ„ ๊ฐœ์š”์™€ ๋ฐ์ดํ„ฐ ์ˆ˜์ง‘, ์ „์ฒ˜๋ฆฌ, ํƒ์ง€๋ชจ๋ธ ์„ค๊ณ„์˜ 3๋‹จ๊ณ„ ๋ณ„ ์ƒ์„ธ ๊ฐ€์ด๋“œ๋ผ์ธ์„ ์ œ์‹œํ•˜๋ฉฐ, ๊ฐ ๋‹จ๊ณ„์—์„œ ๊ธฐ์กด ์‹ค์ฆ ์—ฐ๊ตฌ ์„ฑ๊ณผ๋ฅผ ํ•จ๊ป˜ ๊ธฐ์ˆ ํ•œ๋‹ค. 4์žฅ์€ LLM ๊ธฐ๋ฐ˜ ํ›ˆ๋ จ ๋ฐ์ดํ„ฐ ํ•ฉ์„ฑ์„ ์œ„ํ•œ ๊ฐœ๋… ๊ฒ€์ฆ ์‹คํ—˜์— ๋Œ€ํ•ด ์„ค๋ช…ํ•œ๋‹ค. 5์žฅ์—์„œ๋Š” 4์„ธ๋Œ€ LLM ๊ธฐ๋ฐ˜ ๊ธฐ์ˆ ์˜ ๊ธฐ๋Œ€ํšจ๊ณผ์™€ ์ ์šฉ ์‹œ ๊ณ ๋ ค์‚ฌํ•ญ์„ ๋…ผ์˜ํ•˜๊ณ , ํ–ฅํ›„ ์—ฐ๊ตฌ ๋ฐฉํ–ฅ์„ ์ œ์‹œํ•˜๋ฉฐ ๋…ผ๋ฌธ์„ ๋งˆ๋ฌด๋ฆฌํ•œ๋‹ค.

2. ๊ด€๋ จ ์—ฐ๊ตฌ ๋ฐ ์š”์†Œ ๊ธฐ์ˆ 

์•”ํ˜ธํ™” ํŠธ๋ž˜ํ”ฝ ๋ถ„์„ ์—ฐ๊ตฌ๋Š” ํฌ๊ฒŒ ์„ธ ๋ฐฉํ–ฅ์œผ๋กœ ๋ฐœ์ „ํ•ด ์™”๋‹ค. ํฌํŠธ ์ •๋ณด๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ ํ•˜๋Š” ์ดˆ๊ธฐ ์—ฐ๊ตฌ๋Š” Well-known Port๋ฅผ ํ™œ์šฉํ•˜์—ฌ ํŠธ๋ž˜ํ”ฝ์„ ๋ถ„๋ฅ˜ํ•˜์˜€์œผ๋‚˜, ์ด๋Š” ๋™์  ํฌํŠธ ํ• ๋‹น๊ณผ ํฌํŠธ ์œ„์žฅ ๊ธฐ๋ฒ•์˜ ๋“ฑ์žฅ์œผ๋กœ ๊ทธ ์‹คํšจ์„ฑ์ด ๊ธ‰๊ฐํ•˜์˜€๋‹ค. ๋‹ค์Œ์œผ๋กœ DPI ๊ธฐ๋ฐ˜ ์—ฐ๊ตฌ๋Š” ํŽ˜์ด๋กœ๋“œ ์ง์ ‘ ๊ฒ€์‚ฌ๋กœ ๋†’์€ ์ •ํ™•๋„๋ฅผ ๋‹ฌ์„ฑํ•˜์˜€์œผ๋‚˜, TLS ํ†ต์‹ ์ด ์ผ๋ฐ˜ํ™” ๋˜๋ฉด์„œ ํŽ˜์ด๋กœ๋“œ ์•”ํ˜ธํ™”๋กœ ์ธํ•ด ๊ทธ ์ ์šฉ ๋ฒ”์œ„๊ฐ€ ํฌ๊ฒŒ ์ œํ•œ๋˜์—ˆ๋‹ค.

2.1 DPI์™€ AI ๊ธฐ๋ฐ˜ ํƒ์ง€ ๋ชจ๋ธ ๋น„๊ต

์ „ํ†ต์ ์ธ ๋„คํŠธ์›Œํฌ ์นจ์ž… ํƒ์ง€ ์‹œ์Šคํ…œ(IDS)์€ ํŒจํ‚ท ํŽ˜์ด๋กœ๋“œ๋ฅผ ์ง์ ‘ ๊ฒ€์‚ฌํ•˜์—ฌ ์•Œ๋ ค์ง„ ๊ณต๊ฒฉ ์‹œ๊ทธ๋‹ˆ์ฒ˜์™€ ๋น„๊ตํ•˜๋Š” DPI ๋ฐฉ์‹์œผ๋กœ ์ด์ƒ ํŠธ๋ž˜ํ”ฝ์„ ํƒ์ง€ํ•˜์˜€๋‹ค.ํ•˜์ง€๋งŒ ์•”ํ˜ธํ™” ํŠธ๋ž˜ํ”ฝ์ด ๋Œ€๋‹ค์ˆ˜๋ฅผ ์ฐจ์ง€ํ•จ์— ๋”ฐ๋ผ ํŽ˜์ด๋กœ๋“œ์˜ ์ปจํ…์ธ ๋ฅผ ๋ถ„์„ํ•˜๋Š” DPI ๋ฐฉ์‹์€ ํƒ์ง€ ์„ฑ๋Šฅ์ด ๊ธ‰๊ฒฉํžˆ ์ €ํ•˜๋˜์—ˆ๊ณ , Table 1.๊ณผ ๊ฐ™์ด ํŠธ๋ž˜ํ”ฝ์˜ ๋ฉ”ํƒ€ ๋ฐ์ดํ„ฐ ๋ฐ ํŒจํ„ด์„ ๋ถ„์„ํ•˜๋Š” AI ๊ธฐ๋ฐ˜ ํƒ์ง€ ๋ชจ๋ธ์ด ๋Œ€์•ˆ์œผ๋กœ ๋Œ€๋‘๋˜์—ˆ์œผ๋ฉฐ, ์‹ค์ง์ ์œผ๋กœ๋„ ๋†’์€ ํƒ์ง€ ํšจ์œจ์„ฑ์„ ๋ณด์—ฌ์ค€๋‹ค.

ํ‘œ 1. DPI์™€ AI๊ธฐ๋ฐ˜ ํƒ์ง€ ๋ชจ๋ธ ๊ฐ„ ํŠน์ง• ๋น„๊ต

Table 1. Comparison of DPI and AI-Based Detection Models

๋ถ„์„ ์ง€ํ‘œ DPI ๋ฐฉ์‹ AI ๊ธฐ๋ฐ˜ ํƒ์ง€ ๋ชจ๋ธ
๋ถ„์„ ๋Œ€์ƒ ํŒจํ‚ท ํŽ˜์ด๋กœ๋“œ ๋ฐ ์‹œ๊ทธ๋‹ˆ์ฒ˜ ํŠธ๋ž˜ํ”ฝ ํ†ต๊ณ„ยทํ๋ฆ„ ๋ฉ”ํƒ€๋ฐ์ดํ„ฐ
์•”ํ˜ธํ™” ๋Œ€์‘ ๋ณตํ˜ธํ™” ์—†์ด ๋ถ„์„ ๋ถˆ๊ฐ€ ๋ณตํ˜ธํ™” ์—†์ด ํ–‰๋™ ํŒจํ„ด ํƒ์ง€
ํƒ์ง€ ๋ฒ”์œ„ ์•Œ๋ ค์ง„ ๊ณต๊ฒฉ ์œ„์ฃผ ๋ฏธ์ง€ยท์ œ๋กœ๋ฐ์ด ๊ณต๊ฒฉ ํƒ์ง€ ๊ฐ€๋Šฅ
์˜ค๋ฒ„ํ—ค๋“œ ๋ณตํ˜ธํ™” ์‹œ ๋งค์šฐ ๋†’์Œ ๋ชจ๋ธ ์ถ”๋ก , ์ƒ๋Œ€์ ์œผ๋กœ ํšจ์œจ์ 
ํ”„๋ผ์ด๋ฒ„์‹œ ํŽ˜์ด๋กœ๋“œ ๋…ธ์ถœ ์œ„ํ—˜ ๋ฉ”ํƒ€๋ฐ์ดํ„ฐ ๊ธฐ๋ฐ˜, ํ”„๋ผ์ด๋ฒ„์‹œ ๋ณดํ˜ธ

2.2 ํŠธ๋ž˜ํ”ฝ ๋ฉ”ํƒ€๋ฐ์ดํ„ฐ ๋ฐ ํ–‰๋™ ํŠน์ง• ์ถ”์ถœ

AI ํƒ์ง€ ๋ชจ๋ธ์€ ํŠธ๋ž˜ํ”ฝ์—์„œ ์•”ํ˜ธํ™”๋˜์ง€ ์•Š๋Š” ์™ธํ˜•์  ํŠน์ง•์„ ๋ถ„์„ํ•œ๋‹ค. ์˜ˆ๋ฅผ ๋“ค์–ด ํŒจํ‚ท ํฌ๊ธฐ ๋ณ„ ๋ถ„ํฌ, ํŒจํ‚ท ๊ฐ„ ๋„์ฐฉ ๊ฐ„๊ฒฉ, ์„ธ์…˜ ์œ ์ง€ ์‹œ๊ฐ„, ํ”„๋กœํ† ์ฝœ ๋ฉ”ํƒ€๋ฐ์ดํ„ฐ ๋“ฑ์„ ์ฃผ์š” ํ”ผ์ฒ˜๋กœ ํ™œ์šฉํ•  ์ˆ˜ ์žˆ๋‹ค.

์ •๋ณด ์ด๋ก ์  ๊ด€์ ์˜ ์—”ํŠธ๋กœํ”ผ ๋ถ„์„๊ณผ ์ฃผ๊ธฐ์„ฑ ํƒ์ง€, ๊ณ„์ธต์  ๊ตฌ์กฐ ๋ถ„์„์„ ํ†ตํ•ด ์ •์ƒ ํŠธ๋ž˜ํ”ฝ๊ณผ ์•…์˜์  ํ•ด์ปค ์„œ๋ฒ„ ๊ฐ„์˜ ์ฐจ์ด๋ฅผ ๋ฐํ˜€๋‚ผ ์ˆ˜ ์žˆ๋‹ค[6]. ํŠนํžˆ ์ตœ๊ทผ ์—ฐ๊ตฌ ์ค‘ ์„ธ์…˜์˜ ์ดˆ๊ธฐ 784๋ฐ”์ดํŠธ๋ฅผ 2์ฐจ์› ๊ทธ๋ ˆ์ด์Šค์ผ€์ผ ์ด๋ฏธ์ง€๋กœ ๋ณ€ํ™˜ํ•˜์—ฌ CNN์— ์ž…๋ ฅํ•˜๋Š” ์‹œ๊ฐํ™” ๊ธฐ๋ฒ•์„ ์ฑ„ํƒํ•œ ์—ฐ๊ตฌ๋„ ์ฐพ์•„๋ณผ ์ˆ˜ ์žˆ๋‹ค[7]. ์ด ๋ฐฉ์‹์€ ์•”ํ˜ธํ™”๋œ ํ”„๋กœํ† ์ฝœ ๋‚ด ๊ตฌ์กฐ์  ํŒจํ„ด์„ ํšจ๊ณผ์ ์œผ๋กœ ์ถ”์ถœํ•˜๋ฉฐ, VPN๊ณผ non-VPN ํŠธ๋ž˜ํ”ฝ ๊ตฌ๋ถ„์—์„œ 99% ์ด์ƒ์˜ ์ •ํ™•๋„๋ฅผ ๋ณด์—ฌ์ค€๋‹ค[8].

ํŠธ๋ž˜ํ”ฝ ๋ฉ”ํƒ€๋ฐ์ดํ„ฐ๋Š” ํฌ๊ฒŒ ํŒจํ‚ท ์ˆ˜์ค€, ํ”Œ๋กœ์šฐ ์ˆ˜์ค€, ์„ธ์…˜ ์ˆ˜์ค€์˜ ์„ธ ๊ณ„์ธต์œผ๋กœ ๊ตฌ๋ถ„๋œ๋‹ค[9]. ํŒจํ‚ท ์ˆ˜์ค€์—์„œ๋Š” ๊ฐœ๋ณ„ ํŒจํ‚ท์˜ ํฌ๊ธฐ, ํŒจํ‚ท ๊ฐ„ ๋„์ฐฉ ์‹œ๊ฐ„(Inter-Packet Arrival Time, IPA), ์ „์†ก ๋ฐฉํ–ฅ ๋“ฑ์˜ ํ”ผ์ฒ˜๋ฅผ ์ถ”์ถœํ•  ์ˆ˜ ์žˆ๋‹ค[10]. ํ”Œ๋กœ์šฐ ์ˆ˜์ค€์—์„œ๋Š” ์ถœ๋ฐœ์ง€IP, ๋ชฉ์ ์ง€IP, ํฌํŠธ, ํ”„๋กœํ† ์ฝœ ๋ณ„๋กœ ์ง‘๊ณ„๋œ ํŒจํ‚ท๋“ค์˜ ํ†ต๊ณ„ ๊ฐ’์ธ ๋ฐ”์ดํŠธ ์ „์†ก๋ฅ , ํŒจํ‚ท ์ „์†ก๋ฅ , ํ†ต์‹ ์˜ ์ง€์† ์‹œ๊ฐ„ ๋“ฑ์„ ํ™œ์šฉํ•œ๋‹ค[11]. ์„ธ์…˜ ์ˆ˜์ค€์—์„œ๋Š” ์ธ๋ฐ”์šด๋“œยท์•„์›ƒ๋ฐ”์šด๋“œ ํ”Œ๋กœ์šฐ๋ฅผ ์–‘๋ฐฉํ–ฅ์œผ๋กœ ๊ฒฐํ•ฉํ•˜์—ฌ ์†ก์ˆ˜์‹  ๋ฐ”์ดํŠธ ๋น„์œจ, ์‘๋‹ต ์‹œ๊ฐ„ ๋ถ„ํฌ ๋“ฑ ์‘์šฉ ๊ณ„์ธต์˜ ํ–‰๋™ ํŠน์„ฑ์„ ๋ฐ˜์˜ํ•˜๋Š” ํ”ผ์ฒ˜๋ฅผ ์ถ”๊ฐ€๋กœ ๋„์ถœํ•  ์ˆ˜ ์žˆ๋‹ค.

์ด๋Ÿฌํ•œ ๋ฉ”ํƒ€๋ฐ์ดํ„ฐ ํ”ผ์ฒ˜ ์ค‘์—์„œ๋„ ํŠนํžˆ ํฌํŠธ์™€ ๋ฌด๊ด€ํ•œ ํ”ผ์ฒ˜์˜ ์ค‘์š”์„ฑ์ด ๋ถ€๊ฐ๋œ๋‹ค. ๊ธฐ์กด์˜ ํฌํŠธ ๋ฒˆํ˜ธ ๊ธฐ๋ฐ˜ ํƒ์ง€๋Š” ๋น„ํ‘œ์ค€ ํฌํŠธ๋ฅผ ์‚ฌ์šฉํ•˜๋Š” ์šฐํšŒ ๊ณต๊ฒฉ์— ์ทจ์•ฝํ•˜๋ฏ€๋กœ, ํฌํŠธ ๋ฒˆํ˜ธ๋ฅผ ๋ฐฐ์ œํ•˜๊ณ  ํ†ต์‹ ์˜ ํ–‰๋™ ํŒจํ„ด๋งŒ์œผ๋กœ ํ”„๋กœํ† ์ฝœ์„ ์‹๋ณ„ํ•˜๋Š” ์ ‘๊ทผ์ด ์š”๊ตฌ๋œ๋‹ค. ํŒจํ‚ท ๋„์ฐฉ ๊ฐ„๊ฒฉ์˜ ํ‰๊ท  ๋ฐ ๋ถ„์‚ฐ, ํŒจํ‚ท ํฌ๊ธฐ ๋ถ„ํฌ, ์„ธ์…˜ ๋ฐฉํ–ฅ ๋น„์œจ, ์—”ํŠธ๋กœํ”ผ ๊ธฐ๋ฐ˜ ๋žœ๋ค์„ฑ ์ง€ํ‘œ ๋“ฑ์€ ์•”ํ˜ธํ™” ์—ฌ๋ถ€์™€ ๋ฌด๊ด€ํ•˜๊ฒŒ ์ถ”์ถœ ๊ฐ€๋Šฅํ•˜๋ฉฐ, ์„œ๋น„์Šค ์œ ํ˜•์— ๋”ฐ๋ผ ๊ณ ์œ ํ•œ ํ†ต๊ณ„์  ํŠน์„ฑ์„ ๋‚˜ํƒ€๋‚ด๋ฏ€๋กœ ํฌํŠธ ๋…๋ฆฝ์  ๋ถ„๋ฅ˜์˜ ํ•ต์‹ฌ ํ”ผ์ฒ˜๋กœ ํ™œ์šฉ๋œ๋‹ค[5, 12]. ํŠนํžˆ ๋ด‡๋„ท ํŠธ๋ž˜ํ”ฝ์€ ์ผ์ •ํ•œ ๋น„์ฝ˜ ์ฃผ๊ธฐ๋ฅผ ๊ฐ€์ง€๋ฏ€๋กœ, ํŒจํ‚ท ๋„์ฐฉ ๊ฐ„๊ฒฉ์˜ ์ฃผ๊ธฐ์„ฑ ๋ถ„์„์ด ์•…์„ฑ ํŠธ๋ž˜ํ”ฝ ํƒ์ง€์— ํšจ๊ณผ์ ์œผ๋กœ ์ ์šฉ๋  ์ˆ˜ ์žˆ๋‹ค[6].

ํ•œํŽธ ํŠธ๋ž˜ํ”ฝ ๋ฉ”ํƒ€๋ฐ์ดํ„ฐ์˜ ํ•ด์„ ๊ฐ€๋Šฅ์„ฑ์„ ํ™•๋ณดํ•˜๊ธฐ ์œ„ํ•œ ์—ฐ๊ตฌ๋„ ํ™œ๋ฐœํžˆ ์ง„ํ–‰๋˜๊ณ  ์žˆ๋‹ค. ๊ฐ ํ”ผ์ฒ˜๊ฐ€ ํƒ์ง€ ๋ชจ๋ธ์˜ ์˜ˆ์ธก์— ๋ฏธ์น˜๋Š” ๊ธฐ์—ฌ๋„๋ฅผ ์ •๋Ÿ‰ํ™”ํ•˜๊ธฐ ์œ„ํ•ด ๊ฒŒ์ž„ ์ด๋ก  ๊ธฐ๋ฐ˜์˜ SHAP(SHapley Additive exPlanations) ๋ฐฉ๋ฒ•๋ก ์ด ์ ์šฉ๋˜๋ฉฐ, ์ด๋ฅผ ํ†ตํ•ด ์–ด๋–ค ํ”ผ์ฒ˜๊ฐ€ ํŠน์ • ๊ณต๊ฒฉ ํƒ์ง€์— ํ•ต์‹ฌ์ ์ธ ์—ญํ• ์„ ํ•˜๋Š”์ง€ ๋ฐ์ดํ„ฐ ๊ธฐ๋ฐ˜์œผ๋กœ ๋ถ„์„ํ•  ์ˆ˜ ์žˆ๋‹ค[13]. ์ด๋Š” ๋ณด์•ˆ ์ „๋ฌธ๊ฐ€์˜ ์ง๊ด€์— ์˜์กดํ•˜๋˜ ํ”ผ์ฒ˜ ์„ ์ • ๊ณผ์ •์„ ๊ฐ๊ด€ํ™”ํ•˜๊ณ , ํƒ์ง€ ๋ชจ๋ธ์˜ ์‹ ๋ขฐ์„ฑ์„ ์ œ๊ณ ํ•˜๋Š” ๋ฐ ์ค‘์š”ํ•œ ์—ญํ• ์„ ํ•œ๋‹ค.

2.3 ์‹ฌ์ธต ํ•™์Šต ๋ชจ๋ธ ์ ์šฉ

๋„คํŠธ์›Œํฌ ํŠธ๋ž˜ํ”ฝ์€ ์‹œ๊ฐ„์  ์ˆœ์„œ๊ฐ€ ์ค‘์š”ํ•œ ์‹œ๊ณ„์—ด ๋ฐ์ดํ„ฐ์ด๋ฏ€๋กœ RNN, LSTM, GRU ๊ธฐ๋ฐ˜ ๋ชจ๋ธ์ด ๋„๋ฆฌ ํ™œ์šฉ๋œ๋‹ค. CNN๊ณผ GRU๋ฅผ ๊ฒฐํ•ฉํ•œ ํ•˜์ด๋ธŒ๋ฆฌ๋“œ ๋ชจ๋ธ์€ ๊ณต๊ฐ„์ , ์‹œ๊ฐ„์  ํŠน์ง•์„ ๋™์‹œ์— ํฌ์ฐฉํ•˜์—ฌ ๋‹จ์ผ ๋ชจ๋ธ๋ณด๋‹ค ์šฐ์ˆ˜ํ•œ ์„ฑ๋Šฅ์„ ๋ฐœํœ˜ํ•˜๊ธฐ๋„ ํ•œ๋‹ค[14]. ๋˜ํ•œ Graph Neural Network(GNN)์€ IP์™€ ํฌํŠธ๋ฅผ ๋…ธ๋“œ๋กœ, ํŠธ๋ž˜ํ”ฝ ํ๋ฆ„์„ ์—ฃ์ง€๋กœ ๋ชจ๋ธ๋งํ•˜์—ฌ ๊ณต๊ฒฉ์ž์˜ ์ธก๋ฉด ์ด๋™ ๊ณต๊ฒฉ์ด๋‚˜ ๋ด‡๋„ท ํ†ต์‹  ๊ตฌ์กฐ๋ฅผ ํŒŒ์•…ํ•˜๋Š”๋ฐ ์šฉ์ดํ•˜๋‹ค[15, 16]. GNN ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ์ด์šฉํ•œ HyperVision์€ ํ๋ฆ„ ์ƒํ˜ธ์ž‘์šฉ ๊ทธ๋ž˜ํ”„๋ฅผ ํ†ตํ•ด ์•”ํ˜ธํ™” ํŽ˜์ด๋กœ๋“œ์— ์˜์กดํ•˜์ง€ ์•Š๋Š” ํƒ์ง€ ์„ฑ๋Šฅ์„ ๋ณด์—ฌ์ฃผ์—ˆ๋‹ค[17]. Table 2๋Š” ์ฃผ์š” ์‹ฌ์ธต ํ•™์Šต ๋ชจ๋ธ์„ ๋น„๊ตํ•˜๊ณ  ์žˆ๋‹ค.

ํ‘œ 2. ์ฃผ์š” ๋”ฅ๋Ÿฌ๋‹ ๋ชจ๋ธ์˜ ํŠน์ง• ๋น„๊ต

Table 2. Comparison of Major Deep Learning Models

๋ชจ๋ธ ์ฃผ์š” ๋ฉ”์ปค๋‹ˆ์ฆ˜ ์žฅ์ 
CNN ์ด๋ฏธ์ง€ ๊ธฐ๋ฐ˜ ํŠน์ง• ์ถ”์ถœ ๊ณต๊ฐ„์  ํŒจํ„ด ํŒŒ์•…, ์ž๋™ ํŠน์ง• ํ•™์Šต
RNN/LSTM ์‹œํ€€์Šคยท์‹œ๊ฐ„ ์ •๋ณด ํ•™์Šต ์‹œ๊ณ„์—ด ์˜์กด์„ฑ ํŒŒ์•…์— ํƒ์›”
Autoencoder ์žฌ๊ตฌ์„ฑ ์˜ค์ฐจ ๊ธฐ๋ฐ˜ ํƒ์ง€ ๋ผ๋ฒจ ์—†๋Š” ๋ฐ์ดํ„ฐ ํ•™์Šต ๊ฐ€๋Šฅ
GNN ๊ทธ๋ž˜ํ”„ ๊ตฌ์กฐยท๊ด€๊ณ„ ํ•™์Šต ์ „์ฒด ๋„คํŠธ์›Œํฌ ๋งฅ๋ฝ ํŒŒ์•…

2.4 ๋น„์ง€๋„ ํ•™์Šต ๋ฐ ์ž๊ธฐ ์ง€๋„ ํ•™์Šต ์ ์šฉ

๋น„์ง€๋„ ํ•™์Šต์€ ๋ฐ์ดํ„ฐ์˜ ๋ณ„๋„ ๋ ˆ์ด๋ธ”์„ ์‚ฌ์ „์— ์ •์˜ํ•˜์ง€ ์•Š๊ณ  ๋ฐ์ดํ„ฐ๋“ค์˜ ์ˆจ๊ฒจ์ง„ ๊ตฌ์กฐ๋‚˜ ํŒจํ„ด์„ ์ฐพ์•„๋‚ด๋Š” ๊ฒƒ์œผ๋กœ ์ด๋ฅผ ์ด์ƒ ํŠธ๋ž˜ํ”ฝ ํƒ์ง€ ๊ธฐ์ˆ ์— ์ ์šฉํ•˜๋ฉด ๋ณ„๋„ ์œ„ํ˜‘ ํŠธ๋ž˜ํ”ฝ์˜ ์‚ฌ์ „ ํ•™์Šต ์—†์ด ์ด์ƒ ํŠธ๋ž˜ํ”ฝ์„ ํƒ์ง€ํ•  ์ˆ˜ ์žˆ๋‹ค. ์ด๋Š” ๋ณ„๋„ ๋ผ๋ฒจ ์—†์ด๋„ ์•Œ๋ ค์ง€์ง€ ์•Š์€ ๊ณต๊ฒฉ์„ ํƒ์ง€ํ•  ์ˆ˜ ์žˆ์–ด ์œ„ํ˜‘ ํŠธ๋ž˜ํ”ฝ์ด ์ถฉ๋ถ„ํžˆ ํ™•๋ณด๋˜์ง€ ์•Š์€ ์ƒํ™ฉ์—์„œ ์œ ์šฉํ•˜๋ฉฐ, ํŠนํžˆ ์ œ๋กœ๋ฐ์ด ์œ„ํ˜‘ ๋Œ€์‘์— ํšจ๊ณผ์ ์œผ๋กœ ์ ์šฉํ•  ์ˆ˜ ์žˆ๋‹ค.

๋น„์ง€๋„ ํ•™์Šต ๊ธฐ๋ฐ˜ ์•Œ๊ณ ๋ฆฌ์ฆ˜์—๋Š” ์žฌ๊ตฌ์„ฑ ์˜ค์ฐจ๋ฅผ ํ™œ์šฉํ•˜๋Š” Autoencoder, ๋ฐ์ดํ„ฐ ๊ฐ„ ๋ฐ€๋„ ๋ฐ ๊ฑฐ๋ฆฌ๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ ํ•˜๋Š” One-Class SVM(OCSVM), ์ž…์ฒด์ ์ธ ์ดˆ๊ตฌ๋ฅผ ๊ฒฝ๊ณ„์„ ์œผ๋กœ ๊ตฌ๋ถ„ํ•˜๋Š” Deep Support Vector Data Description(SVDD) ๋“ฑ์˜ ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ์‚ดํŽด๋ณผ ์ˆ˜ ์žˆ๋‹ค. Autoencoder ๊ธฐ๋ฐ˜ ์ด์ƒ ํŠธ๋ž˜ํ”ฝ ํƒ์ง€ ๋ชจ๋ธ์€ ์žฌ๊ตฌ์„ฑ ์˜ค์ฐจ๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ ์ •์ƒ ํŠธ๋ž˜ํ”ฝ ๋ถ„ํฌ๋ฅผ ํ•™์Šตํ•œ ํ›„ ํ•ด๋‹น ์ž„๊ณ„ ๊ฐ’์„ ์ดˆ๊ณผํ•˜๋Š” ์„ธ์…˜์— ๋Œ€ํ•ด์„œ๋Š” ์ด์ƒ ํŠธ๋ž˜ํ”ฝ์œผ๋กœ ๋ถ„๋ฅ˜ํ•˜๋Š” ๋ฐฉ์‹์ด๋‹ค. ์ •์ƒ ํŠธ๋ž˜ํ”ฝ ๋ฐ์ดํ„ฐ๋งŒ์œผ๋กœ ์ž…๋ ฅ์„ ๋ฐ›์•„ ์••์ถ•ยท๋ณต์›ํ•œ ์žฌ๊ตฌ์„ฑ ์˜ค์ฐจ์— ๋น„ํ•ด ์ด์ƒ ํŠธ๋ž˜ํ”ฝ์€ ์žฌ๊ตฌ์„ฑ ์˜ค์ฐจ๊ฐ€ ํฌ๊ฒŒ ๋ฐœ์ƒํ•˜๊ฒŒ ๋˜๋ฏ€๋กœ ํƒ์ง€๊ฐ€ ๊ฐ€๋Šฅํ•˜๊ฒŒ ๋˜๋Š” ์›๋ฆฌ์ด๋‹ค.

์ž๊ธฐ ์ง€๋„ ํ•™์Šต(Self-Supervised Learning, SSL)์€ ๋ ˆ์ด๋ธ”์ด ์—†๋Š” ๋ฐ์ดํ„ฐ์—์„œ ํ•™์Šต ์‹ ํ˜ธ๋ฅผ ์ž๋™์œผ๋กœ ์ƒ์„ฑํ•˜์—ฌ ์˜๋ฏธ์žˆ๋Š” ๋ฐ์ดํ„ฐ ํ‘œํ˜„์„ ๋งŒ๋“ค์–ด ๋‚ด๋Š” ๋ฐฉ๋ฒ•๋ก ์ด๋‹ค. ์ด์ค‘ ๋Œ€์กฐ ํ•™์Šต์€ ์œ ์‚ฌยท๋น„์œ ์‚ฌ ์Œ์˜ ๊ตฌ๋ถ„์„ ํ†ตํ•ด ๋ ˆ์ด๋ธ”์—†์ด ๋ถ„๋ฅ˜๊ฐ€ ๊ฐ€๋Šฅํ•˜๋‹ค[18]. ๋Œ€์กฐ ํ•™์Šต์˜ ๋Œ€ํ‘œ์ ์ธ ํ”„๋ ˆ์ž„์›Œํฌ์ธ ET-SSL์€ ์ •์ƒ ํŠธ๋ž˜ํ”ฝ์ด ์ž„๋ฒ ๋”ฉ ๊ณต๊ฐ„์—์„œ ๊ฐ€๊น๊ฒŒ ๋ญ‰์น˜๋„๋ก ํ•˜๊ณ , ์ด์ƒ ํŠธ๋ž˜ํ”ฝ์€ ๋ฉ€๋ฆฌ ๋ถ„๋ฆฌ๋˜๋„๋ก ์œ ๋„ํ•˜๋Š” ๋ฐฉ์‹์„ ํ†ตํ•ด 10Gbps ๊ณ ์† ํ™˜๊ฒฝ์—์„œ๋„ 15~25ms์˜ ๋‚ฎ์€ ์ง€์—ฐ์œผ๋กœ ์‹ค์‹œ๊ฐ„ ํƒ์ง€๊ฐ€ ๊ฐ€๋Šฅํ•œ ํ™•์žฅ์„ฑ์„ ์ œ๊ณตํ•œ๋‹ค[19].

2.5 ์ƒ์„ฑํ˜• AI ๊ธฐ๋ฐ˜ ๋ฐ์ดํ„ฐ ์ฆ๊ฐ•

๋‚˜์•„๊ฐ€ ์ƒ์„ฑํ˜• AI๋ฅผ ํ†ตํ•ด ๋ฐ์ดํ„ฐ๋ฅผ ํ•ฉ์„ฑํ•  ๊ฒฝ์šฐ ์ž๊ธฐ ์ง€๋„ ํ•™์Šต์„ ํ†ตํ•ด ๋ ˆ์ด๋ธ”์„ ๋งตํ•‘ํ•˜์—ฌ ํ•™์Šต ๋ฐ์ดํ„ฐ๋กœ์„œ์˜ ํ™œ์šฉ๋„๋ฅผ ๋†’์ผ ์ˆ˜ ์žˆ๋‹ค. GAN๊ณผ VAE๋ฅผ ํ™œ์šฉํ•˜์—ฌ ์‹ค์ œ ๊ณต๊ฒฉ ํŠธ๋ž˜ํ”ฝ๊ณผ ํ†ต๊ณ„์ ์œผ๋กœ ์œ ์‚ฌํ•œ ํ•ฉ์„ฑ ๋ฐ์ดํ„ฐ๋ฅผ ์ƒ์„ฑํ•จ์œผ๋กœ์จ ๋ฐ์ดํ„ฐ ๋ถˆ๊ท ํ˜• ๋ฌธ์ œ๋ฅผ ํ•ด์†Œํ•˜๋Š” ์ ‘๊ทผ๋ฒ•์ด ์ฃผ๋ชฉ๋ฐ›๊ณ  ์žˆ๋‹ค. CTGAN, CopulaGAN ๋“ฑ ํ…Œ์ด๋ธ”ํ˜• ๋ฐ์ดํ„ฐ ํŠนํ™” ์ƒ์„ฑ ๋ชจ๋ธ์„ ์ ์šฉํ•จ์œผ๋กœ์จ ์†Œ์ˆ˜ ํด๋ž˜์Šค ํ•™์Šต ์ƒ˜ํ”Œ์„ ๋Œ€ํญ ๋ณด๊ฐ•ํ•  ์ˆ˜ ์žˆ๋‹ค[20].

2.6 LLM ๊ธฐ๋ฐ˜ ์œ„ํ˜‘ ์˜ˆ์ธก

๋„คํŠธ์›Œํฌ ๋กœ๊ทธ๋ฅผ ๊ตฌ์กฐํ™” ํ…์ŠคํŠธ ํ˜•ํƒœ๋กœ ๋ณ€ํ™˜ํ•˜์—ฌ LLM์ด ์‹œ๋งจํ‹ฑ ์˜๋ฏธ๋ฅผ ๋ถ„์„ํ•˜๊ณ  ์œ„ํ˜‘์„ ์˜ˆ์ธกํ•œ๋‹ค. ๊ฒ€์ƒ‰ ์ฆ๊ฐ• ์ƒ์„ฑ(Retrieval-Augmented Generation, RAG)์™€ ๊ฒฐํ•ฉํ•˜๋ฉด ์ตœ์‹  ์œ„ํ˜‘ ์ธํ…”๋ฆฌ์ „์Šค๋ฅผ ์‹ค์‹œ๊ฐ„ ์ฐธ์กฐํ•˜์—ฌ ์žฌํ•™์Šต ์—†์ด๋„ ์ƒˆ๋กœ์šด ๊ณต๊ฒฉ ์œ ํ˜•์— ๋Œ€์‘ํ•  ์ˆ˜ ์žˆ๋‹ค. LLM ๊ธฐ๋ฐ˜ ์ด์ƒ ํŠธ๋ž˜ํ”ฝ ์˜ˆ์ธก์€ ํฌ๊ฒŒ ์„ธ ๋‹จ๊ณ„๋กœ ๊ตฌ์„ฑ๋œ๋‹ค. ๋จผ์ € ํŒจํ‚ท ํฌ๊ธฐ, ํŒจํ‚ท ๋„์ฐฉ ๊ฐ„๊ฒฉ, ์„ธ์…˜ ์œ ์ง€ ์‹œ๊ฐ„ ๋“ฑ ์ˆ˜์น˜ ํ”ผ์ฒ˜๋ฅผ ์ž์—ฐ์–ด ํ˜•ํƒœ๋กœ ๋ณ€ํ™˜ํ•˜๋Š” ์ „์ฒ˜๋ฆฌ ๋‹จ๊ณ„, ๋‘๋ฒˆ์งธ๋กœ ๋ณ€ํ™˜๋œ ํ…์ŠคํŠธ๋ฅผ LLM์— ์ž…๋ ฅํ•˜์—ฌ ํŠธ๋ž˜ํ”ฝ์˜ ์‹œ๋งจํ‹ฑ ํŒจํ„ด๊ณผ ๊ณต๊ฒฉ ์˜๋„๋ฅผ ์ถ”๋ก ํ•˜๋Š” ๋‹จ๊ณ„, ๋งˆ์ง€๋ง‰์œผ๋กœ RAG๋ฅผ ํ†ตํ•ด ์ตœ์‹  ์œ„ํ˜‘ ์ธํ…”๋ฆฌ์ „์Šค๋ฅผ ์‹ค์‹œ๊ฐ„ ์ฐธ์กฐํ•˜์—ฌ ์žฌํ•™์Šต ์—†์ด๋„ ๋ฏธ์ง€์˜ ๊ณต๊ฒฉ ์œ ํ˜•์— ๋Œ€์‘ํ•˜๋Š” ๋‹จ๊ณ„์ด๋‹ค. ์ด ๊ณผ์ •์„ ํ†ตํ•ด ๋‹จ์ˆœ ์ด์ƒ ์—ฌ๋ถ€ ํŒ๋ณ„์„ ๋„˜์–ด ๊ณต๊ฒฉ ์œ ํ˜• ์˜ˆ์ธก, ํƒ์ง€ ๊ทผ๊ฑฐ ์ƒ์„ฑ, ์‚ฌ๋žŒ์ด ์ดํ•ด ๊ฐ€๋Šฅํ•œ ํƒ์ง€ ๊ทœ์น™ ์ž๋™ ์ƒ์„ฑ ๋“ฑ์ด ๊ฐ€๋Šฅํ•˜๋‹ค[21, 22].

3. ์—ฐ๊ตฌ ๋ฐฉ๋ฒ•๋ก 

3.1 ํ”Œ๋žซํผ ์„ค๊ณ„ ๊ฐœ์š”

Table 3์€ ์•”ํ˜ธํ™” ํŠธ๋ž˜ํ”ฝ ํƒ์ง€ ํ”Œ๋žซํผ์˜ 3๋‹จ๊ณ„ ํ”„๋กœ์„ธ์Šค๋ณ„ ๋ฌธ์ œ์ , ๊ธฐ์ˆ  ๋ฐœ์ „ ์ถ”์ด, ๊ด€๋ จ ์—ฐ๊ตฌ ๋ฐ ์†”๋ฃจ์…˜์„ ํ†ตํ•ฉ ์ •๋ฆฌํ•œ ๋‚ด์šฉ์ด๋‹ค. ์ฐธ๊ณ ๋กœ ๊ฐ ๋‹จ๊ณ„์˜ ๊ธฐ์ˆ ๋ฐœ์ „ ์ถ”์ด์— ํ•ด๋‹นํ•˜๋Š” ๊ธฐ์กด ์‹ค์ฆ ์—ฐ๊ตฌ ์„ฑ๊ณผ๋ฅผ ๋ช…์‹œํ•˜์—ฌ, ์œ ์‚ฌ ์—ฐ๊ตฌ ์ˆ˜ํ–‰ ์‹œ ์ฐธ๊ณ ํ•  ์ˆ˜ ์žˆ๋„๋ก ๊ตฌ์„ฑํ•˜์˜€๋‹ค.

ํ‘œ 3. ์ด์ƒํ–‰์œ„ ํƒ์ง€ ํ”Œ๋žซํผ ๋‹จ๊ณ„ ๋ณ„ ๊ธฐ์ˆ  ๋ฐœ์ „ ์ถ”์ด ๋ฐ ๊ด€๋ จ ๊ธฐ์ˆ  ์†Œ๊ฐœ

Table 3. Technological Evolution and Related Technologies for Anomaly Traffic Detection Platforms

๋‹จ๊ณ„ 1๋‹จ๊ณ„:
๋ฐ์ดํ„ฐ ์ˆ˜์ง‘
2๋‹จ๊ณ„:
๋ฐ์ดํ„ฐ ์ „์ฒ˜๋ฆฌ
3๋‹จ๊ณ„:
ํƒ์ง€๋ชจ๋ธ ์„ค๊ณ„
๋ฌธ์ œ์  ์œ„ํ˜‘ ํŠธ๋ž˜ํ”ฝ ํ™•๋ณด
๊ณผ๋„ํ•œ ๋ผ๋ฒจ๋ง ๋น„์šฉ
ํฌํŠธ ์šฐํšŒ ํŠธ๋ž˜ํ”ฝ
์œ ํšจ ํ”ผ์ฒ˜ ์„ ์ •
์‹ค์‹œ๊ฐ„ ์ฒ˜๋ฆฌ ์ง€์—ฐ
์ •ํ™•๋„์™€ ๊ฒฝ๋Ÿ‰ํ™” ์ƒ์ถฉ
๊ธฐ์ˆ ๋ฐœ์ „์ถ”์ด 1์„ธ๋Œ€
(DPI:์‹œ๊ทธ๋‹ˆ์ฒ˜ ๋งคํ•‘)
ํŒจํ‚ท ํŠธ๋ž˜ํ”ฝ
[1, 2, 3]
ํ—ค๋” ๋‹จ์ˆœ ์ถ”์ถœ[7] ๊ทœ์น™/์‹œ๊ทธ๋‹ˆ์ฒ˜[23]
2์„ธ๋Œ€
(ML/DL:์ด์ง„ ๋ถ„๋ฅ˜)
ํ”Œ๋กœ์šฐ/์„ธ์…˜ ํŠธ๋ž˜ํ”ฝ
[5, 12]
์ˆ˜๋™ ํŠน์ง• ๊ณตํ•™
[5, 12]
์ง€๋„ํ•™์Šต(ML/DL)
[5, 12, 14]
3์„ธ๋Œ€
(์ฆ๊ฐ•/ํ•ฉ์„ฑ:๋™์  ๋ถ„์„)
ํ•ฉ์„ฑ ํŠธ๋ž˜ํ”ฝ ์ƒ์„ฑ
[12, 20]
์ž๋™ ํŠน์ง• ์ถ”์ถœ
[4, 24, 25]
๋น„์ง€๋„/์•™์ƒ๋ธ”
[4, 24, 25, 26, 27]
4์„ธ๋Œ€
(LLM: ๋ฌธ๋งฅ ๊ธฐ๋ฐ˜ ๋ถ„์„)
LLM ํŠธ๋ž˜ํ”ฝ ํ•ฉ์„ฑ
[28, 29, 30]
ํ…์ŠคํŠธ ์ž„๋ฒ ๋”ฉ
[29, 31, 32]
LLM ์ถ”๋ก ยท์˜ˆ์ธก
[21, 22, 33, 34]
๊ด€๋ จ AI ๊ธฐ์ˆ  CTGAN, CopulaGAN,
WGAN-GP, VAE, TVAE
SHAP ๊ธฐ๋ฐ˜ ํŠน์ง• ํ•ด์„,
AutoML
TinyML, QLoRA,
๊ฒฝ๋Ÿ‰ Transformer,
RAG ๊ธฐ๋ฐ˜ ์ œ๋กœ์ƒท,RuleLLM
๊ด€๋ จ ์†”๋ฃจ์…˜ Zeek, Wireshark,
Splunk
ELK Stack, Pandas
Scikit-learn
PyTorch, TensorFlow,
ONNX Runtime

3.2 (1๋‹จ๊ณ„) ๋ฐ์ดํ„ฐ ์ˆ˜์ง‘

๊ณต๊ฒฉ ํŠธ๋ž˜ํ”ฝ์€ ์‹ค์šด์˜ ํ™˜๊ฒฝ์—์„œ ์ •์ƒ ํŠธ๋ž˜ํ”ฝ ๋Œ€๋น„ ์ž‘์€ ์ˆ˜๋Ÿ‰์œผ๋กœ ์กด์žฌํ•˜๋Š” ๊ทน๋‹จ์  ๋ถˆ๊ท ํ˜• ๋ฌธ์ œ๋ฅผ ๋‚ดํฌํ•œ๋‹ค. 1์„ธ๋Œ€ DPI ๊ธฐ๋ฐ˜ ์ˆ˜์ง‘์€ ์•”ํ˜ธํ™” ํ™˜๊ฒฝ์—์„œ ์ ์šฉ์ด ์–ด๋ ค์šฐ๋ฉฐ, 2์„ธ๋Œ€์˜ ํ”Œ๋กœ์šฐ ๋ฐ์ดํ„ฐ(NetFlow/IPFIX) ์ˆ˜์ง‘์ด ํ˜„์žฌ ์ฃผ๋ฅ˜๋ฅผ ์ด๋ฃจ๊ณ  ์žˆ๋‹ค.

๋„คํŠธ์›Œํฌ ํŠธ๋ž˜ํ”ฝ ์ˆ˜์ง‘ ๋ฐฉ์‹์€ ํฌ๊ฒŒ ํŒจํ‚ท(Packet) ๊ธฐ๋ฐ˜, ํ”Œ๋กœ์šฐ(Flow) ๊ธฐ๋ฐ˜, ์„ธ์…˜(Session) ๊ธฐ๋ฐ˜์œผ๋กœ ๊ตฌ๋ถ„๋œ๋‹ค. ํŒจํ‚ท ๊ธฐ๋ฐ˜ ๋ฐ์ดํ„ฐ๋Š” ์ƒ์„ธํ•œ ํ†ต์‹  ์ •๋ณด๋ฅผ ํฌํ•จํ•˜์ง€๋งŒ ๋ฐ์ดํ„ฐ ํฌ๊ธฐ๊ฐ€ ๋ฐฉ๋Œ€ํ•˜์—ฌ ๋ถ„์„์— ๋ง‰๋Œ€ํ•œ ์ปดํ“จํŒ… ์ž์›์ด ์š”๊ตฌ๋œ๋‹ค. ํ”Œ๋กœ์šฐ ๊ธฐ๋ฐ˜ ๋ฐ์ดํ„ฐ(NetFlow)๋Š” ์ถœ๋ฐœ์ง€ยท๋ชฉ์ ์ง€ ์ฃผ์†Œ๊ฐ€ ๋™์ผํ•œ ํŒจํ‚ท๋“ค์„ ํ•˜๋‚˜๋กœ ์ง‘๊ณ„ํ•˜์—ฌ ํฌ๊ธฐ๋ฅผ ๋Œ€ํญ ์ค„์ผ ์ˆ˜ ์žˆ์ง€๋งŒ, ๋‹จ๋ฐฉํ–ฅ์„ฑ์œผ๋กœ ์ธํ•ด ์ „์ฒด ์„ธ์…˜์˜ ๋งฅ๋ฝ์„ ํŒŒ์•…ํ•˜๊ธฐ ์–ด๋ ต๋‹ค๋Š” ํ•œ๊ณ„๊ฐ€ ์žˆ๋‹ค. ์ด๋ฅผ ๋ณด์™„ํ•˜๊ธฐ ์œ„ํ•ด ์„ธ์…˜ ๊ธฐ๋ฐ˜ ๋ฐ์ดํ„ฐ๋Š” ์–‘๋ฐฉํ–ฅ์˜ ์ธ๋ฐ”์šด๋“œ, ์•„์›ƒ๋ฐ”์šด๋“œ ํ”Œ๋กœ์šฐ๋ฅผ ํ•˜๋‚˜๋กœ ๊ฒฐํ•ฉํ•œ ๋ฐ์ดํ„ฐ๋กœ ํ‘œํ˜„ํ•˜์—ฌ, ํ”Œ๋กœ์šฐ ๋ฐ์ดํ„ฐ ๋‹จ๋…์œผ๋กœ๋Š” ํ‘œํ˜„ํ•˜์ง€ ๋ชปํ•˜๋Š” ํ†ต์‹ ์˜ ๋งฅ๋ฝ์„ ๋ถ„์„ํ•  ์ˆ˜ ์žˆ๋‹ค[12]. ์ด๋Ÿฌํ•œ ๋ฐฉ์‹์œผ๋กœ DARPA99 Week4 ๋ฐ์ดํ„ฐ์…‹ ๊ธฐ์ค€์œผ๋กœ ํŒจํ‚ท ๊ธฐ๋ฐ˜ ๋Œ€๋น„ ์„ธ์…˜ ๊ธฐ๋ฐ˜ ๋ฐ์ดํ„ฐ์˜ ํ–‰์„ 6,461,795๊ฑด์—์„œ 175,330๊ฑด์œผ๋กœ ์•ฝ 97% ๊ฐ์†Œ์‹œ์ผœ ํ•™์Šต ๋ฐ์ดํ„ฐ์˜ ์ฒ˜๋ฆฌ ํšจ์œจ์„ ๋Œ€ํญ ๊ฐœ์„ ํ•˜์˜€๋‹ค.

3์„ธ๋Œ€์—์„œ๋Š” GAN, VAE ๋“ฑ ์ƒ์„ฑํ˜• AI๋ฅผ ํ™œ์šฉํ•˜์—ฌ ์†Œ์ˆ˜ ํด๋ž˜์Šค ๊ณต๊ฒฉ ํŠธ๋ž˜ํ”ฝ์„ ํ•ฉ์„ฑํ•จ์œผ๋กœ์จ ์›๋ณธ ํ•™์Šต ๋ฐ์ดํ„ฐ๋ฅผ ์ด์šฉํ•˜๋Š” ๋ชจ๋ธ ํƒ์ง€ ์„ฑ๋Šฅ ๋Œ€๋น„ ์žฌํ˜„์œจ(Recall)์„ ์ตœ๋Œ€ 35% ํ–ฅ์ƒ์‹œํ‚ค๋Š” ์—ฐ๊ตฌ๊ฐ€ ์ง„ํ–‰๋˜์—ˆ๋‹ค[20].

์‹ค์ œ ๋„คํŠธ์›Œํฌ ํ™˜๊ฒฝ์—์„œ SSH ๊ณต๊ฒฉ ํŠธ๋ž˜ํ”ฝ์€ ์ •์ƒ ํŠธ๋ž˜ํ”ฝ ๋Œ€๋น„ ๊ทนํžˆ ํฌ์†Œํ•˜์—ฌ ์‹ฌ๊ฐํ•œ ํด๋ž˜์Šค ๋ถˆ๊ท ํ˜• ๋ฌธ์ œ๋ฅผ ์•ผ๊ธฐํ•œ๋‹ค. ๊ธฐ์กด ์—ฐ๊ตฌ์—์„œ๋Š” ์ด๋Ÿฌํ•œ ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด WGAN-GP(Wasserstein GAN with Gradient Penalty) ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ํ™œ์šฉํ•˜์—ฌ SSH ํŠธ๋ž˜ํ”ฝ์„ ํ•™์Šต ๋ฐ์ดํ„ฐ๋กœ ์ƒ์„ฑํ•˜์˜€๋‹ค. ๊ธฐ์กด GAN์€ ๋ชจ๋“œ ๋ถ•๊ดด(Mode Collapse)๋กœ ์ธํ•ด ์ œํ•œ๋œ ๋‹ค์–‘์„ฑ์˜ ์ƒ˜ํ”Œ๋งŒ ์ƒ์„ฑํ•˜๋Š” ํ•œ๊ณ„๊ฐ€ ์žˆ์œผ๋ฉฐ, WGAN์€ ๊ฐ€์ค‘์น˜ ํด๋ฆฌํ•‘์œผ๋กœ ์ธํ•œ ๊ธฐ์šธ๊ธฐ ์†Œ์‹ค ๋ฌธ์ œ๊ฐ€ ์กด์žฌํ•œ๋‹ค. ํ•˜์ง€๋งŒ ๊ธฐ์กด ์—ฐ๊ตฌ์—์„œ ์ ์šฉํ•œ WGAN-GP๋Š” EM(Earth Mover) ๊ฑฐ๋ฆฌ ๊ธฐ๋ฐ˜์˜ ๋ชฉ์  ํ•จ์ˆ˜์— ๊ฒฝ์‚ฌ ํŽ˜๋„ํ‹ฐ(Gradient Penalty)๋ฅผ ์ถ”๊ฐ€ํ•˜์—ฌ ์ด๋Ÿฌํ•œ ํ•œ๊ณ„๋ฅผ ๊ทน๋ณตํ•œ๋‹ค. ์‹คํ—˜์ ์œผ๋กœ GAN๊ณผ WGAN์ด Softmax ์ถœ๋ ฅ๊ฐ’ 0.75 ์ด์ƒ์˜ ์œ ํšจ ์ƒ˜ํ”Œ์„ ์ „ํ˜€ ์ƒ์„ฑํ•˜์ง€ ๋ชปํ•œ ๋ฐ˜๋ฉด, WGAN-GP๋Š” ์ถฉ๋ถ„ํ•œ ์ˆ˜์˜ ์œ ํšจ ์ƒ˜ํ”Œ์„ ์ƒ์„ฑํ•˜์—ฌ ์šฐ์ˆ˜ํ•œ ํ•ฉ์„ฑ ์„ฑ๋Šฅ์„ ์ž…์ฆํ•˜์˜€๋‹ค. ๋˜ํ•œ Generator Loss ๊ฐ’์— ๋”ฐ๋ผ ์ƒ˜ํ”Œ์˜ ์œ ์‚ฌ๋„๋ฅผ ํ‰๊ฐ€ํ•˜์—ฌ ๊ณ ์†์‹คยท์ €์†์‹ค ๋ฒ”์œ„ ์ƒ˜ํ”Œ์„ ํ˜ผํ•ฉํ•˜์—ฌ ์‚ฌ์šฉํ•˜๋Š” ์ƒ˜ํ”Œ ์„ ๋ณ„ ์ „๋žต์„ ํ†ตํ•ด F1โ€“Score 0.999๋ฅผ ๋‹ฌ์„ฑํ•˜์˜€๋‹ค[12].

4์„ธ๋Œ€์—์„œ๋Š” LLM์ด ๋‹ค์–‘ํ•œ ๊ณต๊ฒฉ ์‹œ๋‚˜๋ฆฌ์˜ค๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ ์ด์ƒ ํ–‰์œ„ ํŠธ๋ž˜ํ”ฝ์„ ์ƒ์„ฑํ•˜๋Š” ๋ฐฉํ–ฅ์œผ๋กœ ์ง„ํ™”๊ฐ€ ์˜ˆ์ƒ๋œ๋‹ค. LLM์„ ํ™œ์šฉํ•˜์—ฌ ๋‹ค์–‘ํ•œ ๊ณต๊ฒฉ ์‹œ๋‚˜๋ฆฌ์˜ค๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ ๋ ˆ์ด๋ธ”์„ ๊ณ ๋ คํ•˜์ง€ ์•Š๊ณ ๋„ ์ง์ ‘ ํŠธ๋ž˜ํ”ฝ์„ ํ•ฉ์„ฑํ•˜๋Š” ๋ฐฉํ–ฅ์œผ๋กœ ์ง„ํ™”๊ฐ€ ์˜ˆ์ƒ๋œ๋‹ค. ET-BERT๋Š” ๋Œ€๊ทœ๋ชจ ๋น„๋ ˆ์ด๋ธ” ํŠธ๋ž˜ํ”ฝ์œผ๋กœ๋ถ€ํ„ฐ BERT ๊ธฐ๋ฐ˜ ๋ฐ์ดํ„ฐ๊ทธ๋žจ ํ‘œํ˜„์„ ์‚ฌ์ „ ํ•™์Šตํ•˜์—ฌ ์†Œ๋Ÿ‰์˜ ๋ ˆ์ด๋ธ”๋งŒ์œผ๋กœ ์•”ํ˜ธํ™” ํŠธ๋ž˜ํ”ฝ ๋ถ„๋ฅ˜์—์„œ F1โ€“Score 99.2%๋ฅผ ๋‹ฌ์„ฑํ•˜์˜€๋‹ค[29]. TrafficLLM์€ ํŠธ๋ž˜ํ”ฝ ์ „์šฉ ํ† ํฌ๋‚˜์ด์ €์™€ ์ด์ค‘ ๋‹จ๊ณ„ ํŒŒ์ธํŠœ๋‹ ํŒŒ์ดํ”„๋ผ์ธ์„ ํ†ตํ•ด ํƒ์ง€์™€ ํ•ฉ์„ฑ์„ ๋™์‹œ์— ์ง€์›ํ•˜๋ฉฐ, ๊ธฐ์กด GAN ๊ธฐ๋ฐ˜ ๋ฉ”ํƒ€๋ฐ์ดํ„ฐ ํ•ฉ์„ฑ์˜ ํ•œ๊ณ„๋ฅผ ๋„˜์–ด ์ •ํ™•ํ•œ ํ—ค๋”์™€ ํŽ˜์ด๋กœ๋“œ๋ฅผ ํฌํ•จํ•œ ์ „์ฒด ํŒจํ‚ท ํ•ฉ์„ฑ์ด ๊ฐ€๋Šฅํ•จ์„ ์ž…์ฆํ•˜์˜€๋‹ค[30]. ํŠนํžˆ Knowledge-to-Data ์—ฐ๊ตฌ๋Š” ์‹ค์ œ ๋ฐ์ดํ„ฐ ์ƒ˜ํ”Œ์ด๋‚˜ ํ…Œ์ŠคํŠธ๋ฒ ๋“œ ์—†์ด LLM์˜ ๋„๋ฉ”์ธ ์ง€์‹๋งŒ์œผ๋กœ ํ”„๋กœํ† ์ฝœ ์ œ์•ฝ ์กฐ๊ฑด์„ ์ค€์ˆ˜ํ•˜๋Š” ๊ตฌ์กฐํ™”๋œ ํŠธ๋ž˜ํ”ฝ์„ ํ•ฉ์„ฑํ•  ์ˆ˜ ์žˆ์Œ์„ ๋ณด์—ฌ์ฃผ์—ˆ์œผ๋ฉฐ, ์ด๋Š” ๋ ˆ์ด๋ธ” ํš๋“์ด ๊ทนํžˆ ์–ด๋ ค์šด ํ™˜๊ฒฝ์—์„œ์˜ ๋ฐ์ดํ„ฐ ๋ณด๊ฐ• ์ „๋žต์œผ๋กœ์„œ ์ฃผ๋ชฉ๋œ๋‹ค[28].

4์„ธ๋Œ€์—์„œ LLM ๊ธฐ์ˆ ์„ ์ ์šฉํ•  ์‹œ ๊ณ ๋ คํ•  ์‚ฌํ•ญ์œผ๋กœ๋Š” ์šฐ์„  ํ• ๋ฃจ์‹œ๋„ค์ด์…˜์— ๋”ฐ๋ฅธ ์‹ ๋ขฐ์„ฑ ๊ฒฐ์—ฌ ๋ฌธ์ œ๊ฐ€ ์žˆ๋‹ค. LLM์ด ์‹ค์ œ ์กด์žฌํ•˜์ง€ ์•Š๋Š” ๊ฐ€์ƒ์˜ ๊ณต๊ฒฉ ํŒจํ„ด์ด๋‚˜ ํ”„๋กœํ† ์ฝœ ๊ทœ๊ฒฉ์— ๋ถ€ํ•ฉํ•˜์ง€ ์•Š๋Š” ํŠธ๋ž˜ํ”ฝ ์ƒ์„ฑ ๋กœ์ง์„ ์ถœ๋ ฅํ•˜๋Š” ํ• ๋ฃจ์‹œ๋„ค์ด์…˜ ๋ฌธ์ œ๋Š” ์น˜๋ช…์  ์˜คํƒ์œผ๋กœ ์ด์–ด์งˆ ์ˆ˜ ์žˆ๋‹ค. ์ด๋Ÿฌํ•œ ์˜ค๋ฅ˜๋ฅผ ๋ฐฉ์ง€ํ•˜๊ธฐ ์œ„ํ•ด, LLM ๋ชจ๋ธ์—๋งŒ ์˜์กดํ•˜์ง€ ์•Š๊ณ  ์ตœ์‹  ์œ„ํ˜‘ ์ธํ…”๋ฆฌ์ „์Šค๋ฅผ ์ •๋ณด๋ฅผ ์‹ค์‹œ๊ฐ„์œผ๋กœ ์—…๋ฐ์ดํŠธํ•˜๋Š” RAG๋ฅผ ๊ณ ๋ คํ•  ์ˆ˜ ์žˆ์œผ๋ฉฐ, LLM์ด ์„ค๊ณ„ํ•œ ํŠธ๋ž˜ํ”ฝ ์ƒ์„ฑ ๋กœ์ง์ด ์‹ค์ œ ๋„คํŠธ์›Œํฌ ๊ทœ๊ฒฉ์— ๋งž๋Š”์ง€ ์ปดํ“จํ„ฐ๊ฐ€ ์ž๋™์œผ๋กœ ๊ฒ€์‚ฌํ•˜๋Š” ๋ชจ๋“ˆ์„ ์ถ”๊ฐ€ํ•˜์—ฌ ์ƒ์„ฑ ๋ฐ์ดํ„ฐ์˜ ์‹ ๋ขฐ์„ฑ์„ ๋†’์ด๊ณ ์ž ํ•œ๋‹ค. ์ถ”๊ฐ€๋กœ ๋„คํŠธ์›Œํฌ ํŒจํ‚ท์˜ ์ˆ˜์น˜ ์‹œํ€€์Šค ๋ฐ์ดํ„ฐ๋Š” ์ผ๋ฐ˜์ ์ธ ์ž์—ฐ์–ด ํ…์ŠคํŠธ์™€ ๊ตฌ์กฐ์ ์œผ๋กœ ์ด์งˆ์ ์ด๋ฏ€๋กœ ํ‘œ์ค€ ํ† ํฌ๋‚˜์ด์ €๋กœ๋Š” ์˜๋ฏธ ์žˆ๋Š” ํ‘œํ˜„ ํ•™์Šต์ด ์ œํ•œ๋œ๋‹ค. ๋„คํŠธ์›Œํฌ ํŠธ๋ž˜ํ”ฝ์— ํŠนํ™”๋œ ๋„๋ฉ”์ธ ์ „์šฉ ํ† ํฌ๋‚˜์ด์ € ์„ค๊ณ„์™€ Structured-to-Text ๋ณ€ํ™˜ ์ „๋žต์„ ํ†ตํ•ด LLM์ด ํŠธ๋ž˜ํ”ฝ ๋ฐ์ดํ„ฐ์˜ ์‹œ๋งจํ‹ฑ ํŒจํ„ด์„ ํšจ๊ณผ์ ์œผ๋กœ ํ•™์Šตํ•  ์ˆ˜ ์žˆ๋„๋ก ํ•˜๋Š” ๊ฒƒ ๋˜ํ•œ ํ•„์ˆ˜์ ์œผ๋กœ ๊ณ ๋ ค๋˜์–ด์•ผ ํ•œ๋‹ค.

3.3 (2๋‹จ๊ณ„) ๋ฐ์ดํ„ฐ ์ „์ฒ˜๋ฆฌ

IP ์ฃผ์†Œ, ํฌํŠธ ๋ฒˆํ˜ธ ๋“ฑ ๊ฐ€๋ณ€ ์ •๋ณด๋ฅผ ์ œ๊ฑฐํ•˜๊ณ  ํŒจํ‚ท ๊ธธ์ด ์‹œํ€€์Šค, ํŒจํ‚ท ๋„์ฐฉ ๊ฐ„๊ฒฌ, ์„ธ์…˜ ๋ฐฉํ–ฅ ๋น„์œจ ๋“ฑ ํฌํŠธ ๋…๋ฆฝ์  ํ”ผ์ฒ˜๋ฅผ ์ถ”์ถœํ•˜๋Š” ๊ฒƒ์ด ํ•ต์‹ฌ์ด๋‹ค.

์ „์ฒ˜๋ฆฌ ๋‹จ๊ณ„์—์„œ ์œ ํšจ ํ”ผ์ฒ˜ ์„ ์ •์€ ํƒ์ง€ ์„ฑ๋Šฅ์— ์ง๊ฒฐ๋˜๋Š” ํ•ต์‹ฌ ๊ณผ์ •์ด๋‹ค. ๊ธฐ์กด ์—ฐ๊ตฌ์—์„œ๋Š” ๊ฒฐ์ • ํŠธ๋ฆฌ๋ฅผ ํ™œ์šฉํ•˜์—ฌ ํ”ผ์ฒ˜ ์ค‘์š”๋„๋ฅผ ๋ถ„์„ํ•˜๊ณ , ์˜์‚ฌ๊ฒฐ์ • ๋…ธ๋“œ์—์„œ ๋นˆ๋ฒˆํ•˜๊ฒŒ ์‚ฌ์šฉ๋œ ํ”ผ์ฒ˜๋ฅผ ์ฃผ์š” ํ”ผ์ฒ˜๋กœ ์„ ์ •ํ•˜์˜€๋‹ค[5]. ์†ก์ˆ˜์‹  ๋น„์œจ, ์„ธ์…˜ ์‹œ๊ฐ„, ํŒจํ‚ท ๋„์ฐฉ ๊ฐ„๊ฒฉ์˜ ํ‰๊ท , ํŒจํ‚ท ๋„์ฐฉ ๊ฐ„๊ฒฉ์˜ ๋ถ„์‚ฐ ์ด์ƒ 4๊ฐ€์ง€ ํ”ผ์ฒ˜๊ฐ€ ์„ ์ •ํ•˜์˜€์œผ๋ฉฐ, ๋‚˜๋จธ์ง€ ํ”ผ์ฒ˜๋“ค์€ ์ฃผ์„ฑ๋ถ„ ๋ถ„์„(Principal Component Analysis, PCA)๋ฅผ ์ ์šฉํ•˜์—ฌ ์ฃผ์„ฑ๋ถ„(PC) ๊ฐ’์œผ๋กœ ๋Œ€์ฒดํ•˜์˜€๋‹ค. ํŠนํžˆ ํŒจํ‚ท ๋„์ฐฉ ๊ฐ„๊ฒฉ ์‹œ๊ฐ„์˜ ํ‰๊ท ๊ณผ ๋ถ„์‚ฐ์€ ๊ธฐ์กด ์—ฐ๊ตฌ์—์„œ๋Š” ์„ธ์…˜ ๊ธฐ๋ฐ˜ ๋ฐ์ดํ„ฐ์— ์ ์šฉ๋˜์ง€ ์•Š์•˜๋˜ ์‹ ๊ทœ ํ”ผ์ฒ˜๋กœ, ํ”Œ๋กœ์šฐ ๊ธฐ๋ฐ˜์˜ Duration์ด ๋‹จ๋ฐฉํ–ฅ ํŒจํ‚ท ํŠน์„ฑ๋งŒ ๋ฐ˜์˜ํ•˜๋Š” ํ•œ๊ณ„๋ฅผ ๋ณด์™„ํ•œ๋‹ค. IPA ์‹œ๊ฐ„์€ ์†ก์ˆ˜์‹  ๋ฐฉํ–ฅ์— ๋ฌด๊ด€ํ•˜๊ฒŒ ์—ฐ์† ํŒจํ‚ท ๊ฐ„์˜ ์‹œ๊ฐ„ ๊ฐ„๊ฒฉ์„ ์ธก์ •ํ•˜์—ฌ ์‘์šฉ ๊ณ„์ธต์˜ ์‘๋‹ต ํŠน์„ฑ์„ ํฌ์ฐฉํ•˜๋ฉฐ, ์ด ํ”ผ์ฒ˜๋ฅผ ์ถ”๊ฐ€ํ•จ์œผ๋กœ์จ ๊ธฐ์กด ๋ชจ๋ธ ๋Œ€๋น„ ์žฌํ˜„์œจ 11.8%, ์ •๋ฐ€๋„ 50.1%๊ฐ€ ํ–ฅ์ƒ๋˜์—ˆ๋‹ค. ๋˜ํ•œ ์ผ๋ถ€ ํ”ผ์ฒ˜์— ๋กœ๊ทธ(Log) ํ•จ์ˆ˜๋ฅผ ์ ์šฉํ•˜์—ฌ ๋ถ„ํฌ ๋ฐ€๋„๋ฅผ ์ค„์ด๊ณ  ์„ ํ˜•์  ํŠน์„ฑ์„ ๊ฐ•ํ™”ํ•˜์˜€์œผ๋ฉฐ, ์ „์ฒด ํ”ผ์ฒ˜๋ฅผ 0~100 ๋ฒ”์œ„๋กœ ์ •๊ทœํ™”ํ•˜์—ฌ ํ•™์Šต ์•ˆ์ •์„ฑ์„ ํ™•๋ณดํ•˜์˜€๋‹ค[12].

3์„ธ๋Œ€์—์„œ๋Š” ์˜คํ† ์ธ์ฝ”๋”๊ฐ€ ์›์‹œ ํŠธ๋ž˜ํ”ฝ์œผ๋กœ๋ถ€ํ„ฐ ์˜๋ฏธ ์žˆ๋Š” ์ž ์žฌ ํ‘œํ˜„์„ ํ•™์Šตํ•˜๋Š” ์ž๋™ ํŠน์ง• ์ถ”์ถœ์„ ์ฑ„ํƒํ•˜๋ฉฐ, SHapley Additive exPlanations(SHAP) ๊ธฐ๋ฐ˜ ํŠน์ง• ์ค‘์š”๋„ ๋ถ„์„์œผ๋กœ ํ•ด์„ ๊ฐ€๋Šฅ์„ฑ์„ ์ œ๊ณ ํ•˜์˜€๋‹ค[13]. ๋‚˜์•„๊ฐ€ 4์„ธ๋Œ€์—์„œ๋Š” ํ๋ฆ„ ํ†ต๊ณ„๋ฅผ ์ž์—ฐ์–ด ํ”„๋กฌํ”„ํŠธ ํ˜•ํƒœ๋กœ ๋ณ€ํ™˜ํ•˜์—ฌ LLM์ด ๋งฅ๋ฝ์„ ์ดํ•ดํ•˜๋„๋ก ๊ตฌ์„ฑํ•˜๊ฒŒ ๋œ๋‹ค.

4์„ธ๋Œ€ ๋ฐ์ดํ„ฐ ์ „์ฒ˜๋ฆฌ ๋ฐฉ์‹์œผ๋กœ ํ…์ŠคํŠธ ์ž„๋ฒ ๋”ฉ ๊ธฐ๋ฐ˜ ์ „์ฒ˜๋ฆฌ๋Š” ํ”ผ์ฒ˜ ๊ฐ„ ๊ด€๊ณ„์™€ ๋งฅ๋ฝ์„ ํ•จ๊ป˜ ์„ค๋ช…ํ•  ์ˆ˜ ์žˆ๋‹ค๋Š” ์ ์—์„œ ๊ธฐ์กด ์ˆ˜์น˜ ๋ฒกํ„ฐ ๋ฐฉ์‹ ๋Œ€๋น„ ์žฅ์ ์„ ๊ฐ€์ง„๋‹ค. ๋‹ค์Œ์˜ 3๊ฐ€์ง€ ์—ฐ๊ตฌ๋Š” ์‹ค์ œ ์žฅ์ ์„ ์ˆ˜์น˜์ ์œผ๋กœ ๋ณด์—ฌ์ฃผ๊ณ  ์žˆ๋‹ค. ET-BERT๋Š” ๋Œ€๊ทœ๋ชจ ๋น„๋ ˆ์ด๋ธ” ํŠธ๋ž˜ํ”ฝ์œผ๋กœ๋ถ€ํ„ฐ BERT ๊ธฐ๋ฐ˜ ๋ฐ์ดํ„ฐ๊ทธ๋žจ ํ‘œํ˜„์„ ์‚ฌ์ „ ํ•™์Šตํ•จ์œผ๋กœ์จ ์ˆ˜๋™ ํ”ผ์ฒ˜ ์„ค๊ณ„ ์—†์ด๋„ ์•”ํ˜ธํ™” ํŠธ๋ž˜ํ”ฝ ๋ถ„๋ฅ˜์—์„œ F1โ€“Score 98.9%๋ฅผ ๋‹ฌ์„ฑํ•˜์—ฌ ์ „์ด ํ•™์Šต์˜ ์œ ํšจ์„ฑ์„ ์ž…์ฆํ•˜์˜€๋‹ค[29]. ๋‘˜์งธ, GPT-2 ๋ฐ LLaMA ๊ธฐ๋ฐ˜ TrafficLLM์€ ์ตœ์†Œํ•œ์˜ ํŒŒ์ธํŠœ๋‹์œผ๋กœ CNN ๊ธฐ๋ฐ˜ ๋ชจ๋ธ ๋Œ€๋น„ ์ตœ๋Œ€ 21.5%์˜ ์„ฑ๋Šฅ ํ–ฅ์ƒ์„ ๋‹ฌ์„ฑํ•˜์—ฌ ์•Œ๋ ค์ง€์ง€ ์•Š์€ ํŠธ๋ž˜ํ”ฝ์— ๋Œ€ํ•œ ์ผ๋ฐ˜ํ™”๊ฐ€ ๊ฐ€๋Šฅํ•˜๋‹ค๋Š” ๊ฒƒ์„ ์ฆ๋ช…ํ•˜์˜€๋‹ค[31]. ์…‹์งธ, MET-LLM์€ ๋„๋ฉ”์ธ ํŠนํ™” ํ† ํฌ๋‚˜์ด์ €๋ฅผ ํ†ตํ•ด ์ž์—ฐ์–ด์™€ ๋„คํŠธ์›Œํฌ ๋ฐ์ดํ„ฐ ๊ฐ„์˜ ๋ชจ๋‹ฌ๋ฆฌํ‹ฐ ๊ฒฉ์ฐจ๋ฅผ ํ•ด์†Œํ•˜๊ณ  ์•…์„ฑ ํ๋ฆ„๊ณผ ์ •์ƒ ํ๋ฆ„์˜ ๋งฅ๋ฝ์  ์ฐจ์ด๋ฅผ ํšจ๊ณผ์ ์œผ๋กœ ํฌ์ฐฉํ•˜์˜€๋‹ค[32]. ์ด๋Ÿฌํ•œ ์—ฐ๊ตฌ๋“ค์€ ํ…์ŠคํŠธ ์ž„๋ฒ ๋”ฉ ๊ธฐ๋ฐ˜ ์ „์ฒ˜๋ฆฌ๊ฐ€ ์ˆ˜๋™ ํ”ผ์ฒ˜ ๊ณตํ•™์˜ ํ•œ๊ณ„๋ฅผ ๊ทน๋ณตํ•˜๊ณ  ์•Œ๋ ค์ง€์ง€ ์•Š์€ ์œ„ํ˜‘์— ๋Œ€ํ•œ ๋ฒ”์šฉ์  ํƒ์ง€ ์„ฑ๋Šฅ์„ ์ œ๊ณตํ•˜๋Š” ํ•ต์‹ฌ ์ „์ฒ˜๋ฆฌ ๋ฐฉ์‹์œผ๋กœ ์‚ฌ์šฉ๋  ์ˆ˜ ์žˆ์Œ์„ ๋ณด์—ฌ์ค€๋‹ค.

3.4 (3๋‹จ๊ณ„) ํƒ์ง€๋ชจ๋ธ ์„ค๊ณ„

1์„ธ๋Œ€ ์‹œ๊ทธ๋‹ˆ์ฒ˜ ๋งคํ•‘ ํƒ์ง€ ๊ธฐ์ˆ ์€ Snort ๋“ฑ๊ณผ ๊ฐ™์€ ๋„๊ตฌ๋ฅผ ํ™œ์šฉํ•˜์—ฌ ์•Œ๋ ค์ง„ ๊ณต๊ฒฉ ํŒจํ„ด์„ ํŽ˜์ด๋กœ๋“œ ๊ธฐ๋ฐ˜ ์‹œ๊ทธ๋‹ˆ์ฒ˜์™€ ๋Œ€์กฐํ•˜๋Š” ์‹ฌ์ธต ํŒจํ‚ท ๊ฒ€์‚ฌ ๋ฐฉ์‹์— ์˜์กดํ•˜์˜€์œผ๋‚˜, ํŠธ๋ž˜ํ”ฝ ์•”ํ˜ธํ™”๊ฐ€ ํ™•์‚ฐ๋จ์— ๋”ฐ๋ผ ๊ฐ€์‹œ์„ฑ ํ™•๋ณด์— ํ•œ๊ณ„๋ฅผ ๋“œ๋Ÿฌ๋ƒˆ๋‹ค. 2์„ธ๋Œ€ ๋ถ€ํ„ฐ๋Š” ์‹ค์‹œ๊ฐ„ ํƒ์ง€๋ฅผ ์œ„ํ•ด ๋†’์€ ์ •ํ™•๋„์™€ ๋‚ฎ์€ ์ง€์—ฐ ์‹œ๊ฐ„์„ ๋™์‹œ์— ๋งŒ์กฑํ•ด์•ผ ํ•˜๋Š” ๊ทผ๋ณธ์  ์ƒ์ถฉ ๊ด€๊ณ„๊ฐ€ ๋ณธ๊ฒฉํ™” ๋œ๋‹ค. 2์„ธ๋Œ€ ํƒ์ง€ ๋ชจ๋ธ์˜ ๊ฒฝ์šฐ ์•”ํ˜ธํ™” ํ•ด์ œ ์—†์ด๋„ ์ด์ƒ ์ง•ํ›„๋ฅผ ํฌ์ฐฉํ•˜๊ธฐ ์œ„ํ•ด ํŒจํ‚ท ๊ธธ์ด ๋ถ„ํฌ์™€ ๋„์ฐฉ ์‹œ๊ฐ„ ๊ฐ„๊ฒฉ ๋“ฑ ์™ธํ˜•์  ํ†ต๊ณ„ ํ”ผ์ฒ˜๋ฅผ ์ •์ƒ๊ณผ ๊ณต๊ฒฉ ํŠธ๋ž˜ํ”ฝ์„ ๋ถ„๋ฅ˜ํ•˜๋Š” ๋จธ์‹ ๋Ÿฌ๋‹ ๊ธฐ๋ฐ˜ ํ–‰๋™ ๋ถ„์„ ๊ธฐ์ˆ ๋กœ ๋ฐœ์ „ํ•˜์˜€๋‹ค. Random Forest, CNN-LSTM ๋“ฑ๊ณผ ๊ฐ™์€ 2์„ธ๋Œ€ ์ง€๋„ ํ•™์Šต ๊ธฐ๋ฐ˜ ๋ถ„๋ฅ˜ ๋ชจ๋ธ์€ 95% ์ด์ƒ์˜ ํƒ์ง€ ์ •ํ™•๋„๋ฅผ ๋‹ฌ์„ฑํ•˜๊ณ  ์žˆ์œผ๋ฉฐ, 3์„ธ๋Œ€์—์„œ๋Š” ์•™์ƒ๋ธ”๊ณผ ๋น„์ง€๋„ ํ•™์Šต์„ ๊ฒฐํ•ฉํ•˜์—ฌ ๋ฏธ์ง€ ๊ณต๊ฒฉ์— ๋Œ€ํ•œ ํƒ์ง€ ๋ฒ”์šฉ์„ฑ์„ ์ถ”๊ฐ€๋กœ ํ™•๋ณดํ•˜์˜€๋‹ค.

4์„ธ๋Œ€์—์„œ๋Š” QLoRa์™€ ๊ฐ™์€ ๊ฒฝ๋Ÿ‰ LLM(Llama-1B ๋“ฑ)์„ ๋„์ž…ํ•˜์—ฌ ๋ฉ”๋ชจ๋ฆฌ ์‚ฌ์šฉ๋Ÿ‰์„ ์ ˆ๊ฐํ•˜๋ฉด์„œ ๋†’์€ ์ •ํ™•๋„๋ฅผ ์œ ์ง€ํ•˜๋Š” ์—ฐ๊ตฌ๊ฐ€ ์ง„ํ–‰ ์ค‘์ด๋‹ค. 4์„ธ๋Œ€์—์„œ๋Š” LLM์ด ์ด์ƒ ํŠธ๋ž˜ํ”ฝ ํƒ์ง€์— ์ง์ ‘ ํˆฌ์ž…๋˜๋Š” ๋ฐฉํ–ฅ์œผ๋กœ ์—ฐ๊ตฌ๊ฐ€ ์ง„ํ–‰๋˜๊ณ  ์žˆ๋‹ค. ๊ฒฝ๋Ÿ‰ํ™” ์ธก๋ฉด์—์„œ๋Š” QLoRA ํŒŒ์ธํŠœ๋‹๊ณผ RAG๋ฅผ ๊ฒฐํ•ฉํ•œ LLaMA-1B ๋ชจ๋ธ์ด ์žฌํ•™์Šต ์—†์ด ๋ฏธ๊ด€์ธก ๊ณต๊ฒฉ์„ ์ œ๋กœ์ƒท์œผ๋กœ ํƒ์ง€ํ•˜๋Š” ๊ฐ€๋Šฅ์„ฑ์„ ์‹ค์ฆํ•˜์˜€๋‹ค[34]. Lin ๋“ฑ[21]์€ LLM ๊ธฐ๋ฐ˜ ํƒ์ง€ ๊ทœ์น™ ์ž๋™ ์ƒ์„ฑ ํ”„๋ ˆ์ž„์›Œํฌ์ธ RuleLLM์„ ์ œ์•ˆํ•˜์—ฌ ์ „๋ฌธ๊ฐ€ ๊ฐœ์ž… ์—†์ด 91.8%์˜ ์ •ํ™•๋„๋กœ ํƒ์ง€ ๊ทœ์น™์„ ์ƒ์„ฑํ•˜์˜€์œผ๋ฉฐ, Lian ๋“ฑ[33]์€ ์ƒˆ๋กœ์šด ๊ณต๊ฒฉ์˜ ๊ฐœ๋… ์ฆ๋ช…๋งŒ์œผ๋กœ๋„ IDS ๊ทœ์น™ยท์„ค๋ช…ยท๋ฐฉ์–ด ๊ถŒ๊ณ ๋ฅผ ๋™์‹œ์— ์ถœ๋ ฅํ•˜๋Š” RuleMaster+๋ฅผ ์ œ์‹œํ•˜์˜€๋‹ค.

4. ๊ฐœ๋… ๊ฒ€์ฆ ์‹คํ—˜

4.1 ์‹คํ—˜ ๋ชฉ์  ๋ฐ ์„ค๊ณ„

๋ณธ ์žฅ์—์„œ๋Š” 4์„ธ๋Œ€ LLM ๊ธฐ๋ฐ˜ ํŠธ๋ž˜ํ”ฝ ํ•ฉ์„ฑ ๊ธฐ๋ฒ•์˜ ์‹คํ˜„ ๊ฐ€๋Šฅ์„ฑ์„ ๊ฐœ๋… ์ฆ๋ช… ์ˆ˜์ค€์—์„œ ๊ฒ€์ฆํ•˜์˜€๋‹ค. ๊ตฌ์ฒด์ ์œผ๋กœ, ์‹ค์ œ SSH ํŠธ๋ž˜ํ”ฝ ํ”ผ์ฒ˜๋ฅผ Structured to Text(S2T) ๋ฐฉ์‹์œผ๋กœ ์ž์—ฐ์–ด ํ”„๋กฌํ”„ํŠธ๋กœ ๋ณ€ํ™˜ํ•˜์—ฌ LLM์— ์ž…๋ ฅํ•˜๊ณ , LLM์ด ์ถœ๋ ฅํ•œ ํ…์ŠคํŠธ๋ฅผ ๊ตฌ์กฐํ™” ๋ฐ์ดํ„ฐ๋กœ ํŒŒ์‹ฑํ•˜๋Š” ๋ฐฉ์‹์œผ๋กœ ํ•ฉ์„ฑ ์ƒ˜ํ”Œ์„ ์ƒ์„ฑํ•  ์ˆ˜ ์žˆ๋Š”์ง€๋ฅผ ํ™•์ธํ•˜์˜€๋‹ค.

ํ‘œ 4. ์›๋ณธ ๋ฐ์ดํ„ฐ์…‹ ํ•ญ๋ชฉ ์„ค๋ช…

Table 4. Feature description of Original Dataset

Dataset feature Description
count total connect
(Total Connect)
Number of connections to the same Destination IP
count connect IP
(Connect IP)
Number of source IP connected to the same destination IP
count avg connect
(Avg Connect)
Average number of connections per IP to the same destination IP
speed transmit BPS
(Speed BPS)
Average transfer speed
byte send
(CS Byte)
Transmit data size
ratio trans receive
(T/R Ratio)
Send byte byte/Receive byte
time taken Time per session
mean of IPA time
(IPA Mean)
Mean of inter-packet arrival (IPA) time
var of IPA time
(IPA Var)
Variance of inter-packet arrival (IPA) time label

์‹คํ—˜์— ์‚ฌ์šฉ๋œ ์›๋ณธ ๋ฐ์ดํ„ฐ๋Š” ์„ ํ–‰ ์—ฐ๊ตฌ[12]์—์„œ ๊ตฌ์ถ•ํ•œ DARPA99 ๊ธฐ๋ฐ˜ ์„ธ์…˜ ๋ฐ์ดํ„ฐ์…‹์ด๋ฉฐ, Table 4๋Š” ํ•ด๋‹น ๋ฐ์ดํ„ฐ์…‹์˜ ํ”ผ์ฒ˜์— ๋Œ€ํ•ด ์„ค๋ช…ํ•œ๋‹ค. DARPA 99์˜ ํŠธ๋ ˆ์ด๋‹ ๋ฐ์ดํ„ฐ์…‹์—์„œ ์›๋ณธ SSH ํŠธ๋ž˜ํ”ฝ 1,000๊ฑด์„ ๋žœ๋คํ•˜๊ฒŒ ์„ ํƒํ•˜์˜€์œผ๋ฉฐ, ์ด๋ฅผ WGAN-GP ํ•ฉ์„ฑ ์ƒ˜ํ”Œ 1,000๊ฑด, LLM์œผ๋กœ ํ•ฉ์„ฑํ•œ ์ƒ˜ํ”Œ 1,000๊ฑด๊ณผ ํ•จ๊ป˜ ๋ฐ์ดํ„ฐ์˜ ๋ถ„ํฌ๋ฅผ ๋น„๊ตํ•˜์˜€๋‹ค.

๋ณธ ์‹คํ—˜์—์„œ LLM ๊ธฐ๋ฐ˜ ํŠธ๋ž˜ํ”ฝ ํ•ฉ์„ฑ ์‹œ์—๋Š” ๊ณ ๋น„์šฉ ์‹œ์Šคํ…œ์„ ํ•„์š”๋กœ ํ•˜๋Š” ๋Œ€ํ˜• ๋ชจ๋ธ ์—†์ด๋„ ์ผ๋ฐ˜์ ์ธ LLM ๋ชจ๋ธ๋กœ๋„ ํ†ต๊ณ„์ ์œผ๋กœ ์œ ํšจํ•œ ํ•ฉ์„ฑ๋ฐ์ดํ„ฐ๋ฅผ ๊ฒ€์ฆํ•˜๋Š” ๊ฒƒ์„ ๊ฒ€์ฆํ•˜๊ธฐ ์œ„ํ•˜์—ฌ llama-3.2-3b-instruct์„ ์‚ฌ์šฉํ•˜์˜€๋‹ค.

LLM ๋ชจ๋ธ์„ ๊ธฐ๋ฐ˜์œผ๋กœ ํ•ฉ์„ฑ ํŠธ๋ž˜ํ”ฝ์„ ์ƒ์„ฑํ•˜์—ฌ ํƒ์ง€ ๋ชจ๋ธ์— ์ ์šฉํ•˜๋Š” ๊ณผ์ •์€ โ€œ1๋‹จ๊ณ„:์ˆ˜์ง‘ ๋ฐ์ดํ„ฐ์…‹ ํ”ผ์ฒ˜ ๋ถ„์„โ€, โ€œ2๋‹จ๊ณ„:๊ฐ ํ”ผ์ฒ˜์˜ ํ†ต๊ณ„์  ๋ฒ”์œ„ ์ •์˜โ€, โ€œ3๋‹จ๊ณ„:์‹ ๊ทœ๋กœ ์ƒ์„ฑํ•œ ํ”ผ์ฒ˜์— ๋Œ€ํ•œ ์„ค๋ช… ๋ฐ ์ˆ˜์‹ ํ‘œํ˜„โ€, โ€œ4๋‹จ๊ณ„:์ƒ์„ฑํ•œ ํ•ฉ์„ฑ ๋ฐ์ดํ„ฐ์˜ ๋ถ„ํฌ ๋น„๊ตโ€, โ€œ5๋‹จ๊ณ„:ํ›ˆ๋ จ ๋ฐ์ดํ„ฐ๋„ท์„ ๋ณด๊ฐ•ํ•˜์—ฌ ํƒ์ง€ ๋ชจ๋ธ์— ์ ์šฉโ€์œผ๋กœ ๊ตฌ๋ถ„ํ•˜์˜€๋‹ค. ํ•ฉ์„ฑ๋ฐ์ดํ„ฐ๋ฅผ ์ƒ์„ฑํ•˜๋Š” ๋ถ€๋ถ„์— ์žˆ์–ด GAN, WGAN๊ณผ ๊ฐ™์€ ๊ธฐ์กด ์ƒ์„ฑํ˜• ๋ชจ๋ธ๊ณผ์˜ ์ฐจ์ด์ ์€ ์›๋ณธ ๋ฐ์ดํ„ฐ์…‹์„ ์ง์ ‘ ์ž…๋ ฅํ•˜์ง€ ์•Š๊ณ  ๋ฐ์ดํ„ฐ์…‹์˜ ํŠน์„ฑ์„ LLM ํ”„๋กฌํ”„ํŠธ์— ์ž…๋ ฅํ•˜๋Š” ๋ถ€๋ถ„์ด๋‹ค.

ํ‘œ 5. ํ•ฉ์„ฑ ๋ฐ์ดํ„ฐ ์ƒ์„ฑ์„ ์œ„ํ•ด LLM ํ”„๋กฌํ”„ํŠธ์— ์ž…๋ ฅํ•˜๋Š” ํ•ญ๋ชฉ ์„ค๋ช…

Table 5. Feature relationships entered into the LLM prompt

../../Resources/kiee/KIEE.2026.75.6.1427/tb5.png

๋„คํŠธ์›Œํฌ ํŠธ๋ž˜ํ”ฝ ํ•ฉ์„ฑ์„ ์œ„ํ•ด ํ”„๋กฌํ”„ํŠธ์— ์ž…๋ ฅํ•˜๋Š” ์ •๋ณด๋กœ๋Š” ์›๋ณธ ๋ฐ์ดํ„ฐ์˜ ํ†ต๊ณ„์  ํŠน์„ฑ์ธ ํ‰๊ท , ํ‘œ์ค€ํŽธ์ฐจ, ์ตœ์†Œ๊ฐ’, ์ตœ๋Œ€๊ฐ’ ๊ทธ๋ฆฌ๊ณ  ํ”ผ์ฒ˜ ๊ฐ„์˜ ๊ด€๊ณ„๋ฅผ ์ž์—ฐ์–ด ํ”„๋กฌํ”„ํŠธ๋กœ ๋ณ€ํ™˜ํ•˜์—ฌ ์ž…๋ ฅํ•˜๋Š” ๋ฐฉ์‹์œผ๋กœ ํ•ฉ์„ฑ์„ ์ˆ˜ํ–‰ํ•˜์˜€๋‹ค. Table 5๋Š” ์•ž์„œ 1โˆผ3๋‹จ๊ณ„์—์„œ ํ™•์ธํ•œ ๊ฐ ํ”ผ์ฒ˜ ๊ฐ„์˜ ๊ด€๊ณ„์— ๋Œ€ํ•œ ์ˆ˜์‹๊ณผ ์„ค๋ช…์œผ๋กœ LLM ํ”„๋กฌํ”„ํŠธ์— ์ž…๋ ฅํ•˜๋Š” ์ •๋ณด์ด๋‹ค. ๋ณธ ์‹คํ—˜์€ ๊ฐœ๋… ๊ฒ€์ฆ์„ ์œ„ํ•œ ์‹คํ—˜์œผ๋กœ LLM์— ๋Œ€ํ•œ ๋ณ„๋„ ํŒŒ์ธํŠœ๋‹์„ ์ง„ํ–‰ํ•˜์ง€ ์•Š์•˜์œผ๋ฉฐ, NVIDIA RTX 2000 Ada 16GB, Intel Xeon w5-2445์ด ์„ค์น˜๋œ ์›Œํฌ์Šคํ…Œ์ด์…˜ ํ™˜๊ฒฝ์—์„œ 1,000๊ฐœ์˜ ์ƒ˜ํ”Œ์„ LLM ๊ธฐ๋ฐ˜์œผ๋กœ ์ƒ์„ฑํ•˜๋Š”๋ฐ 706์ดˆ๊ฐ€ ์†Œ์š”๋˜์—ˆ๋‹ค.

4.2 LLM ํ•ฉ์„ฑ ํŠธ๋ž˜ํ”ฝ ๋ฐ์ดํ„ฐ ๋ถ„ํฌ ๋น„๊ต

Fig. 1์€ ์›๋ณธ ๋ฐ์ดํ„ฐ์…‹, WGAN-GP ํ•ฉ์„ฑ ๋ฐ์ดํ„ฐ์…‹, LLM ํ•ฉ์„ฑ ๋ฐ์ดํ„ฐ์…‹ ๊ฐ„ ์œ ์‚ฌ๋„ ๋น„๊ต๋ฅผ ์œ„ํ•œ t-SNE ๋ถ„ํฌ ๋‹ค์ด์–ด๊ทธ๋žจ์ด๋‹ค. LLM์ด ์ƒ์„ฑํ•œ ํ•ฉ์„ฑ ํŠธ๋ž˜ํ”ฝ์€ ๊ธฐ์กด ์—ฐ๊ตฌ์—์„œ ์‚ฌ์šฉํ•œ WGAN-GP ๋ชจ๋ธ๋กœ ์ƒ์„ฑํ•œ ํŠธ๋ž˜ํ”ฝ์— ๋Œ€๋น„ํ•˜์—ฌ ์›๋ณธ ํŠธ๋ž˜ํ”ฝ๊ณผ ์ข€ ๋” ์œ ์‚ฌํ•œ ํ˜•ํƒœ๋ฅผ ๋‚˜ํƒ€๋‚ด๊ณ  ์žˆ๋‹ค. ์ด๋Š” ๊ฐ ํ”ผ์ฒ˜๋“ค์„ Jensen-Shannon Divergence (JSD)๋ฅผ ์ด์šฉํ•œ Fig. 2.์˜ ๋ถ„ํฌ ๋น„๊ต์—์„œ๋„ LLM์œผ๋กœ ์ƒ์„ฑํ•œ ํŠธ๋ž˜ํ”ฝ์€ JSD ํ‰๊ท  0.2105๋กœ WGAN-GP์˜ 0.6746 ๋ณด๋‹ค 68.8% ๋‚ฎ์•„ WGAN-GP๋กœ ํ•ฉ์„ฑํ•œ ๋ฐ์ดํ„ฐ์™€ ๋Œ€๋น„ํ•˜์—ฌ ์›๋ณธ๊ณผ ์œ ์‚ฌํ•œ ๋ถ„ํฌ๋ฅผ ๋ณด์—ฌ์ฃผ์—ˆ๋‹ค.

๊ทธ๋ฆผ 1. ํ•ฉ์„ฑ ํŠธ๋ž˜ํ”ฝ ๋ถ„ํฌ ๋น„๊ต (t-SNE)

Fig. 1. Synthetic traffic distribution (t-SNE)

../../Resources/kiee/KIEE.2026.75.6.1427/fig1.png

๊ทธ๋ฆผ 2. ํ•ญ๋ชฉ ๋ณ„ ์  ์Šจ-์„€๋„Œ ๋ฐœ์‚ฐ ๋น„๊ต

Fig. 2. Jensen-Shannon Divergence per Features

../../Resources/kiee/KIEE.2026.75.6.1427/fig2.png

LLM์ด ์ƒ์„ฑํ•œ ํ•ฉ์„ฑ ํŠธ๋ž˜ํ”ฝ์ด ์›๋ณธ๊ณผ ๋ถ„ํฌ๊ฐ€ ์œ ์‚ฌํ•œ ์ด์œ ๋Š” ์›๋ณธ ๋ฐ์ดํ„ฐ ๊ฐ ํ”ผ์ฒ˜์˜ ํ‰๊ท , ๋ถ„์‚ฐ, ์ตœ๋Œ€๊ฐ’, ์ตœ์†Œ๊ฐ’์„ ํ”„๋กฌํ”„ํŠธ์— ์ž…๋ ฅํ•œ ๋ถ€๋ถ„์ด ๋ฐ˜์˜๋œ ๊ฒƒ์œผ๋กœ ์˜ˆ์ธกํ•  ์ˆ˜ ์žˆ๋‹ค. ์ด๋Ÿฌํ•œ ๊ฒฐ๊ณผ๋กœ ๋ฏธ๋ฃจ์–ด ๋ณด์•„, ํ–ฅํ›„ LLM์„ ์ด์šฉํ•œ ํŠธ๋ž˜ํ”ฝ ํ•ฉ์„ฑ ์—ฐ๊ตฌ๊ฐ€ ๋†’์€ ์‹คํšจ์„ฑ์„ ๋ณด์—ฌ์ค„ ๊ฒƒ์œผ๋กœ ์˜ˆ์ƒํ•  ์ˆ˜ ์žˆ๋‹ค.

4.3 LLM ํ•ฉ์„ฑ ํŠธ๋ž˜ํ”ฝ ๋ฐ์ดํ„ฐ ๋ถ„ํฌ ๋น„๊ต

์›๊ฒฉ์ ‘์† ํ†ต์‹ ์„ ๋ถ„๋ฅ˜ํ•˜๋Š” ํƒ์ง€ ๋ชจ๋ธ์€ Random Forest๋ฅผ ์ด์šฉํ•˜์˜€์œผ๋ฉฐ, 4.1์žฅ์—์„œ ์ƒ์„ฑํ•œ 1,000๊ฐœ์˜ ์ƒ˜ํ”Œ์„ ํ›ˆ๋ จ ๋ฐ์ดํ„ฐ์…‹์— ๋ฐ˜์˜ํ–ˆ์„ ๋•Œ ํƒ์ง€ ์„ฑ๋Šฅ์„ ํ™•์ธํ•˜์˜€๋‹ค. 4.2 ๋ฐ์ดํ„ฐ ๋ถ„ํฌ์—์„œ ์˜ˆ์ƒํ•  ์ˆ˜ ์žˆ๋“ฏ์ด WGAN-GP๋ณด๋‹ค ๋„“์€ ๋ถ„ํฌ๋ฅผ ๊ฐ€์ง„ LLM ๊ธฐ๋ฐ˜ ํ•ฉ์„ฑ ๋ฐ์ดํ„ฐ์…‹์ด ํƒ์ง€ ๋ชจ๋ธ์˜ Recall์„ ํฐ ํญ์œผ๋กœ ํ–ฅ์ƒ์‹œํ‚ค๋Š” ๊ฒƒ์„ ํ™•์ธํ•  ์ˆ˜ ์žˆ์—ˆ๋‹ค. Table 6์—์„œ LLM ๋ชจ๋ธ๋กœ ์ƒ์„ฑํ•œ ํ•ฉ์„ฑ ๋ฐ์ดํ„ฐ 1,000๊ฐœ๋ฅผ ์ถ”๊ฐ€ํ•œ ์ƒํ™ฉ์—์„œ Recall 42.3%, F1โ€“Score๊ฐ€ 21.4% ๊ฐœ์„ ๋˜๋Š” ๊ฒƒ์„ ํ™•์ธํ•  ์ˆ˜ ์žˆ์—ˆ๋‹ค. ์ฐธ๊ณ ๋กœ ๋ณธ ์‹คํ—˜์— ์‚ฌ์šฉํ•œ ๋ฐ์ดํ„ฐ์…‹๊ณผ ํƒ์ง€ ๋ชจ๋ธ์€ Github ๋ ˆํŒŒ์ง€ํ† ๋ฆฌ ํ†ตํ•ด ๊ณต๊ฐœํ•˜๊ณ  ์žˆ๋‹ค[35].

ํ‘œ 6. ํ•ฉ์„ฑ ๋ฐ์ดํ„ฐ ์ ์šฉ์— ๋”ฐ๋ฅธ ํƒ์ง€ ์„ฑ๋Šฅ ๊ฐœ์„  ๋น„๊ต

Table 6. Comparison of Detection Performance Improvement by Synthetic Data

Synthetic Data Generation Model Precision Recall F1-Score
Original Dataset 100% 65.55% 79.18%
WGAN-GP 100% 71.26% 83.22%
LLM 100% 93.25% 96.14%

5. ๊ฒฐ ๋ก 

5.1 ๊ธฐ๋Œ€ํšจ๊ณผ

๋ณธ ๋…ผ๋ฌธ์€ SSHยทRDP ๋“ฑ๊ณผ ๊ฐ™์€ ์•”ํ˜ธํ™”๋œ ์›๊ฒฉ ์ ‘์† ์ ‘์†์„ ํšจ๊ณผ์ ์œผ๋กœ ํƒ์ง€ํ•˜๊ธฐ ์œ„ํ•ด AI ๊ธฐ๋ฐ˜ ํƒ์ง€ ํ”Œ๋žซํผ์˜ ์„ค๊ณ„ ๊ฐ€์ด๋“œ๋ผ์ธ์„ ์ฒด๊ณ„์ ์œผ๋กœ ์ œ์‹œํ•˜๊ณ  ์žˆ๋‹ค. ๋ฐ์ดํ„ฐ ์ˆ˜์ง‘๋ถ€ํ„ฐ ์ „์ฒ˜๋ฆฌ, ํƒ์ง€๋ชจ๋ธ ์„ค๊ณ„์— ์ด๋ฅด๊ธฐ ๊นŒ์ง€ 3๋‹จ๊ณ„ ํ”„๋กœ์„ธ์Šค๋ณ„ ํ•ต์‹ฌ ๋ฌธ์ œ์ ์„ ๊ทœ๋ช…ํ•˜๊ณ , 1์„ธ๋Œ€ DPI์—์„œ 4์„ธ๋Œ€ LLM ๊ธฐ๋ฐ˜ ์˜ˆ์ธก์œผ๋กœ์˜ ๊ธฐ์ˆ  ์ง„ํ™” ๊ณผ์ •์„ ๊ธฐ์กด ์‹ค์ฆ ์—ฐ๊ตฌ์™€ ์—ฐ๊ณ„ํ•˜์—ฌ ์„ค๋ช…ํ•˜์˜€๋‹ค.

4์„ธ๋Œ€ ๊ธฐ์ˆ ์ด ์„ฑ์ˆ™ ๋‹จ๊ณ„์— ๋„๋‹ฌํ•  ๊ฒฝ์šฐ, ๋ณด์•ˆ ๊ด€์ œ ์ž๋™ํ™” ์ˆ˜์ค€์ด ํš๊ธฐ์ ์œผ๋กœ ํ–ฅ์ƒ๋  ๊ฒƒ์œผ๋กœ ๊ธฐ๋Œ€๋œ๋‹ค. ์ž์—ฐ์–ด ๊ธฐ๋ฐ˜ ํƒ์ง€ ๊ทœ์น™ ์ž๋™ ์ƒ์„ฑ์œผ๋กœ ๋ณด์•ˆ ์ „๋ฌธ์ธ๋ ฅ ์˜์กด๋„๋ฅผ ๋‚ฎ์ถ”๊ณ , ์ œ๋กœ๋ฐ์ด ๊ณต๊ฒฉ์„ ๋ณ„๋„ ์žฌํ•™์Šต ์—†์ด ์ œ๋กœ์ƒท ํƒ์ง€ ๋Šฅ๋ ฅ์œผ๋กœ ํƒ์ง€ํ•˜๊ฒŒ ๋˜์–ด APT ๋“ฑ ๊ณ ๋„ํ™”๋œ ์œ„ํ˜‘์— ์„ ์ œ ๋Œ€์‘์ด ๊ฐ€๋Šฅํ•  ๊ฒƒ์œผ๋กœ ์˜ˆ์ƒํ•œ๋‹ค. ์ถ”๊ฐ€์ ์œผ๋กœ LLM ์ถ”๋ก  ๋Šฅ๋ ฅ์„ ํ™œ์šฉํ•œ ๊ณต๊ฒฉ ์˜๋„ ๋ถ„์„์œผ๋กœ ๋Œ€์‘ ์šฐ์„ ์ˆœ์œ„ ์ž๋™ํ™”๋ฅผ ๊ตฌํ˜„ํ•  ์ˆ˜ ์žˆ์„ ๊ฒƒ์œผ๋กœ ์˜ˆ์ƒํ•œ๋‹ค. ์ด๋Ÿฌํ•œ ํƒ์ง€ ์„ฑ๋Šฅ์˜ ๊ฐœ์„ ์€ ๋ณด์•ˆ ๊ด€์ œ์˜ ํŒจ๋Ÿฌ๋‹ค์ž„์„ ๊ธฐ์กด์˜ ๋ฐ˜์‘์ (Reactive) ๋ณด์•ˆ ๊ด€์ œ์—์„œ ์˜ˆ์ธก์ (Predictive) ๋ณด์•ˆ ๊ด€์ œ๋กœ ๋ณ€ํ™”์‹œ์ผœ ๊ฐˆ ๊ฒƒ์ด๋‹ค.

5.2 ํ–ฅํ›„ ์—ฐ๊ตฌ๊ณ„ํš

๊ธฐ์กด ์—ฐ๊ตฌ์—์„œ๋Š” 2์„ธ๋Œ€ ์—ฐ๊ตฌ๋Š” ํฌํŠธ์™€ ์ƒ๊ด€์—†์ด ๋‹ค์ค‘ ๋ถ„๋ฅ˜๋กœ ์šฐํšŒ ๊ณต๊ฒฉ ํƒ์ง€์˜ ์‹คํšจ์„ฑ์„ ์ž…์ฆํ•˜์˜€๊ณ [5], GAN ๊ธฐ๋ฐ˜ ๋ฐ์ดํ„ฐ ์ฆ๊ฐ•์œผ๋กœ ๊ทน๋‹จ์  ๋ถˆ๊ท ํ˜• ํ™˜๊ฒฝ์—์„œ์˜ ํƒ์ง€ ์„ฑ๋Šฅ์„ ํ–ฅ์ƒ์‹œํ‚ค๋Š” ์—ฐ๊ตฌ๋ฅผ ์ˆ˜ํ–‰ํ•˜์˜€๋‹ค[12]. ํ–ฅํ›„์—๋Š” LLM ๊ธฐ๋ฐ˜ 4์„ธ๋Œ€ ๊ธฐ์ˆ ์ด ์ด์ƒ ํŠธ๋ž˜ํ”ฝ ํƒ์ง€์˜ ํ•ต์‹ฌ์ ์œผ๋กœ ์ถ•์œผ๋กœ ๋ถ€์ƒํ•  ๊ฒƒ์œผ๋กœ ์˜ˆ์ƒํ•˜๊ณ  ์žˆ์œผ๋ฉฐ, ์ด๋•Œ ์ผ๋ฐ˜ ์ž์—ฐ์–ด์™€ ์ƒ์ดํ•œ ์ˆ˜์น˜ ์‹œํ€€์Šค ์ค‘์‹ฌ์˜ ํŠธ๋ž˜ํ”ฝ ๋ฐ์ดํ„ฐ๋ฅผ ํšจ๊ณผ์ ์œผ๋กœ ์ฒ˜๋ฆฌํ•  ์ˆ˜ ์žˆ๋Š” ๋„๋ฉ”์ธ ํŠนํ™” ํ† ํฌ๋‚˜์ด์ € ๋ฐ ์ตœ์ ์˜ ๋ฐ์ดํ„ฐ ํ•ฉ์„ฑ์„ ์œ„ํ•œ Structured to Text(S2T)์— ๋Œ€ํ•œ ์—ฐ๊ตฌ๋ฅผ ์ถ”๊ฐ€๋กœ ์ง„ํ–‰ํ•  ์˜ˆ์ •์ด๋‹ค.

References

1 
Mandiant, cloud.google.com/blog/ko/topics/threat-intelligence/m-trends-2025, "M-Trends 2025," Google Cloud Blog, 2025. Google Search
2 
Cisco Talos, 11 Nov. 2025. blog.talosintelligence.com/salt-typhoon-analysis/, "Seeing Inside the Vortex: Detecting Living off the Land Techniques," Cisco Talos Blog, 2025. Google Search
3 
Symantec, broadcom.com/support/security-center/protection-bulletin/symbiote-and-bpfdoor-linux-malware-variants-implement-new-ebpf-filters, 2025., "Symbiote and BPFdoor Linux Malware Variants Implement New eBPF Filters," Protection Bulletin, Broadcom, 2025. Google Search
4 
Yisroel Mirsky, "Kitsune: An Ensemble of Autoencoders for Online Network Intrusion Detection," 2018. Google Search
5 
Junwon Lee, Heejo Lee, "An SSH Predictive Model Using Machine Learning with Web Proxy Session Logs," International Journal of Information Security, vol. 21, no. 2, pp. 311-322, 2021. DOI
6 
Muhammad Shafiq, "An Efficient Method to Detect Periodic Behavior in Botnet Traffic by Analyzing Control Plane Traffic," Journal of Advanced Research, vol. 5, no. 4, 2014. DOI
7 
Wei Wang, "End-to-End Encrypted Traffic Classification with One-Dimensional Convolutional Neural Networks," pp. 43-48, 2017. Google Search
8 
Tal Shapira, Yuval Shavitt, "FlowPic: A Generic Representation for Encrypted Traffic Classification and Applications Identification," IEEE Transactions on Network and Service Management, 2021. DOI
9 
Gerard Draper-Gil, "Characterization of encrypted and vpn traffic using time-related," 2016. Google Search
10 
Aristide Tanyi-Jong Akem, Guillaume Fraysse, Marco Fiore, e2320, "Real Time Encrypted Traffic Classification in Programmable Networks with P4 and Machine Learning," International Journal of Network Management, vol. 35, no. 1, 2025. DOI
11 
Adrian Pekar, Richard Plny, Karel Hynek, arXiv:2601.04089, "Tutorial on Flow-Based Network Traffic Classification Using Machine Learning," arXiv preprint, 2026. Google Search
12 
Junwon Lee, Heejo Lee, "Improving SSH Detection Model Using IPA Time and WGAN-GP," Computers & Security, vol. 116, pp. 102672, 2022. DOI
13 
Scott M. Lundberg, Su-In Lee, "A Unified Approach to Interpreting Model Predictions," vol. 30, 2017. Google Search
14 
X. Zhang, "Network Traffic Grant Classification Based on 1DCNN-TCN-GRU Hybrid Model," Applied Intelligence, 2024. DOI
15 
Wai Weng Lo, "XG-BoT: An Explainable Deep Graph Neural Network for Botnet Detection and Forensics," Internet of Things, 2022. DOI
16 
Tanzeela Altaf, "GNN-Based Network Traffic Analysis for the Detection of Sequential Attacks in IoT," Electronics, MDPI, vol. 13, no. 12, pp. 2274, 2024. DOI
17 
Chuampu Fu, Qi Li, Ke Xu, "Detecting Unknown Encrypted Malicious Traffic in Real Time via Flow Interaction Graph Analysis," ISOC, San Diego, CA, 2023. Google Search
18 
Sadaf Sattar, "Anomaly detection in encrypted network traffic using self-supervised learning," Scientific Reports, vol. 15, no. 1, pp. 2658, 2025. DOI
19 
Przemyslaw Berezinski, "An Entropy-Based Network Anomaly Detection Method," Entropy, vol. 17, no. 4, 2015. DOI
20 
Nikolaos Peppes, "Evaluating Synthetic Malicious Network Traffic Generated by GAN and VAE Models: A Data Quality Perspective," Future Internet, vol. 17, no. 12, pp. 561, 2025. DOI
21 
Tongcan Lin, J. Wang, "RuleLLM: LLM-Driven Rule Generation for Anomaly Network Traffic Identification," The Computer Journal, 2026. DOI
22 
Furqan Rustam, "Few-Shot Retrieval-Augmented LLMs for Anomaly Detection in Network Traffic," Springer Nature Singapore, Singapore, 2025. Google Search
23 
Il Hwan Ji, "Artificial Intelligence-Based Anomaly Detection Technology over Encrypted Traffic: A Systematic Literature Review," Sensors, vol. 24, no. 3, pp. 898, 2024. DOI
24 
Dong Gong, "Memorizing Normality to Detect Anomaly: Memory-Augmented Deep Autoencoder (MemAE)," 2019. Google Search
25 
A. Pinto, "Enhancing Critical Infrastructure Security: Unsupervised Learning Approaches for Anomaly Detection," International Journal of Computational Intelligence Systems, vol. 17, pp. 236, 2024. DOI
26 
K. Yang, arXiv:2104.11146, "An Efficient One-Class SVM for Anomaly Detection in the Internet of Things," arXiv, 2021. Google Search
27 
P. Bountzis, "A Deep One-Class Classifier for Network Anomaly Detection Using Autoencoders and One-Class Support Vector Machines," Frontiers in Computer Science, 2025. DOI
28 
K. E. Kampourakis, arXiv:2601.05022, "Knowledge-to-Data: LLM-Driven Synthesis of Structured Network Traffic for Testbed-Free IDS Evaluation," arXiv, 2025. Google Search
29 
Xinjie Lin, "ET-BERT: A Contextualized Datagram Representation with Pre-training Transformers for Encrypted Traffic Classification," 2022. Google Search
30 
T. Cui, arXiv:2504.04222, "TrafficLLM: Enhancing Large Language Models for Network Traffic Analysis with Generic Traffic Representation," arXiv, 2025. Google Search
31 
Y. Ginige, "TrafficLLM: LLMs for Improved Open-Set Encrypted Traffic Analysis," Computer Networks, 2025. Google Search
32 
Yongjun Huang, "MET-LLM: Enhancing Large Language Models for Malicious Encrypted Traffic Detection," Expert Systems with Applications, vol. 303, pp. 130621, 2025. Google Search
33 
W. Lian, "RuleMaster+: LLM-Based Automated Rule Generation Framework for Intrusion Detection Systems," Chinese Journal of Electronics, vol. 34, no. 5, pp. 1402-1415, 2025. DOI
34 
Piyumi Bhagya Sudasinghe, "Lightweight LLMs for Network Attack Detection in IoT Networks," 2025. Google Search
35 
J. Lee, [Online]. Available: https://github.com/junimirang/Synthetic-Network-Traffic-using-LLM, "Synthetic-Network-Traffic-using-LLM," GitHub, 2026. Google Search

์ €์ž์†Œ๊ฐœ

์ด์ค€์› (Junwon Lee)
../../Resources/kiee/KIEE.2026.75.6.1427/au1.png

He is an Assistant Professor in the Department of Computer Science, Engineering, and Converged Technology at Duksung Womenโ€™s University, Seoul, Korea. Prior to joining academia, he spent 23 years as a Security Engineer at Samsung SDS, gaining extensive industry expertise. His current research interests include AI-based anomaly detection and cloud security.