Language Models are Unsupervised Multitask Learners

Language Models are Unsupervised Multitask Learners

  • Link : https://d4mucfpksywv.cloudfront.net/better-language-models/language-models.pdf

๐Ÿ’ก ๊ธฐ์กด ๋ฌธ์ œ๋Š” ๋ญ์•ผ?

๊ธฐ์กด ์‹œ์Šคํ…œ์€ ์œ ๋Šฅํ•œ ์ œ๋„ˆ๋Ÿด๋ฆฌ์ŠคํŠธ๋ณด๋‹ค ํŠน์ • ํ…Œ์Šคํฌ์— ํŠนํ™”๋œ ์ „๋ฌธ๊ฐ€๋ฅผ ๋งŒ๋“œ๋Š” ๊ฒƒ์œผ๋กœ ๋น„์œ ํ•  ์ˆ˜ ์žˆ๋‹ค.

๐Ÿ’ก ํ•ต์‹ฌ ์•„์ด๋””์–ด๊ฐ€ ๋ญ์•ผ?

Language Models are Unsupervised Multitask Learners(2019)์—์„œ๋Š” OpenAI์˜ GPT-2 ๋ชจ๋ธ์„ ์†Œ๊ฐœํ•ฉ๋‹ˆ๋‹ค. ํ•ต์‹ฌ ์•„์ด๋””์–ด๋Š” ํฐ ๋ชจ๋ธ์„, ํฌ๊ณ  ๋‹ค์–‘ํ•œ ๋ฐ์ดํ„ฐ์…‹์œผ๋กœ ํ•™์Šตํ•˜๋ฉด ๋งŽ์€ ๋„๋ฉ”์ธ์—์„œ ์ข‹์€ ์„ฑ๋Šฅ์„ ๋‚ผ ์ˆ˜ ์žˆ๋‹ค๋Š” ๊ฒƒ์ด๋‹ค. GPT-2๋Š” GPT-1๊ณผ ๊ตฌ์กฐ๊ฐ€ ๊ฑฐ์˜ ๋™์ผํ•˜๋ฉฐ ๋”์šฑ ๋งŽ์€ ํŒŒ๋ผ๋ฏธํ„ฐ๋ฅผ ๊ฐ€์ง€๊ณ  ์žˆ๊ณ , Transformer ๋ ˆ์ด์–ด๊ฐ€ ๋” ๋งŽ์ด ์Œ“์ธ ํ˜•ํƒœ๋ฅผ ๊ฐ€์ง€๊ณ  ์žˆ์Šต๋‹ˆ๋‹ค.

Pretraining์—์„œ๋Š” GPT-1๊ณผ ๋™์ผํ•œ ๋ฐฉ๋ฒ•์„ ์‚ฌ์šฉํ–ˆ์ง€๋งŒ, ํ›จ์”ฌ ๋งŽ์€ ๋ฐ์ดํ„ฐ์™€ ํ€„๋ฆฌํ‹ฐ ๋†’์€ ๋ฐ์ดํ„ฐ๋ฅผ ์‚ฌ์šฉํ•˜๊ธฐ ์œ„ํ•ด ๋ฐ์ดํ„ฐ๋ฅผ ํ•„ํ„ฐ๋งํ•˜์—ฌ ์–‘์งˆ์˜ ๋ฐ์ดํ„ฐ์…‹์„ ๋งŒ๋“ค์—ˆ์Šต๋‹ˆ๋‹ค.

Multitask Learning as Question Answering. GPT-2์—์„œ๋Š” ์ถ”๊ฐ€์˜ Fine-tuning์—†์ด zero-shot ์„ธํŒ…์œผ๋กœ ์—ฌ๋Ÿฌ NLP task๋“ค์„ ์ˆ˜ํ–‰ํ•˜๊ธฐ ์œ„ํ•ด task๋ฅผ QA์˜ ํ˜•ํƒœ๋กœ ๋ณ€ํ™˜ํ•˜์˜€์Šต๋‹ˆ๋‹ค. ์˜ˆ๋ฅผ ๋“ค์–ด, ์š”์•ฝ๋ฌธ์ œ๋ฅผ โ€œ(๊ธ€), ์ด ๊ธ€์„ ์š”์•ฝํ•˜๋ฉด?โ€, ๋ฒˆ์—ญ๋ฌธ์ œ๋ฅผ (๋ฌธ์žฅ), ์ด ๋ฌธ์žฅ์„ ์˜์–ด๋กœ ๋ฒˆ์—ญํ•˜๋ฉด?โ€๊ณผ ๊ฐ™์€ ๋ฐฉ์‹์œผ๋กœ ๋ง์ด์ฃ .

  • Summarization : TL;DR text๋ฅผ article์˜ ๋งˆ์ง€๋ง‰์— ์ฃผ๊ณ , TL;DR์ด ๋‚˜์˜ค๋ฉด ์•ž์— ๋‚˜์˜จ ๊ธ€์„ ์š”์•ฝํ•˜๋Š” ํ…Œ์Šคํฌ ์ˆ˜ํ–‰ โ†’ Fine-tuning์—†์ด zero-shot setting์—์„œ ์š”์•ฝ์„ ์ˆ˜ํ–‰ํ•  ์ˆ˜ ์žˆ๊ฒŒํ•จ.
  • Transaltion : ๋ฌธ์žฅ ๋์— โ€œthey say in French:โ€์™€ ๊ฐ™์€ ์–ด๊ตฌ(context of example pairs of the format : English sentence = French sentence)๋ฅผ ๋ถ™์—ฌ ๋ฒˆ์—ญ ํ…Œ์Šคํฌ๋ฅผ ์ˆ˜ํ–‰

์ด๋Ÿฐ ๋ฐฉ์‹์„ ํ†ตํ•ด ์ถ”๊ฐ€ ํ•™์Šต ์—†์ด ์—ฌ๋Ÿฌ task๋ฅผ ์ˆ˜ํ–‰ํ•  ์ˆ˜ ์žˆ์—ˆ์Šต๋‹ˆ๋‹ค. ๋ฌผ๋ก  ํŠน์ • task๋ฅผ ์ถ”๊ฐ€์ ์œผ๋กœ ํ•™์Šตํ•˜๋ฉด ๋” ์ข‹์€ ์„ฑ๋Šฅ์„ ๊ธฐ๋กํ•˜์ง€๋งŒ, ์ถ”๊ฐ€ ํ•™์Šต์ด ์—†์ด ์ˆ˜ํ–‰ํ•  ์ˆ˜ ์žˆ๋‹ค๋Š” ๊ฒƒ์€ ํŠน์ • task๋ฅผ ์œ„ํ•œ ๋ฐ์ดํ„ฐ๋ฅผ ์ˆ˜์ง‘ํ•  ํ•„์š”๊ฐ€ ์—†๋‹ค๋Š” ๋œป์ด๊ธฐ์— ๋งŽ์€ ์‹œ๊ฐ„๊ณผ ๋น„์šฉ์„ ์•„๋‚„ ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.

Categories:

Updated:

Leave a comment