๊ณ„๋ฐœ๐Ÿ’พ/๋ถ„์„ ๋ฐฉ๋ฒ•๋ก 

Causal Impact Analysis

wolny 2021. 11. 8. 17:45
Causal Impact Analysis(์ธ๊ณผ ํšจ๊ณผ ๋ถ„์„)

 

{CausalImpact}๋Š” ๊ตฌ๊ธ€ ๋ณธ์‚ฌ์—์„œ ๋งŒ๋“  ์‹œ๊ณ„์—ด ์˜ˆ์ธก ๋ชจํ˜• ๊ด€๋ จ R ํŒจํ‚ค์ง€๋กœ Baysian structural time-series model์„ ๊ธฐ๋ฐ˜์œผ๋กœํ•˜๋ฉฐ, ํŠน์ • ์‚ฌ๊ฑด์ด ๋ฏธ์นœ ์˜ํ–ฅ์— ๋Œ€ํ•ด ํ™•์ธํ•  ์ˆ˜ ์žˆ๋‹ค๋Š” ์žฅ์ ์ด ์žˆ๋‹ค. ๋งˆ์ผ€ํŒ… ๋ถ„์„ ๊ด€๋ จํ•ด์„œ ๊ณต๋ถ€ํ•˜๋‹ค๊ฐ€ ์•Œ๊ฒŒ๋œ ๋ฐฉ๋ฒ•๋ก ์ธ๋ฐ, ๋งˆ์ผ€ํŒ… ์ชฝ์—์„œ๋Š” ์ฃผ๋กœ ๊ด‘๊ณ ๋กœ ์ธํ•œ ๊ณ ์œ ํ•œ ํšจ๊ณผ๋ฅผ ์ธก์ •ํ•˜๊ฑฐ๋‚˜ ํ˜น์€ ๊ทธ ํšจ๊ณผ๋ฅผ ๋ฐฐ์ œํ•˜๊ณ  ์ธก์ •ํ•˜๊ณ ์ž ํ•  ๋•Œ ์ฃผ๋กœ ํ™œ์šฉ๋œ๋‹ค. ์˜ˆ๋ฅผ ๋“ค์–ด์„œ ๋‚ด ๋ธ”๋กœ๊ทธ ํฌ์ŠคํŒ…1์˜ ๋ฐฉ๋ฌธ์ž์ˆ˜๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ, ํฌ์ŠคํŒ…2์— ๊ด‘๊ณ ๋ฅผ ํ†ตํ•ด SNS๋‚˜ ์›น์‚ฌ์ดํŠธ์— ํ™๋ณดํ–ˆ์„ ๋•Œ์˜ ๋ฐฉ๋ฌธ์ž์ˆ˜๋ฅผ ์˜ˆ์ธกํ•  ์ˆ˜ ์žˆ๋‹ค๋Š” ์ด๋ก ์ด๋‹ค. (๋ธ”๋กœ๊ทธ์—์„œ ์ฐพ์•„๋ดค์„ ๋•Œ์—๋Š” ์ฃผ๋กœ GA(Google Adsence)๋ฐ์ดํ„ฐ๋ฅผ ๋ถˆ๋Ÿฌ์™€์„œ ์ž์‹ ์˜ ๋ธ”๋กœ๊ทธ ํŠธ๋ž˜ํ”ฝ์„ ํ™œ์šฉํ•ด ์ง„ํ–‰ํ•˜๋Š” ๋“ฏ ํ–ˆ๋‹ค.)

 

โ–ถ R์—์„œ ์‹คํ–‰ํ•˜๋ฉด 3๊ฐœ์˜ plot์ด ๋‚˜ํƒ€๋‚œ๋‹ค.

1) original : (์„ ) ๊ด‘๊ณ  ํšจ๊ณผ๊ฐ€ ๋ฐ˜์˜๋œ ๊ฒฐ๊ณผ์น˜, (์ ์„ ) ๊ด‘๊ณ  ํšจ๊ณผ๋ฅผ ๋ฐฐ์ œํ–ˆ์„ ๋•Œ ์˜ˆ์ƒ๋˜๋Š” ์ถ”์ •์น˜

2) pointwise : ๊ด‘๊ณ ๋กœ ์ธํ•œ ํšจ๊ณผ(original์—์„œ ๋‚˜ํƒ€๋‚˜๋Š” ๊ฒฐ๊ณผ์น˜-์ถ”์ •์น˜)

3) cumulative : ๊ด‘๊ณ  ํšจ๊ณผ์˜ ๋ˆ„์  ์ˆ˜์น˜

 

โ–ถ ๋ชจํ˜•์‹ (Bayesian structural time-serires models)

๋ชจํ˜•์‹

Zt๋Š” d-dimensional output, Tt๋Š” dํ–‰ d์—ด์˜ transition matrix, Rt๋Š” dํ–‰ q์—ด์˜ control matrix

εt๋Š” (0,σt^2) ๋ฅผ ๋”ฐ๋ฅด๋Š” ์ •๊ทœ๋ถ„ํฌ, ηt๋Š” (0,Qt)๋ฅผ ๋”ฐ๋ฅด๋Š” ์ •๊ทœ๋ถ„ํฌ๋กœ ๊ฐ’์ด ํ˜•์„ฑ๋œ๋‹ค.

์ด์ „ ์‹œ์ฐจ์˜ ๊ฐ’์„ ํ†ตํ•ด ๋‹ค์Œ์‹œ์ฐจ๋ฅผ ์˜ˆ์ธกํ•˜๋Š” ํ˜•์‹์œผ๋กœ ์ด๋ฃจ์–ด์ ธ ์žˆ๋‹ค.

 

๊ณ„์ ˆํšจ๊ณผ๋ฅผ 7days๋กœ ๋„ฃ์„ ๊ฒƒ์ธ์ง€(์ฃผ๋ณ„), 52weeks๋กœ ๋„ฃ์„ ๊ฒƒ์ธ์ง€(์—ฐ๋ณ„)๋ฅผ ๋‚˜ํƒ€๋‚ด๋Š” ์‹๋„ ์žˆ์œผ๋‹ˆ ๋‚˜์ค‘์— ๋” ํ™•์ธํ•ด๋ด์•ผ๊ฒ ๋‹ค.

 

ํ™œ์šฉ ๋ฐฉ์•ˆ ๋ฐ ์ถ”๊ฐ€์ ์ธ ์˜๊ฒฌ

์ตœ๊ทผ ๋ฐ์ดํ„ฐ๋ฅผ ํ™œ์šฉํ•œ ๋งˆ์ผ€ํŒ… ๋ฐฉ์•ˆ์ด ์Ÿ์•„์ ธ ๋‚˜์˜ค๊ณ  ์žˆ๋‹ค. ๋‹จ์ˆœํžˆ EDA์™€ ์œ ์ € ์„ธ๊ทธ๋จผํŠธ๋ฅผ ํ†ตํ•œ ๊ฒ‰ํ•ฅ๊ธฐ ์‹์ด ์•„๋‹ˆ๋ผ, ์˜ˆ์ธก ๋ชจํ˜•๊นŒ์ง€ ํ™œ๋ฐœํ•˜๊ฒŒ ์‚ฌ์šฉ๋˜๊ณ  ์žˆ๋Š” ๋“ฏ ํ•˜๋‹ค.

์˜ˆ์‹œ๋กœ BC์นด๋“œ์—์„œ 'BC IDEA'๋ผ๋Š” ๋ฐ์ดํ„ฐ ๊ธฐ๋ฐ˜ ๊ธฐ์—… ๋งž์ถคํ˜• ๋น…๋ฐ์ดํ„ฐ ๋ถ„์„ ์„œ๋น„์Šค๋ฅผ ์ถœ์‹œํ•˜๋ฉด์„œ, ์ƒ๊ถŒ ๋ถ„์„์„ ํ†ตํ•ด ๊ธฐ์—…์˜ ๋‹ˆ์ฆˆ์— ๋งž๋Š” ์กฐ๊ฑด์„ ์ถ”์ฒœํ•˜๋ฉฐ, ๋‹จ์ˆœํ•œ ์ •๋ณด ์ œ๊ณต์„ ๋„˜์–ด์„œ ์ œ์•ˆ๊นŒ์ง€๋„ ์ด๋ฃจ์–ด์ง€๊ณ  ์žˆ๋Š” ๊ฒƒ์ด๋‹ค. ์•„๋ž˜ ๊ธฐ์‚ฌ์—์„œ ๋ณด๋ฉด ์ž์ฒด ๋ชจ๋ธ์„ ํ†ตํ•ด ํ•ด๋‹น ์ง€์—ญ ๋‚ด 3km ๋‚ด ์œ ์‚ฌ ์ ํฌ, ์œ ๋™ ์ธ๊ตฌ ๋ถ„์„(์†Œ๋“, ๊ฐ€๊ตฌ ํ˜„ํ™ฉ ๋“ฑ)์„ ํ†ตํ•ด ์ ํฌ๋ณ„ ์•ˆ์ •์„ฑ๊ณผ ์„ฑ์žฅ์„ฑ์„ ์˜ˆ์ธกํ•œ๋‹ค๊ณ  ๋˜์–ด์žˆ๋‹ค. ์˜ˆ๋ฅผ ๋“ค์–ด A์ง€์—ญ์— CU ํŽธ์˜์ ์„ ๋‚ด๊ณ ์ž ํ•  ๊ฒฝ์šฐ, ๊ทผ๋ฐฉ 3km ๋‚ด์— ํŽธ์˜์ ์ด๋‚˜ ์Šˆํผ, ๋งˆ์ผ“ ๋“ฑ ์‹๋ฃŒํ’ˆ์ ์ด ๋ช‡ ๊ฐœ ์ •๋„ ๋ถ„ํฌํ•˜๋Š”์ง€, ์‚ฌ๋žŒ๋“ค์ด ๋งŽ์ด ์ง€๋‚˜๊ฐ€๋Š” ๊ธธ๋ชฉ์ธ์ง€(ํฐ๊ธธ์ธ์ง€ ๊ณจ๋ชฉ์ธ์ง€๋„ ํŒ๋‹จ์ด ๋ ๊นŒ?), ๊ทผ๋ฐฉ์— ์‚ฌ๋Š” ์ฃผ๋ฏผ๋“ค์˜ ์ˆ˜์™€ ์†Œ๋“๋ถ„์œ„๋Š” ์–ด๋–ป๊ฒŒ ๋ถ„ํฌํ•˜๋Š”์ง€ ๋ฅผ ์˜๋ฏธํ•˜๋Š” ๊ฒƒ ๊ฐ™๋‹ค. ์—ฌ๊ธฐ์„œ ๋” ๋งž์ถคํ˜•๋ถ„์„์„ ํ•ด๋ณด์ž๋ฉด ํŽธ์˜์ ์— ๋งž๋Š” ๋ณ€์ˆ˜๋ฅผ ๋„ฃ์–ด ์ค„ ์ˆ˜ ์žˆ๋‹ค. ์ฃผ๋ณ€์— ํ•™๊ต๋‚˜ ํ•™์›์ด ์žˆ๋Š”์ง€, ์•„๋‹ˆ๋ฉด ํšŒ์‚ฌ๊ฐ€ ์žˆ๋Š”์ง€ ๋ณด๊ณ , ํŽธ์˜์ ์˜ ๋ฌผ๊ฑด์„ ๋‹ค๋ฅด๊ฒŒ ์ถ”์ฒœํ•ด์ค„ ์ˆ˜๋„ ์žˆ์„ ๊ฒƒ์ด๊ณ , ํ•™๊ต๋‚˜ ํ•™์›์ด ์žˆ์—ˆ๋‹ค๋ฉด ํ•™์ƒ๋“ค์˜ ๋“ฑํ•˜๊ต ์‹œ๊ฐ„์— ๋งž์ถฐ ๋ฌผ๋Ÿ‰์ด ์ฑ„์›Œ์งˆ ์ˆ˜ ์žˆ๋„๋ก ์กฐ์–ธ๋„ ๊ฐ€๋Šฅํ•  ๊ฒƒ์ด๋‹ค.

 

https://biz.newdaily.co.kr/site/data/html/2021/08/03/2021080300016.html

 

๋น„์”จ์นด๋“œ, ๋น…๋ฐ์ดํ„ฐ ๋ถ„์„ ์„œ๋น„์Šค 'IDEA' ์ถœ์‹œ…๋งž์ถคํ˜• ๋น„์ฆˆ๋‹ˆ์Šค ์ง€์›

๋น„์”จ์นด๋“œ(BC์นด๋“œ)๋Š” ๊ตญ๋‚ด์™ธ ๊ธฐ์—…, ๊ณต๊ณต๊ธฐ๊ด€, ๋Œ€ํ•™ ๋“ฑ๊ณผ ์ถ•์ ํ•œ ๋ฐ์ดํ„ฐ ์—ญ๋Ÿ‰ ๊ธฐ๋ฐ˜์œผ๋กœ ๊ธฐ์—… ๋งž์ถคํ˜• ๋น…๋ฐ์ดํ„ฐ ๋ถ„์„ ์„œ๋น„์Šค ‘BC IDEA’๋ฅผ ์ถœ์‹œํ–ˆ๋‹ค๊ณ  3์ผ ๋ฐํ˜”๋‹ค. BC IDEA(Intelligence Data for Enterprise Advanc

biz.newdaily.co.kr

 

๋˜ํ•œ, ๋ธ”๋กœ๊ทธ์˜ ๋ฐฉ๋ฌธ์ž ์ˆ˜๊ฐ€ ์˜ค์ง ๊ด‘๊ณ  ํšจ๊ณผ์—๋งŒ ์˜ํ–ฅ์„ ๋ฐ›์•˜๋‹ค๋ฉด ๋ชจํ˜•์‹์„ ํ†ตํ•ด ์˜ˆ์ธก์ด ๊ฐ€๋Šฅํ• ํ…Œ์ง€๋งŒ, ๊ด‘๊ณ  ๋ฟ๋งŒ ์•„๋‹ˆ๋ผ ์‚ฌํšŒ์ ์ธ ์ด์Šˆ์™€ ๋งž๋ฌผ๋ ธ๋‹ค๊ฑฐ๋‚˜, SNS์—์„œ ํ•ซํ•œ ์ฃผ์ œ๋ผ๊ฑฐ๋‚˜ ํ•  ๊ฒฝ์šฐ ์—ฌ๋Ÿฌ ํšจ๊ณผ์— ๋Œ€ํ•ด ์ธก์ •ํ•˜๋Š” ๋ฐฉ๋ฒ•๋ก ๋„ ์ฐพ์•„๋ณด๋ฉด ์ข‹์„ ๊ฒƒ ๊ฐ™๋‹ค. 

 

์ฐธ๊ณ  ๋ฌธํ—Œ

https://research.google/pubs/pub41854/

โ–ถ ๋…ผ๋ฌธ๋ช… : INFERRING CAUSAL IMPACT USING BAYESIAN STRUCTURAL TIME-SERIES MODELS

 

Inferring causal impact using Bayesian structural time-series models – Google Research

An important problem in econometrics and marketing is to infer the causal impact that a designed market intervention has exerted on an outcome metric over time. In order to allocate a given budget optimally, for example, an advertiser must assess to what e

research.google

 

โ–ถ {CausalImpact} package in R

http://google.github.io/CausalImpact/

'๊ณ„๋ฐœ๐Ÿ’พ > ๋ถ„์„ ๋ฐฉ๋ฒ•๋ก ' ์นดํ…Œ๊ณ ๋ฆฌ์˜ ๋‹ค๋ฅธ ๊ธ€

A/B TEST  (3) 2021.11.09