
2020 Kaggle Survey (kaggle survey_Now and After)

by rubyda 2021. 1. 14.

📝 Kaggle has run a user survey every year since 2017. It ran one again in 2020, and I analyzed that data to look for insights.

 

I will share the results grouped into categories by question.

Several questions in the Kaggle survey ask Kagglers the same thing twice: once about the present, and once about the next two years.

I will compare each pair of answers to see whether respondents' thinking changes between the two.
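A comparison like this can be sketched with pandas. The snippet below is only an illustration, not the code from the linked notebook: it assumes the multi-select answers are stored one column per option, with names such as `Q26_A_Part_1` (now) and `Q26_B_Part_1` (in two years), and it uses a tiny made-up table instead of the real survey file. The helper name `count_choices` is hypothetical.

```python
import pandas as pd


def count_choices(df: pd.DataFrame, prefix: str) -> pd.Series:
    """Count how often each option was ticked in the columns starting with `prefix`."""
    cols = [c for c in df.columns if c.startswith(prefix)]
    # Each column holds the option label when ticked and a missing value otherwise.
    # melt() puts all cells in one column; value_counts() ignores the missing ones.
    return df[cols].melt()["value"].value_counts()


# Toy data standing in for the survey responses (made up, not real results).
# Column names are an assumption about the survey file layout.
responses = pd.DataFrame({
    "Q26_A_Part_1": ["AWS", "AWS", None, "AWS"],
    "Q26_A_Part_2": [None, "GCP", "GCP", None],
    "Q26_B_Part_1": ["AWS", None, None, None],
    "Q26_B_Part_2": ["GCP", "GCP", None, "GCP"],
})

now = count_choices(responses, "Q26_A")
after = count_choices(responses, "Q26_B")

# Put both counts side by side; options missing from one side count as 0.
comparison = pd.DataFrame({"now": now, "in_2_years": after}).fillna(0).astype(int)
print(comparison)
```

Placing the two counts in one table makes it easy to plot them as paired bars or to sort by either column.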

 

✏️ Now and After


Q26 cloud computing platforms

This question is about cloud computing platforms.

Today, most respondents are familiar with AWS and GCP, and the next largest group uses no platform at all.

For two years from now, however, many respondents said they want to become familiar with Microsoft Azure.

 

Q27 cloud computing products

This question is about cloud computing products.

Today, the most used products are Amazon EC2, Google Cloud Compute Engine, and AWS Lambda, in that order.

The ranking barely changes for two years from now. The difference is that although many respondents use none of these products today, many hope to try them in the future.
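To check how much a ranking moves between the two answers, one option is to rank both sets of counts and subtract. The sketch below is an assumption about how this could be done, not the notebook's method; the function name `rank_shift`, the placeholder product names, and the counts are all made up for illustration.

```python
import pandas as pd


def rank_shift(now: pd.Series, after: pd.Series) -> pd.DataFrame:
    """Rank options by count (1 = most chosen) and report how far each one moves."""
    table = pd.DataFrame({"now": now, "after": after}).fillna(0)
    ranks = table.rank(ascending=False, method="min").astype(int)
    # Positive shift: the option climbs in the "after" ranking.
    ranks["shift"] = ranks["now"] - ranks["after"]
    return ranks.sort_values("after")


# Placeholder counts, not survey results.
now = pd.Series({"product_a": 50, "product_b": 30, "product_c": 20})
after = pd.Series({"product_a": 60, "product_b": 25, "product_c": 40})

shifts = rank_shift(now, after)
print(shifts)
```

A column of zeros in `shift` would correspond to the "ranking barely changes" case described above.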

 

Q28 machine learning products

This question is about machine learning products.

Today, the largest group of respondents uses no machine learning product, followed by users of Google Cloud products.

The answers for two years from now show that respondents are very interested in trying machine learning products.

Google products take many of the top spots in the ranking, which suggests that respondents are interested in them and prefer them.

 

Q29 data products

This question is about big data products.

Today, respondents mostly use SQL products, that is, relational databases.

The answers for two years from now suggest that respondents are very interested in MongoDB.

 

Q31 business intelligence

This question is about business intelligence tools.

Today, the largest group of respondents uses no such tool. Among those who do, Tableau is the most used, followed by Power BI, Data Studio, and others.

For two years from now, many respondents said they want to try Tableau.

 

Q33 AutoML tools

This question is about what respondents use AutoML for.

Today, a very large share of respondents does not use AutoML. Among those who do, model selection and hyperparameter tuning are the most common uses.

For two years from now, respondents want to use AutoML for model selection, and also for feature engineering and selection, which ranks low among current uses.

 

Q34 automated machine learning tools (or partial AutoML tools)

This question is about which AutoML tools respondents mainly use.

Today, most respondents use none. Among those who do, Auto-Sklearn and Auto-Keras are near the top.

The answers for two years from now show no notable difference from the current ones.

 

Q35 Do you use any tools to help manage machine learning experiments?

This question is about which tools respondents rely on when working with machine learning.

Today, most respondents use none. Among those who do, TensorBoard is the most common choice.

For two years from now, TensorBoard, the most popular tool today, is again the top choice. Many respondents also have no plans to use any tool.


 

Code link


github.com/jaaaamj0711/kaggle_study/blob/master/kaggle_survey/kaggle_survey_Now_and_After.ipynb

 
