RDMSãšã®çžéç¹ãç¥ã
é£èŒã®æçµåãšãªãæ¬çš¿ã§ã¯ãDWH(RDBMS)ãšæ°ãããã¯ãããžãŒã§ãããHadoopãšã®ç¹åŸŽã«ã€ããŠæ¯èŒè§£èª¬ããããšã«ãããã
ãŸããäŒããããã®ã¯ãæä»£ãé²ã¿ãããããŒã¿ã掻çšãããããžãã¹äžã®ããŒãºãåºçŸããããŒã¿ãããã°ããŒã¿åããããã®ããŒã¿ã掻çšããããŒã¿åºç€ãRDBã§ãããHadoopã§ãããšããç¹ã ã
RDBã¯ç§éžãªãã¯ãããžãŒã§ãããå®éäœå幎ã«ãããããã©ã³ã¶ã¯ã·ã§ãã«ãªåŠçããã¢ããªãã£ã«ã«ãªç®çã«ãŸã§å©çšãããŠãããããããåè¿°ã®ããã«ããã°ããŒã¿ã®æä»£ãšãªã£ãŠåæãããŒã¿æŽ»çšã«ã¯RDBã ãã§å¯ŸåŠããã«ã¯é£ãããæ°ãããèŠæãããŒã¿åºç€ã«æ±ããããããã«ãªããããã«åãããæ°ãã¯ãããžãŒãç»å ŽããŠããã
ãããHadoopã§ãããããã«ãããã§æ¯èŒããŠããç¹åŸŽã¯ã©ã¡ããè¯ãæªãã§ã¯ãªãã驿驿ã ãšããããšããçè§£ããã ãããã
ããã€ãã®ã¯ã©ã€ããªã¢ãããããäž»ãªãã®ãæãããšä»¥äžãšãªãã
- ã³ã¹ã
- ããŒã¿ã®ç§»åã®å¿ èŠæ§
- ããŒã¿ã®çš®é¡
- ã¹ããŒãå®çŸ©
å©çšããåŽã§äžçªæ³šç®ããã®ã¯ãããŒã¿éãããã®ã³ã¹ãã ããããããã°ããŒã¿ããšããããŒã¯ãŒãããäžçªå®¹æã«æµ®ãã¶ãã€ã³ãã§ãããã
DWHã¯ããšããšåºå¹¹ç³»ã®ã·ã¹ãã çšã«äœãããŠãããããåºå¹¹ã·ã¹ãã ãæ¯ããæ©èœãè±å¯ã«å«ãŸãããã€åºå¹¹ããŒã¿ã¯æ±ºããŠå€ãã¯ãªãããã£ãŠé«äŸ¡ã§ããã®ã«å¯Ÿãã倧éã®ããŒã¿ã䜿ã£ãŠå©çšããããšãããããã®ç®çãšããŠããHadoopã®å Žåã¯ããŒã¿å®¹éåœããã®ã³ã¹ãã¯å§åçã«å®ããªãã1ãã©ãã€ãåœããã®ã³ã¹ãã¯ãDWHã40,000ãã«ãªã®ã«å¯ŸããHadoopã§ã¯ããã1,000ãã«æªæºãšã40å以äžãã®éãããããšã®ããšã (ãŠã©ãŒã«ã»ã¹ããªãŒãã»ãžã£ãŒãã«ãã¬ããŒããåºããŠããã®ã§ãåç §ãããã)ã
次ã«ãããã°ããŒã¿ã§æ³šç®ããã¹ããã€ã³ãã§ããããŒã¿ã®ç§»åã«ã€ããŠãçè§£ããå¿ èŠãããã
åŸæ¥åã¯ååã«è¿°ã¹ãéããäŸãã°NASã¹ãã¬ãŒãžã«ãã°ããããŠDWHã«ç§»åããŠæŽ»çšãšãã£ãããŒã¿ã®ç§»åã䌎ã£ãŠããããHadoopã¯ããŒã¿ã®ãããšããã§ããŒã¿ãåŠçããã®ã§ç§»åã¯äžèŠãšãªããTBãPBã¯ã©ã¹ã«ãªããšããŒã¿ã®ç§»åã¯ã³ã¹ããé«ããããããã§ããã ãç§»åãããªãããšãéèŠã ã
ãã¡ãããåŠçã§ããããŒã¿ã®çš®é¡ã¯ãDWHã§ã¯æ§é åããŒã¿ã®ã¿ãšãªãããHadoopã§ã¯ããããããŒã¿ãåŠçããããšãå¯èœãšãªã£ãŠãããæšä»æ³šç®ã®ãã£ãŒãã©ãŒãã³ã°ãªã©ç»åãæŽ»çšãããœãªã¥ãŒã·ã§ã³ãæ°ããããŒã¿åºç€ã®æ¹ãé©ããŠããã ããã
ãããŠããã1ã€éèŠãªãã€ã³ããã¹ããŒãã«ã€ããŠã§ããã
RDBã¯ã¹ããŒããäœã£ãŠããŒã¿ãå ¥ãããã€ãŸãããŒã¿ã®äœ¿ãæ¹ã¯ããŸãå€åããªãããšãåæã ããåæã¯è©Šè¡é¯èª€ããªããé²ãããããäºåã®ã¹ããŒãå®çŸ©ã¯é£ãããHadoopã§ããã°ããŒã¿ããšããããå ¥ããŠãåŸããç®çã«åããã¹ããŒããå®çŸ©ããããšãã§ããã
ãã®ä»ãå®éã®éçšäžéèŠãšãªãæ¢åã®ã¹ãã«ãè³ç£ãã©ãæ°ããä»çµã¿ã«äœ¿ããã«ãèšåããŠãããã
RDBãšèšãã°ãSQLã§ã®ã¢ã¯ã»ã¹ã ããHadoopãããã«æ¥ãŠSQLã¢ã¯ã»ã¹ã®ããŒãºãåããŠé²åããŠãããããã€ãã®SQLãšã³ãžã³ãHadoopã«ãæãããã«ãªã£ãããã®é²åã¯å€§ããªæå³ããããåŸæ¥ã®æè¡ãã¹ãã«ãšæ°ãã¯ãããžãŒãæ©æž¡ãããåœ¹ãæ ããæ¡çšãžã®æ·å± ãäžããŠããã
![]() |
䞊å忣ãžã®ã·ãããå éãããœãªã¥ãŒã·ã§ã³ãã³ããŒ
Hadoopäžã®SQLãœãªã¥ãŒã·ã§ã³ãããŸããŸãªãã®ãæäŸãããŠããããå€ãã®äŒæ¥ã§ã¯ããŒãºã«å¿ããŠããã€ãã®ãœãªã¥ãŒã·ã§ã³ã䜿ãåããŠãã(以äžåç §)ã
![]() |
ããã§ãããäŒæ¥ã®ããŒã±ãã£ã³ã°éšéã§ã®MapRã®æŽ»çšäŸãç°¡åã«ç޹ä»ãããã
åéšéã§ã¯ã30åã¬ã³ãŒã(容éãšããŠã¯æ°åGB)ã®POSããŒã¿ãå©çšããéèšåŠçããããã£ãããããã容éããããã®ã¬ã³ãŒãæ°ããã«æ¢åã®æ±çšRDBMSã§ã¯ãCPUãå¢ãããSSDãæ¡çšããŠããéèšåŠçãã¿ã€ã ã¢ãŠãããŠããŸããæŽ»çšã§ããªãç¶æ ã«é¥ã£ãŠããã
ããã§åãå ¥ããã®ãæ°ããããŒã¿åºç€ã§ããMapRã§ããã7å°ã®ãµãŒãã«ãã䞊ååŠçã§ãã客æ§ã®èŠæã®1å以å ã«éèšåŠçãçµãããšããç®æšã容æã«éæããããšãã§ããã®ã§ããã
éèšåŠçãæ€çŽ¢ã¯RDBMSåæ§ã«è¡ãããã€ããã䞊å忣ã§è¡ãããäžãåéšéã¯RDBMSãããå€ãéã®ããŒã¿ãå®äŸ¡ã«åŠçã§ããããã«ãªã£ããä»åŸããŒã¿éã®æ¡ãå¢ããŠããããã©ãŒãã³ã¹èŠä»¶ãäžãã£ãŠããã¹ã±ãŒã«ã¢ãŠããããããšã§ã察å¿ãã§ãããããã¯åŸæ¥ã®RDBMSã«åºå·ããªãã£ãããããå®çŸããã±ãŒã¹ã ãšèšããã ããã
ãŸããSAP HANAãHewlett Packard Enterprise VerticaãšMapR (Hadoop)ãšã®çµã¿åãããæ³šç®ã«å€ããã
HANAãVerticaãšãã£ãã¹ã±ãŒã«ã¢ãŠãåã®DWHã®æ³£ãæã¯ãx86ãµãŒãã®å èµã¹ãã¬ãŒãžã䜿ãããããã®ã¹ãã¬ãŒãžã®ç®¡çã«ãããMapR (Hadoop)ã¯ãã¡ã€ã«ã·ã¹ãã ã§ãããããŒã¿å§çž®ãããã¯ã¢ãããã¹ãããã·ã§ãããšãã£ãäžè¬çãªã¹ãã¬ãŒãžã®æ©èœãæã€ããããã®æ³£ãæãè§£æ¶ã§ããã
ããã«ãSAPã¯HANAãšHadoopã«ããããŒã¿ãééçã«æ€çŽ¢ããããã«ãVoraããšãã補åãåºãããããã¯Sparkäžã§åãã®ã ããSparkã¯HadoopããæäŸãããã®ã§ãããèŠªåæ§ãäžããã®ã§ããã
ãã®ãããªç¹æ§ãããDWHãã³ããŒãSASãšãã£ãã¢ããã³ã¹ãåæã®èèãã³ããŒãªã©ããä»ã次ã ãšHadoopãžãšã·ããããŠãã£ãŠãããããã¯ã€ãŸããããã°ããŒã¿åã«ãã£ãŠãã£ã¹ã¯é åãCPUãã¹ã±ãŒã«ãããå¿ èŠããããããã¹ã±ãŒã«ã¢ãŠãåã®äžŠå忣ãžãšã·ããããŠãããšããããšãæå³ããã
ããžãã¹ããŒãºãå€ãããåãªãBIã¬ããŒãã ãã§ã¯ãªããæ©æ¢°åŠç¿ãããŒã¿ãã€ãã³ã°ãå©çšãã顧客åååæãåŸååæãããã«ã¯ãã£ãŒãã©ãŒãã³ã°ãšããŒã¿æŽ»çšãé²ããšãããã«äŒŽãéããçš®é¡ãæ°ãšããç¹ã§ãããŒã¿ãå€ããã
ãã£ãŒãã©ãŒãã³ã°ãªã©ã¯ããããããäŸã§ãç»åãæŽ»çšããããããªããšãå¿ ç¶çã«ããŒã¿åºç€ãå€ããã®ã§ãããããã°ããŒã¿æä»£ãšã¯ãããã£ãããžãã¹ããŒãºã®å€åãããããããæºããã®ã«å¿ èŠã«ãªãHadoopãšããæ°ããŒã¿åºç€ãç»å Žããã®ã§ããã
ããããHadoopã¯åºæ¬ãããåŠçã§ãããåžå Žã®ããŒãºã¯ãããåã«ãšã©ãŸããã«ãªã¢ã«ã¿ã€ã åããŠãããããã§ç»å ŽããŠããã®ããMapReduceã«ä»£ãããšèšã£ãŠã¯èšãéããããããªãããæ°ããããŒã¿åŠçãã¬ãŒã ã¯ãŒã¯ã§ããSparkã ã
Hadoopã®è§£èª¬ã®æ¬¡ã¯ããããªã¢ã«ã¿ã€ã ã«åŠçãã§ãããä»è©±é¡ã®æ°ãã¬ãŒã ã¯ãŒã¯ãSparkãã«ã€ããŠæ°ãã«ã玹ä»ããŠããã
解説è 玹ä»
äžå è (MIHARA Shigeru)
- æ ªåŒäŒç€Ÿãããã¢ãŒã«ã»ãã¯ãããžãŒãº ã¢ã©ã€ã¢ã³ã¹&ãããã¯ãããŒã±ãã£ã³ã° ãã£ã¬ã¯ã¿ãŒ /
ãæ¥æ¬ããŒã¿ãããžã¡ã³ãã»ã³ã³ãœãŒã·ã¢ã (JDMC) ã»ãããŒéšäŒã¡ã³ããŒ
ãµã³ã»ãã€ã¯ãã·ã¹ãã ãºãæ¥æ¬ãªã©ã¯ã«ãæ¥æ¬IBMãšãã£ã倧æãã³ããŒã§ãããã¯ãããŒã±ãã£ã³ã°ãæ°èŠè£œåã®ããžãã¹éçºã«åŸäºãããŒããŠã§ã¢ããããã«ãŠã§ã¢ãåçšããOSSãŸã§ãå€å²ã«ããã補åã®ããã¢ãŒã·ã§ã³ã販路æ¡å€§ãè¡ãã
æ¥æ¬ãªã©ã¯ã«ãšãã¡ã¹ãã«ãŠæ€çŽ¢ãããã¿ã€ãºåéã®ããžãã¹ã«é¢ãããECãããã°ããŒã¿ãšãã£ãITã®æ°åéã«èå³ãæã¡ã2014幎ã«ãããã¢ãŒã«ã»ãã¯ãããžãŒãºãžå ¥ç€Ÿã

