æ¬èªã§ã¯ãããŸã§ãã«ããããã³ã 蚌åžã®SNSè§£æãµãŒãã¹ãªã©ãäŸã«åºããªãããããã°ããŒã¿ã®æŽ»çšäŸãå¯Ÿå¿æè¡ã玹ä»ããŠãããHadoopãããŒã¹ãšããåæã·ã¹ãã ãã倧éããŒã¿ã«å¯Ÿå¿ããDWHãå°å ¥ããããšã§ããããŸã§ã¯é£ããã£ãåæãè¡ããããšããçè§£ããã ãããšæãã
ãããã«ãããŠãéèŠãªã®ã¯å€§éã®ããŒã¿ãé«éã«åŠçããä»çµã¿ãéåžžã®ã·ã¹ãã ã§ããã°çµæãåŸããŸã§ã«1ã«æä»¥äžããã£ãŠããåæããæ°æéããæ°æ¥ã§å®äºãããããšã§ããžãã¹ãå éãããŠããããšãå¯èœã ã
ããããäžå£ã«ããã°ããŒã¿ãšèšã£ãŠãããã®å©çšã·ãŒã³ã¯ããŸããŸããšãã«ã¯ç®çãšããããŒã¿ã®æ€åºããªã¢ã«ã¿ã€ã ã«è¡ããããšããã±ãŒã¹ãããã ããããã®å Žåãæ°æéåŸã«åæçµæãåŸããšããæµãã§ã¯ãåœç¶ãªããéã«åããªãã
ããã§æ¥æ¬IBMã§ã¯ãã¹ããªãŒãã³ã°ããŒã¿ããªã¢ã«ã¿ã€ã ã«åŠçãããããªã¢ãŒããã¯ãã£ãçšæããŠãããæ¬èªã¯ãã¡ãã®æè¡ã®æŠèŠããæ¥æ¬IBM ãœãããŠã§ã¢äºæ¥ ã€ã³ãã©ã¡ãŒã·ã§ã³ã»ãããžã¡ã³ãäºæ¥éš ã¯ãŒã«ãã¯ã€ã ããã°ããŒã¿ ãã¯ãã«ã«ã»ãªãŒãã®å屿п°ã«èããã®ã§ãç°¡åã«ã玹ä»ãããã
Hadoopã§ã¯å¯Ÿå¿ãéã«åããªãã·ãŒã³ã
ãHadoopããŒã¹ã®ã·ã¹ãã ã§ã¯ãåæã¢ãã«ã«å¯Ÿå¿ããããã°ã©ã ããããããå®è£ ããŠãããããŒã¿ã貯ãŸãã®ãåŸ ã£ãŠãããããå€éããããªã©ã§å®è¡ããããšã«ãªãããããã£ãŠãçµæãåŸããããŸã§ã«ã¯æ°åãæ°æéããæ°æ¥ãããããšã«ãªãã
å屿°ã¯HadoopããŒã¹ã®ã·ã¹ãã ã®èª²é¡ããã®ããã«åæããããã«æ¬¡ã®ããã«ç¶ããã
ãäŸãã°ããããã¯ãŒã¯äžã«æµãã倧éããŒã¿ãåžžã«ç£èŠããç®çã®ããŒã¿ãçºèŠãããããã«ã¹ããŒããã©ã³ã«éç¥ãããšãã£ãããšãã§ããããããžãã¹ã倧ããå€ããå¯èœæ§ãããã
ããã¯ããªãã¡ãããã°ããŒã¿ããªã¢ã«ã¿ã€ã ã«åŠçãããšãã詊ã¿ã ãããã§ã«æŽ»çšãã¯ãããŠããæ¥çš®ããããšãããäŸãã°ã以åã®èšäºã§ã玹ä»ããããã¯ã¬ãžããã«ãŒãæ¥çã§ã¯ã決æžåŠçãå®è¡ããåã«ãŠãŒã¶ãŒã®å©çšãã¿ãŒã³ãç §åããåé¡ã®ãããŠãŒã¶ãŒã®å©çšãæ°Žéã§é£ãæ¢ãããšãã詊ã¿ãè¡ãããŠããã
ãŸããèªç©ºç®¡å¶ã·ã¹ãã ã®ãµã€ããŒããæ€ç¥ããç«å±±åŽç«/ç«å·»çºçæãªã©ã®è¢«å®³äºæž¬ãªã©ã§ãã倧éã«éä¿¡ãããŠããããŒã¿ãç¬æã«åæããã·ã¹ãã ãå©çšãããŠããã
ãé¢çœãäŸãšããŠã¯ãè»äºæœèšãªã©ã®éèŠæœèšãžã®äŸµå ¥è æ€ç¥ã·ã¹ãã ãªã©ããããå°äžã«åã蟌ãŸãããã¡ã€ããŒã±ãŒãã«ãé³å£°ã·ã°ãã«ãæŸãããã®ããŒã¿åæãããŠäžæ£äŸµå ¥ãæ€ç¥ããŠããããã®ã·ã¹ãã ã§ã¯æå€§1600䞊åã¹ããªãŒã åŠçãå®è¡ãããŠããã(å屿°)
ãªã¢ã«ã¿ã€ã è§£æãå®çŸã®ä»çµã¿
äžèšã®ãããªå€§éããŒã¿ã®ãªã¢ã«ã¿ã€ã åŠçãå®çŸããŠããã®ããIBM InfoSphere Streamsã(Streams)ã§ããããã¡ãã¯ããããã¯ãŒã¯äžã«æµããããŒã¿ãã¹ãã¬ãŒãžã«è²¯ããããšãªããã®å Žã§è§£æããå¿ èŠã§ããã°ããŒã¿ãå å·¥ããŠåºåããããšãã§ãããã©ãããã©ãŒã ã«ãªãã
ãäžè¬ã«ããã°ããŒã¿é¢é£ã®è£œåã§ãªã¢ã«ã¿ã€ã ãšè¬³ã£ãŠããã®ã¯ãã€ã³ã¡ã¢ãªããŒã¿ã¹ãã¢ã掻çšãããã®ãã»ãšãã©ããã¡ãã¯ã¡ã¢ãªäžã§åŠçãå®è¡ããŠãããšèšãã©ããããŒã¿ããŒã¹ã®ãããªã¹ãã¢ã«æ ŒçŽããŠããåŠçãå®è¡ããããšã«ãªããããå€å°ã®ã¿ã€ã ã©ã°ãçºçãããããã«å¯ŸããŠStreamsã§ã¯ããããã¯ãŒã¯ããåéããããŒã¿ãããã£ã¹ã¯ãžã®æžã蟌ã¿åŠçãè¡ããã«ãã®å Žã§è§£æããŠåŠçãããå ã«æãããããªãäžç¬ãäºããããªã·ã¹ãã ã§ã¯å§åçã«æå¹ãªææ³ã ã(å屿°)
察å¿ããŒã¿ã¯ããã¹ãã ãã§ãªããVoIPãé»åã¡ãŒã«ãé³å£°ããããªãªã©ã倿§ãªåœ¢åŒããµããŒãããŠãããã¹ã±ãŒã«ã¢ãŠããè¡ãéãéçºè ã®è² æ ã軜æžããããã«ãåèªåçã«æ¡åŒµãè¡ããããšããã
ãããã°ããŒã¿ã®äžå¿åŠçãæ ãHadoopããŒã¹ã®IBM InfoSphere BigInsightsã®åã«Streamsãèšçœ®ããããŒã¿ããã£ã«ã¿ãªã³ã°ãããšããäœ¿ãæ¹ãã§ããããŸããDWHã®IBM Netezzaãšé£æºãããNetezzaã®åæã¢ãã«ãåå©çšããããšãå¯èœããããã®3補åãçµã¿åãããã°ãè¶ å€§èŠæš¡ãªããã°ããŒã¿ã«å¯ŸããŠãååã«å¯Ÿå¿ããŠãããã(å屿°)
*ãã*ãã*
æ¬çš¿ã§ã¯ã¹ããŒã¹ã®éœåäžãStreamsã®æ¬åœã«ãããã®éšåãã玹ä»ã§ããªãã£ããã6æ19æ¥(ç«)ã«éå¬ããããããã°ããŒã¿åæãã©ãããã©ãŒã ã»ã»ãããŒãã§ã¯ãå補åã®ã¢ãŒããã¯ãã£ã詳ãã解説ããŠããäºå®ã ã
ãŸããããã°ããŒã¿ã«ããããªã¢ã«ã¿ã€ã åŠçã®æŽ»çšæ³ã«ã€ããŠãããå ·äœçã«ç€ºãããäºå®ãªã®ã§ãäŒæ¥ã·ã¹ãã ã«é¢ããæ¹ã ã¯ãã²ãšããèŽè¬ããã ãããã