书目

Hadoop: The Definitive Guide: Storage and Analysis at Internet Scale

内容简介

Getreadytounlockthepowerofyourdata.Withthefourtheditionofthiscomprehensiveguide,you?lllearnhowtobuildandmaintainreliable,scalable,distributedsystemswithApacheHadoop.Thisbookisidealforprogrammerslookingtoanalyzedatasetsofanysize,andforadministratorswhowanttosetupandrunHadoopclusters.UsingHadoop2exclusively,authorTomWhitepresentsnewchaptersonYARNandseveralHadoop-relatedprojectssuchasParquet,Flume,Crunch,andSpark.You?lllearnaboutrecentchangestoHadoop,andexplorenewcasestudiesonHadoop?sroleinhealthcaresystemsandgenomicsdataprocessing.LearnfundamentalcomponentssuchasMapReduce,HDFS,andYARNExploreMapReduceindepth,includingstepsfordevelopingapplicationswithitSetupandmaintainaHadoopclusterrunningHDFSandMapReduceonYARNLearntwodataformats:AvrofordataserializationandParquetfornesteddataUsedataingestiontoolssuchasFlume(forstreamingdata)andSqoop(forbulkdatatransfer)Understandhowhigh-leveldataprocessingtoolslikePig,Hive,Crunch,andSparkworkwithHadoopLearntheHBasedistributeddatabaseandtheZooKeeperdistributedconfigurationservice,Counselsprogrammersandadministratorsforbigandsmallorganizationsonhowtoworkwithlarge-scaleapplicationdatasetsusingApacheHadoop,discussingitscapacityforstoringandprocessinglargeamountsofdatawhiledemonstratingbestpracticesforbuildingreliableandscalabledistributedsystems.Original.,

目录

—  END  —