Foreword
Preface
Part I. Architectural Considerations for Hadoop Applications
1. Data Modeling in Hadoop
    Data Storage Options
        Standard File Formats
        Hadoop File Types
        Serialization Formats
        Columnar Formats
        Compression
    HDFS Schema Design
        Location of HDFS Files
        Advanced HDFS Schema Design
        HDFS Schema Design Summary
    HBase Schema Design
        Row Key
        Timestamp
        Hops
        Tables and Regions
        Using Columns
        Using Column Families
        Time-to-Live
    Managing Metadata
        What Is Metadata?
        Why Care About Metadata?
        Where to Store Metadata?
        Examples of Managing Metadata
        Limitations of the Hive Metastore and HCatalog
        Other Ways of Storing Metadata
    Conclusion
2. Data Movement
    Data Ingestion Considerations
        Timeliness of Data Ingestion
        Incremental Updates
        Access Patterns
        Original Source System and Data Structure
        Transformations
        Network Bottlenecks
        Network Security
        Push or Pull
        Failure Handling
        Level of Complexity
    Data Ingestion Options
        File Transfers
        Considerations for File Transfers versus Other Ingest Methods
        Sqoop: Batch Transfer Between Hadoop and Relational Databases
        Flume: Event-Based Data Collection and Processing
        Kafka
    Data Extraction
    Conclusion
3. Processing Data in Hadoop
    MapReduce
        MapReduce Overview
        Example for MapReduce
        When to Use MapReduce
    Spark
        Spark Overview
        Overview of Spark Components
        Basic Spark Concepts
        Benefits of Using Spark
        Spark Example
        When to Use Spark
    Abstractions
        Pig
            Pig Example
            When to Use Pig
        Crunch
            Crunch Example
            When to Use Crunch
        Cascading
            Cascading Example
            When to Use Cascading
    Hive
        Hive Overview
        Example of Hive Code
        When to Use Hive
    Impala
        Impala Overview
        Speed-Oriented Design
        Impala Example
        When to Use Impala
    Conclusion
4. Common Hadoop Processing Patterns
    Pattern: Removing Duplicate Records by Primary Key
        Data Generation for Deduplication Example
        Code Example: Spark Deduplication in Scala
        Code Example: Deduplication in SQL
    Pattern: Windowing Analysis
        Data Generation for Windowing Analysis Example
        Code Example: Peaks and Valleys in Spark
        Code Example: Peaks and Valleys in SQL
    Pattern: Time Series Modifications
        Use HBase and Versioning
        Use HBase with a RowKey of RecordKey and StartTime
        Use HDFS and Rewrite the Whole Table
        Use Partitions on HDFS for Current and Historical Records
        Data Generation for Time Series Example
        Code Example: Time Series in Spark
        Code Example: Time Series in SQL
    Conclusion
5. Graph Processing on Hadoop
    What Is a Graph?
    What Is Graph Processing?
    How Do You Process a Graph in a Distributed System?
        The Bulk Synchronous Parallel Model
        BSP by Example
    Giraph
        Read and Partition the Data
        Batch Process the Graph with BSP
        Write the Graph Back to Disk
        Putting It All Together
        When Should You Use Giraph?
    GraphX
        Just Another RDD
        GraphX Pregel Interface
            vprog()
            sendMessage()
            mergeMessage()
    Which Tool to Use?
    Conclusion
6. Orchestration
    Why We Need Workflow Orchestration
    The Limits of Scripting
    The Enterprise Job Scheduler and Hadoop
    Orchestration Frameworks in the Hadoop Ecosystem
    Oozie Terminology
    Oozie Overview
    Oozie Workflow
    Workflow Patterns
        Point-to-Point Workflow
        Fan-Out Workflow
        Capture-and-Decide Workflow
    Parameterizing Workflows
    Classpath Definition
    Scheduling Patterns
        Frequency Scheduling
        Time and Data Triggers
    Executing Workflows
    Conclusion
7. Near-Real-Time Processing with Hadoop
    Stream Processing
    Apache Storm
        Storm High-Level Architecture
        Storm Topologies
        Tuples and Streams
        Spouts and Bolts
        Stream Groupings
        Reliability of Storm Applications
        Exactly-Once Processing
        Fault Tolerance
        Integrating Storm with HDFS
        Integrating Storm with HBase
        Storm Example: Simple Moving Average
        Evaluating Storm
    Trident
        Trident Example: Simple Moving Average
        Evaluating Trident
    Spark Streaming
        Overview of Spark Streaming
        Spark Streaming Example: Simple Count
        Spark Streaming Example: Multiple Inputs
        Spark Streaming Example: Maintaining State
        Spark Streaming Example: Windowing
        Spark Streaming Example: Streaming versus ETL Code
        Evaluating Spark Streaming
    Flume Interceptors
    Which Tool to Use?
        Low-Latency Enrichment, Validation, Alerting, and Ingestion
        NRT Counting, Rolling Averages, and Iterative Processing
        Complex Data Pipelines
    Conclusion
Part II. Case Studies
8. Clickstream Analysis
    Defining the Use Case
    Using Hadoop for Clickstream Analysis
    Design Overview
    Storage
    Ingestion
        The Client Tier
        The Collector Tier
    Processing
        Data Deduplication
        Sessionization
    Analyzing
    Orchestration
    Conclusion
9. Fraud Detection
    Continuous Improvement
    Taking Action
    Architectural Requirements of Fraud Detection Systems
    Introducing Our Use Case
    High-Level Design
    Client Architecture
    Profile Storage and Retrieval
        Caching
        HBase Data Definition
        Delivering Transaction Status: Approved or Denied?
    Ingest
        Path Between the Client and Flume
    Near-Real-Time and Exploratory Analytics
    Near-Real-Time Processing
    Exploratory Analytics
    What About Other Architectures?
        Flume Interceptors
        Kafka to Storm or Spark Streaming
        External Business Rules Engine
    Conclusion
10. Data Warehouse
    Using Hadoop for Data Warehousing
    Defining the Use Case
    OLTP Schema
    Data Warehouse: Introduction and Terminology
    Data Warehousing with Hadoop
    High-Level Design
        Data Modeling and Storage
        Ingestion
        Data Processing and Access
        Aggregations
        Data Export
        Orchestration
    Conclusion
A. Joins in Impala
Index