-
>
全國計算機等級考試最新真考題庫模擬考場及詳解·二級MSOffice高級應用
-
>
決戰行測5000題(言語理解與表達)
-
>
軟件性能測試.分析與調優實踐之路
-
>
第一行代碼Android
-
>
JAVA持續交付
-
>
EXCEL最強教科書(完全版)(全彩印刷)
-
>
深度學習
Java大數據分析 版權信息
- ISBN:9787564182878
- 條形碼:9787564182878 ; 978-7-5641-8287-8
- 裝幀:一般膠版紙
- 冊數:暫無
- 重量:暫無
- 所屬分類:>>
Java大數據分析 內容簡介
本書一開始先通過使用Java對大數據進行基本的統計分析,然后再討論如分類、回歸、聚類、集成等其他數據分析主題。它還涵蓋了如推薦引擎、大規模圖形分析、實時分析、深度學習等高級主題。書中涵蓋了各種案例研究,例如tweet數據集的情緒分析、針對MovieLens數據集的推薦、電子商務數據集的客戶細分、真實航班數據集的圖表分析。這本書是使用Java實現大數據分析的端到端指南。Java如今已經是主流大數據環境(包括Hadoop)的事實語言。本書將教你如何使用產品友好的Java對大數據進行分析。
Java大數據分析 目錄
Chapter 1:Big Data Analytics with Java
Why data analytics on big data?
Big data for analytics
Big data - a bigger pay package for Java developers
Basics of Hadoop - a Java sub-project
Distributed computing on Hadoop
HDFS concepts
Design and architecture of HDFS
Main components of HDFS
HDFS simple commands
Apache Spark
Concepts
Transformations
Actions
Spark Java API
Spark samples using Java 8
Loading data
Data operations - cleansing and munging
Analyzing data - count, projection, grouping, aggregation, and max/min
Actions on RDDs
Paired RDDs
Saving data
Collecting and printing results
Executing Spark programs on Hadoop
Apache Spark sub-projects
Spark machine learning modules
Mahout - a popular Java ML library
Deeplearning4j - a deep learning library
Summary
Chapter 2: First Steps in Data Analysis
Datasets
Data cleaning and munging
Basic analysis of data with Spark SQL
Building SparkConf and context
Dataframe and datasets
Load and parse data
Analyzing data - the Spark-SQL way
Spark SQL for data exploration and analytics
Market basket analysis - Apriori algorithm
Implementation of the Apriori algorithm in Apache Spark
Efficient market basket analysis using FP-Growth algorithm
Running FP-Growth on Apache Spark
Summary
Chapter 3: Data Visualization
Data visualization with Java JFreeChart
Using charts in big data analytics
Time Series chart
All India seasonal and annual average temperature series dataset
Simple single Time Series chart
Java大數據分析 作者簡介
拉賈特·梅塔 is a VP (technical architect) in technology at JP Morgan Chase in New York. He is a Sun certified Java developer and has worked on Java-related technologies for more than 16 years. His current role for the past few years heavily involves the use of a big data stack and running analytics on it. He is alsoa contributor to various open source projects that are available on his GitHub repository, and is also a frequent writer for dev magazines.
- >
莉莉和章魚
- >
推拿
- >
中國歷史的瞬間
- >
月亮與六便士
- >
山海經
- >
隨園食單
- >
自卑與超越
- >
中國人在烏蘇里邊疆區:歷史與人類學概述