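Big Data Comprehensive Practical Case Tutorial (大數據綜合實戰案例教程)
Price: 49 yuan
Series: Industry-Education Integration Textbook Series for the Big Data Technology and Application Major in Vocational Education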
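- Authors: compiled by the Education and Examination Center of the Ministry of Industry and Information Technology; chief editors Tan Zhibin, Deng Li, and Wu Ziying
- Publication date: 2020/11/1
- ISBN: 9787111661030
- Publisher: China Machine Press
- Chinese Library Classification: TP274
- Pages: 236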
- Format: 16mo (16開)
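This book uses the taxi industry of an unnamed city as the setting to show how big data technology is applied in a real project. It has 10 chapters: Chapter 1, Overview of Transportation Big Data; Chapter 2, Deploying the City Taxi Practical Case; Chapter 3, Design of the City Taxi Project; Chapter 4, Python Language Basics; Chapter 5, Data Extraction; Chapter 6, Data Cleaning; Chapter 7, Data Storage; Chapter 8, Data Analysis and Processing; Chapter 9, Using ECharts; and Chapter 10, Comprehensive Programming Practice for the City Taxi Project.
The book is suitable as a textbook for big data and related majors at vocational colleges of all kinds, and can also serve as a reference for big data development engineers and other technical professionals.
The book comes with electronic courseware and source code. Teachers who adopt it as a course textbook can register on the China Machine Press Education Service Network (www.cmpedu.com) and download these materials free of charge.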
Preface
Chapter 1 Overview of Transportation Big Data
1.1 Overview of Big Data
1.2 The Big Data Processing Workflow
1.3 Sources of Transportation Big Data
1.4 Applications of Transportation Big Data
1.5 Challenges Facing the Development of Transportation Big Data
1.6 Review Exercises
Chapter 2 Deploying the City Taxi Project Practical Case
2.1 Project Background
2.2 Main Workflow
2.3 Analysis of Project Difficulties
2.4 Data Loading
2.5 Setting Up the Local Development Environment
2.6 Publishing the Visualization Pages on Tomcat
2.7 Data Visualization Results
2.8 Review Exercises
Chapter 3 Design of the City Taxi Project
3.1 Data Sources
3.2 Overall Project Architecture Design
3.3 Selecting the Required Software
3.4 Hadoop Cluster Planning
3.5 The Big Data ETL Process
3.6 Review Exercises
Chapter 4 Python Language Basics
4.1 Overview of the Python Language
4.2 Overview of PyCharm
4.3 Python Basics
4.4 Review Exercises
Chapter 5 Data Extraction
5.1 Data Crawlers
5.2 Extracting Data from Files
5.3 Review Exercises
Chapter 6 Data Cleaning
6.1 Data Cleaning and Filtering
6.2 Data Output in Various File Formats
6.3 Review Exercises
Chapter 7 Data Storage
7.1 Loading and Storing Data with HDFS
7.2 Loading and Storing Data with Sqoop
7.3 Review Exercises
Chapter 8 Data Analysis and Processing
8.1 Overview of MapReduce
8.2 MapReduce Architecture
8.3 MapReduce Workflow
8.4 Configuring the MapReduce Development Environment
8.5 Counting and Summation
8.6 Total Sort
8.7 Secondary Sort
8.8 Maximum and Minimum Values
8.9 Joins
8.10 Review Exercises
Chapter 9 Using ECharts
9.1 Basic Concepts of ECharts
9.2 Getting Started with ECharts
9.3 Review Exercises
Chapter 10 Comprehensive Programming Practice for the City Taxi Project
10.1 Overall Project Requirements Analysis
10.2 Project Architecture
10.3 Data Extract: Extraction and Format Conversion
10.4 Data Transform: Data Filtering
10.5 Data Transform: Filling In Missing Data
10.6 Data Load: Storing Files in HDFS
10.7 Data Transform: MapReduce
10.8 Data Load: Exporting Data with Sqoop
10.9 Data Visualization
10.10 Review Exercises
References