OCIPOC/flightTicket-bigdata-demo

flightTicket-bigdata-demo

A simple Python Flask web project using MySQL, Spark, HDFS, and ECharts.

To learn some basic applications of big data frameworks, I used VMware to build 3 virtual machines, which serve as the database host as well as the nodes for Spark and HDFS.

data acquisition --- crawlers

use regular expressions to match and collect data from static HTML
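
A minimal sketch of regex-based extraction, for illustration only. The README does not show the crawled pages, so the HTML snippet, the class names, and the field names (`no`, `origin`, `dest`, `price`) below are all assumptions, and `extract_flights` is a hypothetical helper rather than the project's actual crawler code.

```python
import re

# Hypothetical markup: the real pages crawled by the project are not shown here.
SAMPLE_HTML = """
<tr><td class="no">CA1234</td><td class="from">Beijing</td><td class="to">Shanghai</td><td class="price">680</td></tr>
<tr><td class="no">MU5678</td><td class="from">Beijing</td><td class="to">Guangzhou</td><td class="price">1120</td></tr>
"""

# One pattern per table row; named groups keep the extraction readable.
FLIGHT_RE = re.compile(
    r'<td class="no">(?P<no>[A-Z0-9]+)</td>'
    r'<td class="from">(?P<origin>[^<]+)</td>'
    r'<td class="to">(?P<dest>[^<]+)</td>'
    r'<td class="price">(?P<price>\d+)</td>'
)


def extract_flights(html):
    """Return one dict per flight row matched in the static HTML."""
    flights = []
    for match in FLIGHT_RE.finditer(html):
        flights.append({
            "no": match.group("no"),
            "origin": match.group("origin"),
            "dest": match.group("dest"),
            "price": int(match.group("price")),
        })
    return flights


flights = extract_flights(SAMPLE_HTML)
```

Regular expressions like this only hold up while the page layout stays fixed; a change in attribute order or whitespace would require updating the pattern.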

data storing --- hdfs and mysql

data is stored as CSV and uploaded to a remote HDFS; Spark then reads the data from HDFS and writes it into MySQL
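
The storage step could be sketched roughly as follows. This is an assumption-laden outline, not the project's code: the column names in `FIELDS`, the helper names, the HDFS path, the JDBC URL, and the driver class are all hypothetical, and the sketch uses Spark's DataFrame JDBC writer, which may differ from how the project actually writes to MySQL. The generated CSV would be uploaded separately, for example with `hdfs dfs -put`.

```python
import csv
import io

# Assumed column layout; the project's real CSV schema is not given in this README.
FIELDS = ["flight_no", "origin", "dest", "price"]


def rows_to_csv(rows):
    """Serialize crawled rows (a list of dicts) to CSV text with a header line."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=FIELDS, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()


def load_csv_into_mysql(hdfs_path, jdbc_url, table, user, password):
    """Read a CSV file from HDFS with Spark and append it to a MySQL table.

    Requires pyspark and a MySQL JDBC driver on the Spark classpath.
    """
    from pyspark.sql import SparkSession  # imported lazily so the CSV helper works without Spark

    spark = SparkSession.builder.appName("csv-to-mysql").getOrCreate()
    df = spark.read.csv(hdfs_path, header=True, inferSchema=True)
    df.write.jdbc(
        url=jdbc_url,
        table=table,
        mode="append",
        # Driver class is an assumption (MySQL Connector/J 8 naming).
        properties={"user": user, "password": password,
                    "driver": "com.mysql.cj.jdbc.Driver"},
    )
    spark.stop()


csv_text = rows_to_csv([
    {"flight_no": "CA1234", "origin": "Beijing", "dest": "Shanghai", "price": 680},
])
```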

data analyzing --- spark

use Spark RDDs to analyse flight ticket information
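
As one example of the kind of RDD analysis this could involve, the sketch below computes the average price per route. The README does not say which statistics the project computes, so the metric, the CSV column order, and the helper names are assumptions. `sc` is expected to be an existing `SparkContext`.

```python
def parse_line(line):
    """Turn one CSV line into ((origin, dest), (price, 1)) for later averaging.

    Assumes the column order flight_no,origin,dest,price.
    """
    _flight_no, origin, dest, price = line.split(",")
    return ((origin, dest), (float(price), 1))


def add_pairs(a, b):
    """Combine two (price_sum, count) pairs."""
    return (a[0] + b[0], a[1] + b[1])


def average_price_by_route(sc, hdfs_path):
    """Return a list of ((origin, dest), average_price) computed with RDD operations."""
    lines = sc.textFile(hdfs_path)
    header = lines.first()  # skip the CSV header line
    return (
        lines.filter(lambda line: line != header)
        .map(parse_line)
        .reduceByKey(add_pairs)
        .mapValues(lambda total: total[0] / total[1])
        .collect()
    )
```

Carrying a `(sum, count)` pair through `reduceByKey` avoids collecting every price for a route on one node, which `groupByKey` would do.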

data visualization --- ECharts

use JS and templates to show the charts
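
A rough sketch of the server side of this step: building an ECharts option object in Python and serializing it to JSON for the template. The chart type, the title, and the helper name `bar_chart_option` are assumptions; the README does not say which charts the project renders.

```python
import json


def bar_chart_option(title, categories, values):
    """Build a basic ECharts bar chart option as a plain dict."""
    return {
        "title": {"text": title},
        "tooltip": {},
        "xAxis": {"type": "category", "data": list(categories)},
        "yAxis": {"type": "value"},
        "series": [{"type": "bar", "data": list(values)}],
    }


# Hypothetical data, for illustration only.
option = bar_chart_option(
    "Average price by destination",
    ["Shanghai", "Guangzhou"],
    [700.0, 1120.0],
)
option_json = json.dumps(option)
```

In a Flask view, `option_json` could be passed to `render_template` and then handed to `setOption` on a chart created with `echarts.init` in the page's JavaScript.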

ticket recommendation

show tickets that match the user's specific requests
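
One simple way to read "specific requests" is filtering by route and budget, then sorting by price. The sketch below takes that interpretation; the criteria, the field names, and the `recommend` helper are assumptions and not the project's actual recommendation logic.

```python
def recommend(tickets, origin, dest, max_price=None, limit=5):
    """Return up to `limit` tickets on the requested route, cheapest first."""
    matches = [
        t for t in tickets
        if t["origin"] == origin
        and t["dest"] == dest
        and (max_price is None or t["price"] <= max_price)
    ]
    return sorted(matches, key=lambda t: t["price"])[:limit]


# Hypothetical sample data, for illustration only.
TICKETS = [
    {"no": "CA1234", "origin": "Beijing", "dest": "Shanghai", "price": 680},
    {"no": "MU5678", "origin": "Beijing", "dest": "Shanghai", "price": 540},
    {"no": "CZ9012", "origin": "Beijing", "dest": "Guangzhou", "price": 1120},
]

picks = recommend(TICKETS, "Beijing", "Shanghai", max_price=700)
```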

screenshots:

Screenshot 1, Screenshot 2, Screenshot 3, Screenshot 4

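A simple big data project based on the Flask framework

--- crawlers collect the data (an IP proxy pool was added, using https://github.com/Python3WebSpider/ProxyPool )

--- Spark RDDs process the data and write it out

--- data is stored in MySQL

--- JS templates provide a simple display of the data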
