项目作者: FatemehTarashi

项目描述 :
Data lake and an ETL pipeline in Spark that loads data from S3, processes the data into analytics tables, and loads them back into S3.
高级语言: Jupyter Notebook
项目地址: git://github.com/FatemehTarashi/data-lake-s3-spark.git
创建时间: 2020-04-14T11:55:48Z
项目社区:https://github.com/FatemehTarashi/data-lake-s3-spark

开源协议:MIT License

下载