数仓实战04:数仓搭建-DWD层
1)对用户行为数据解析
2)对核心数据进行判空过滤。
3)对业务数据采用维度模型重新建模,即维度退化。
1.用户行为启动表数据解析
1.1创建启动表
1)建表语句
hive (gmall) > DROP TABLE
IF EXISTS dwd_start_log;
CREATE EXTERNAL TABLE dwd_start_log (
`mid_id` string,
`user_id` string,
`version_code` string,
`version_name` string,
`lang` string,
`source` string,
`os` string,
`area` string,
`model` string,
`brand` string,
`sdk_version` string,
`gmail` string,
`height_width` string,
`app_time` string,
`network` string,
`lng` string,
`lat` string,
`entry` string,
`open_ad_type` string,
`action` string,
`loading_time` string,
`detail` string,
`extend1` string
) PARTITIONED BY (dt string) stored AS parquet lo
还没有评论,来说两句吧...