Skip to content

Instantly share code, notes, and snippets.

@zeroflag
Created June 11, 2020 08:32
Show Gist options
  • Save zeroflag/eb0f591e300581091b920ebe29621092 to your computer and use it in GitHub Desktop.
Save zeroflag/eb0f591e300581091b920ebe29621092 to your computer and use it in GitHub Desktop.
+----------------------------------------------------+
| Explain |
+----------------------------------------------------+
| STAGE DEPENDENCIES: |
| Stage-1 is a root stage |
| Stage-0 depends on stages: Stage-1 |
| |
| STAGE PLANS: |
| Stage: Stage-1 |
| Tez |
| DagId: hive_20200611083200_799bb622-4aed-46bb-b01f-f1a20121453d:4 |
| Edges: |
| Map 1 <- Map 2 (BROADCAST_EDGE) |
| Map 3 <- Map 1 (BROADCAST_EDGE) |
| Reducer 4 <- Map 3 (SIMPLE_EDGE) |
| Reducer 5 <- Reducer 4 (SIMPLE_EDGE) |
| DagName: hive_20200611083200_799bb622-4aed-46bb-b01f-f1a20121453d:4 |
| Vertices: |
| Map 1 |
| Map Operator Tree: |
| TableScan |
| alias: store_returns |
| filterExpr: sr_reason_sk is not null (type: boolean) |
| Statistics: Num rows: 2879513368 Data size: 79417621500 Basic stats: COMPLETE Column stats: COMPLETE |
| Filter Operator |
| predicate: sr_reason_sk is not null (type: boolean) |
| Statistics: Num rows: 2778808464 Data size: 76640157772 Basic stats: COMPLETE Column stats: COMPLETE |
| Select Operator |
| expressions: sr_item_sk (type: bigint), sr_reason_sk (type: bigint), sr_ticket_number (type: bigint), sr_return_quantity (type: int) |
| outputColumnNames: _col0, _col1, _col2, _col3 |
| Statistics: Num rows: 2778808464 Data size: 76640157772 Basic stats: COMPLETE Column stats: COMPLETE |
| Map Join Operator |
| condition map: |
| Inner Join 0 to 1 |
| keys: |
| 0 _col1 (type: bigint) |
| 1 _col0 (type: bigint) |
| outputColumnNames: _col0, _col2, _col3 |
| input vertices: |
| 1 Map 2 |
| Statistics: Num rows: 39697264 Data size: 635156228 Basic stats: COMPLETE Column stats: COMPLETE |
| Reduce Output Operator |
| key expressions: _col0 (type: bigint), _col2 (type: bigint) |
| null sort order: zz |
| sort order: ++ |
| Map-reduce partition columns: _col0 (type: bigint), _col2 (type: bigint) |
| Statistics: Num rows: 39697264 Data size: 635156228 Basic stats: COMPLETE Column stats: COMPLETE |
| value expressions: _col3 (type: int) |
| Execution mode: vectorized, llap |
| LLAP IO: may be used (ACID table) |
| Map 2 |
| Map Operator Tree: |
| TableScan |
| alias: reason |
| filterExpr: (r_reason_desc = 'Did not like the warranty ') (type: boolean) |
| Statistics: Num rows: 70 Data size: 7350 Basic stats: COMPLETE Column stats: COMPLETE |
| Filter Operator |
| predicate: (r_reason_desc = 'Did not like the warranty ') (type: boolean) |
| Statistics: Num rows: 1 Data size: 105 Basic stats: COMPLETE Column stats: COMPLETE |
| Select Operator |
| expressions: r_reason_sk (type: bigint) |
| outputColumnNames: _col0 |
| Statistics: Num rows: 1 Data size: 8 Basic stats: COMPLETE Column stats: COMPLETE |
| Reduce Output Operator |
| key expressions: _col0 (type: bigint) |
| null sort order: z |
| sort order: + |
| Map-reduce partition columns: _col0 (type: bigint) |
| Statistics: Num rows: 1 Data size: 8 Basic stats: COMPLETE Column stats: COMPLETE |
| Execution mode: vectorized, llap |
| LLAP IO: may be used (ACID table) |
| Map 3 |
| Map Operator Tree: |
| TableScan |
| alias: store_sales |
| Statistics: Num rows: 28800426268 Data size: 3871338091136 Basic stats: COMPLETE Column stats: COMPLETE |
| Select Operator |
| expressions: ss_item_sk (type: bigint), ss_customer_sk (type: bigint), ss_ticket_number (type: bigint), ss_quantity (type: int), ss_sales_price (type: decimal(7,2)) |
| outputColumnNames: _col0, _col1, _col2, _col3, _col4 |
| Statistics: Num rows: 28800426268 Data size: 3871338091136 Basic stats: COMPLETE Column stats: COMPLETE |
| Map Join Operator |
| condition map: |
| Inner Join 0 to 1 |
| keys: |
| 0 _col0 (type: bigint), _col2 (type: bigint) |
| 1 _col0 (type: bigint), _col2 (type: bigint) |
| outputColumnNames: _col3, _col6, _col8, _col9 |
| input vertices: |
| 0 Map 1 |
| Statistics: Num rows: 488445292 Data size: 1794992240 Basic stats: COMPLETE Column stats: COMPLETE |
| Select Operator |
| expressions: _col6 (type: bigint), CASE WHEN (_col3 is not null) THEN ((CAST( (_col8 - _col3) AS decimal(10,0)) * _col9)) ELSE ((CAST( _col8 AS decimal(10,0)) * _col9)) END (type: decimal(18,2)) |
| outputColumnNames: _col0, _col1 |
| Statistics: Num rows: 488445292 Data size: 1794992240 Basic stats: COMPLETE Column stats: COMPLETE |
| Group By Operator |
| aggregations: sum(_col1) |
| keys: _col0 (type: bigint) |
| minReductionHashAggr: 0.99 |
| mode: hash |
| outputColumnNames: _col0, _col1 |
| Statistics: Num rows: 8953312 Data size: 1002770952 Basic stats: COMPLETE Column stats: COMPLETE |
| Reduce Output Operator |
| key expressions: _col0 (type: bigint) |
+----------------------------------------------------+
| Explain |
+----------------------------------------------------+
| null sort order: z |
| sort order: + |
| Map-reduce partition columns: _col0 (type: bigint) |
| Statistics: Num rows: 8953312 Data size: 1002770952 Basic stats: COMPLETE Column stats: COMPLETE |
| value expressions: _col1 (type: decimal(28,2)) |
| Execution mode: vectorized, llap |
| LLAP IO: may be used (ACID table) |
| Reducer 4 |
| Execution mode: vectorized, llap |
| Reduce Operator Tree: |
| Group By Operator |
| aggregations: sum(VALUE._col0) |
| keys: KEY._col0 (type: bigint) |
| mode: mergepartial |
| outputColumnNames: _col0, _col1 |
| Statistics: Num rows: 1119164 Data size: 125346376 Basic stats: COMPLETE Column stats: COMPLETE |
| Top N Key Operator |
| sort order: ++ |
| keys: _col1 (type: decimal(28,2)), _col0 (type: bigint) |
| null sort order: zz |
| Statistics: Num rows: 1119164 Data size: 125346376 Basic stats: COMPLETE Column stats: COMPLETE |
| top n: 100 |
| Reduce Output Operator |
| key expressions: _col1 (type: decimal(28,2)), _col0 (type: bigint) |
| null sort order: zz |
| sort order: ++ |
| Statistics: Num rows: 1119164 Data size: 125346376 Basic stats: COMPLETE Column stats: COMPLETE |
| TopN Hash Memory Usage: 0.04 |
| Reducer 5 |
| Execution mode: vectorized, llap |
| Reduce Operator Tree: |
| Select Operator |
| expressions: KEY.reducesinkkey1 (type: bigint), KEY.reducesinkkey0 (type: decimal(28,2)) |
| outputColumnNames: _col0, _col1 |
| Statistics: Num rows: 1119164 Data size: 116393064 Basic stats: COMPLETE Column stats: COMPLETE |
| Limit |
| Number of rows: 100 |
| Statistics: Num rows: 100 Data size: 11208 Basic stats: COMPLETE Column stats: COMPLETE |
| File Output Operator |
| compressed: false |
| Statistics: Num rows: 100 Data size: 11208 Basic stats: COMPLETE Column stats: COMPLETE |
| table: |
| input format: org.apache.hadoop.mapred.SequenceFileInputFormat |
| output format: org.apache.hadoop.hive.ql.io.HiveSequenceFileOutputFormat |
| serde: org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe |
| |
| Stage: Stage-0 |
| Fetch Operator |
| limit: 100 |
| Processor Tree: |
| ListSink |
| |
+----------------------------------------------------+
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment