An implementation of ROLLUP H2IRG on APACHE PIG

Introduction

Experimental Setup

Experiments' Results

View the Project on GitHub bigfootproject/pig/tree/pig-rollup

Experimental Setup

We ran the experiments on the Hadoop Cluster which has:
We made the experiments on three data input sets:

We also made an experiment on data input set uniform_syn with two ROLLUP in the CUBE clause, the first ROLLUP for (year, month, day) the second ROLLUP for (hour, minute, second) and uses the SUM aggregate function.

For each data input set, we ran two experiments: one which uses the current ROLLUP, one which uses our ROLLUP.
All the experiments with our ROLLUP were executed with the PIVOT=3.