To get started, load the salaryTravelReport.csv file that you created (or simply downloaded) by following the instructions in Preparing Open Georgia Test Data into HDFS. For the examples provided, log into the Sandbox's Hue UI as the user hue, use the File Browser to create an opengeorgia folder within /user/hue, and then upload the file into that folder. You should see something similar to the following once that is done.
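If you would rather script this than click through Hue, the same steps can be sketched with the standard hdfs command-line client. This is only a rough equivalent, not what the examples here rely on. It assumes that the hdfs client is on your PATH, that you run it as a user allowed to write under /user/hue, and that salaryTravelReport.csv sits in the current directory. The helper names build_upload_commands and run_upload are made up for illustration.

```python
import subprocess

HDFS_DIR = "/user/hue/opengeorgia"     # folder named in the text above
LOCAL_FILE = "salaryTravelReport.csv"  # assumption: file is in the current directory


def build_upload_commands(local_file, hdfs_dir):
    """Return the hdfs CLI invocations that mirror the Hue steps (hypothetical helper)."""
    return [
        ["hdfs", "dfs", "-mkdir", "-p", hdfs_dir],      # create the folder if missing
        ["hdfs", "dfs", "-put", local_file, hdfs_dir],  # upload the CSV into it
        ["hdfs", "dfs", "-ls", hdfs_dir],               # list the folder to confirm
    ]


def run_upload(local_file=LOCAL_FILE, hdfs_dir=HDFS_DIR):
    """Run each command in order, stopping at the first one that fails."""
    for cmd in build_upload_commands(local_file, hdfs_dir):
        subprocess.run(cmd, check=True)
```

Calling run_upload() on a machine with the client installed should leave the file in /user/hue/opengeorgia, where the Hue File Browser will show it just as if it had been uploaded through the UI.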

Now that we have some data, we are ready to answer the following question as a simple, illustrative example of the kinds of analysis that could be done.

Include Page
Simple Open Georgia Use Case

As for which tools to use, the following blog entries (any that are not linked are coming soon) present different tool options for answering questions such as this.

  • use mapreduce to calculate salary statistics for georgia educators (first of a three-part series)
  • use pig to calculate salary statistics for georgia educators (second of a three-part series)
  • use hive to calculate salary statistics for georgia educators (third of a three-part series)


Warning

Please ignore everything below this warning; it will soon be moved.


We can then upload the file via Hue to /user/hue/something. The file has the following format.

...