Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

Version 1 Next »

If the intention was to build an automated "scraping" application to retrieve all the various information available on Open Georgia, then a rigorous, production-ready, ELT (not ETL) solution would need to be developed, tested, and deployed to ensure all data was presented and that it met a high degree of data quality.  For the Open Georgia Analysis effort, we just want to focus in on the salary data presented for teachers and staff across several local boards of education.

Drilling down into the Salaries & Travel Reimbursements area of the site much information is quickly at your fingertips.  For this analysis effort, only the following Organization data was retrieved.

Fiscal YearOrganization TypeOrganizationRecords
2010Local Boards of EducationATLANTA INDEPENDENT SCHOOL SYSTEM9,201
2010Local Boards of EducationCOBB COUNTY SCHOOL DISTRICT20,377
2010State Agencies, Boards, Authorities and CommissionsPUBLIC SAFETY, DEPARTMENT OF1,914
2010Local Boards of EducationFULTON COUNTY BOARD OF EDUCATION15,408
2011Local Boards of EducationFULTON COUNTY BOARD OF EDUCATION15,200
2012Local Boards of EducationFULTON COUNTY BOARD OF EDUCATION14,843

Quite obviously these 76,943 (very well-formed) records do not mee the 3V's of "big data", but they do allow us to exercise the Hadoop tooling to perform some level of data analysis.

  • No labels