Analysis and Visualization
1. The plot of the average gas usage (Gj) of each borough/county.
Findings:
-
Brooklyn/Kings County had the highest average gas usage, with almost double the second highest average.
-
New York had the lowest average gas usage.
3. The plot of the top 5 zip codes with highest average gas usage in Gj using.
Findings:
-
The zip code with the highest average_usage is 11434 which is a part of queens
-
The other four zip codes are a part of Brooklyn/ Kings County which is the borough with the highest average gas usage.
-
80% of the highest average gas usage is a part of Brooklyn
-
20% of the highest average gas usage is a part of Queens
2. The plot of the gas usage (Gj) of an average person in each borough/county in 2010.
Findings:
-
The average gas usage per person is the highest in Brooklyn.
-
The average gas usage per person is the lowest in Staten Island/ Richmond.
4. The plot of the 5 zip codes with the lowest average gas usage in Gj.
Findings:
-
The zip code with the lowest average gas usage is a part of Brooklyn.
-
60% of the lowest average gas usage is a part of queens.
-
40% of the lowest average gas usage is a part of Brooklyn.
5. The plot of the population of each borough in 2010
Findings:
-
Brooklyn had the highest population in 2010
-
Staten Island had the lowest population in 2010
-
Brooklyn's population is about 5x Staten Island's population.
Conclusions
My underlying hypothesis before diving deep into the project was that the borough with the most residential households and the largest population will be the borough with the highest natural gas usage. I predicted that the average gas usage of an average person will be the same regardless of the borough. After cleaning the datasets, performing exploratory data analysis, and creating visualizations, my hypothesis was shown to not be supported. Looking at the first and fifth graphs, it is clear that average gas usage is not directly related to the population size. Despite Staten Island having the smallest population, the average gas usage was not the lowest compared to the other boroughs. Manhattan/ New York County had the third-highest population, however, had the lowest average gas usage.
​
In addition, the second graph depicts the gas usage (Gj) of an average person in each borough/county. I predicted that the average person in each borough/county would have a highly similar gas_usage, however, the graph shows Brooklyn having a much higher usage than the other boroughs. This difference could be due to a lot of different variables such as socioeconomic differences or living environment differences. For example, some bigger or older houses may need more gas to be able to warm up the house. Overall, the exploratory data analysis helped identify trends, in terms of the borough, zip code, and population.