Introduction

After cleaning my data in my last post, I am now going to do some brief Exploratory Data Analysis by making some graphs and looking at the different variables that I decided to keep. I chose to get rid of some more variables, since my data set was so imposing with the 21 different variables, so I ended up shaving it down to just 10 variables for the all the Pokémon in the first five generations. Those variables included some that I decided to explore throughout this EDA, namely HP (hit points), Generation, Height, Weight, Primary type, whether the Pokémon is legendary or not, and the name of the Pokémon.

As I was making these graphs, I wanted to have the same colors that PokémonGo does, so I looked and found this Github repository which had all the different colors for each Pokémon type. I ended up reading them in as a dictionary and was able to apply the same colors for each type of Pokémon throughout all the graphics in this post.

Percentage male

As I discussed in my last blog post, I want to explore how the percentage of males depends on the Pokémon and how that relates to the height, weight, and HP of a Pokémon.

Here is a scatterplot of the different percentage of males bassed on the HP of the Pokémon.

Here are side-by-side box plots of the different percentage of males based on the type of the Pokémon.

Here is a bar chart of the percentage of males based on the type of the Pokémon.

Here is a bar chart of the different percentage of males based on the generation of the Pokémon.

Basic plots

After looking at some plots of the percentage of males per Pokemon, I decided to explore some other possible relationships between some of my other variables.

Here is a bar plot of all the Pokémon based on their type

This is a side-by-side boxplot of the different types of Pokémon and how they relate based on HP.

This is a scatterplot of the height and weight of all the Pokémon with color based on the type and the size of the circles based on the HP of the Pokémon.

Generations

If you look at the above plots, there are so many Pokémon that it is hard to see things, so I decided to look at different variables based on the Generation of the Pokémon.

Side-by-side box plots of the weight of the Pokémon by Generation.

Side-by-side box plots of the height of the Pokémon by Generation.

After looking at these plots, I decided to look at the different Generations on their own plots.

Generation 1

These are some plots only using Pokémon from the first generation.

Here are boxplots of the HP of the first generation Pokémon by their type.

Above is a scatterplot of all the first generation Pokémon by their Height and Weight with the size of the dots signifying their HP.

Generation 2

These are some plots only using Pokémon from the second generation.

Here are boxplots of the HP of the second generation Pokémon by their type.

Above is a scatterplot of all the second generation Pokémon by their Height and Weight with the size of the dots signifying their HP.

Generation 3

These are some plots only using Pokémon from the third generation.

Here are boxplots of the HP of the third generation Pokémon by their type.

Above is a scatterplot of all the third generation Pokémon by their Height and Weight with the size of the dots signifying their HP.

Generation 4

These are some plots only using Pokémon from the fourth generation.

Here are boxplots of the HP of the fourth generation Pokémon by their type.

Above is a scatterplot of all the fourth generation Pokémon by their Height and Weight with the size of the dots signifying their HP.

Generation 5

These are some plots only using Pokémon from the fifth generation.

Here are boxplots of the HP of the fifth generation Pokemon by their type.

Above is a scatterplot of all the fifth generation Pokemon by their Height and Weight with the size of the dots signifying their HP.

Legendary

I also wanted to explore the different legendary Pokémon, so the following plots only include all the legendary Pokémon.

Above is a side-by-side boxplot of the HP of the legendary Pokémon by type.

Above is a scatter plot of Height by Weight for the legendary Pokémon.

Above is a bar chart of the legendary Pokémon based on their generation.

Above is a bar chart of the legendary Pokémon based on their type.

Conclusion

There is still a lot to learn from this data set, I am excited to further explore the data as I seek to tell a story with my data. If you enjoyed the interactive graphs, I would recommend checking out Plotly Express, it is kind of my new favorite thing when making graphs.

Here is my Github repository where I keep my Pokémon code.