The simplified study can be found here, while the extended one is found here.
SQL Data Manipulation Case Study
R Data Manipulation Data Visualization Machine Learning
SQL Data Manipulation Case Study
R Data Visualization Data Manipulation Probability & Statistics Case Study
In this notebook, I will be using the following packages:
R Data Visualization Case Study
In this notebook, I will be using the following packages:
R Data Visualization Data Manipulation Data Cleaning
In this notebook, I will be using the following package(s):
R Data Visualization Data Manipulation Data Cleaning
In this notebook, I will be using the following package(s):
R Data Visualization Data Manipulation Data Cleaning Date/Time Manipulation
The dataset contains the following variables:
In this notebook, I will be using the following packages:
R Data Visualization Data Manipulation Data Cleaning
The dataset contains the following variables: year, month, day, hour, location, gender, Res1, Res2.
In this notebook, I will be using the following packages:
R Data Visualization Data Manipulation Data Cleaning
In this project, we will answer all these questions by manipulating and visualizing United Nations life expectancy data using ggplot2.
The dataset can be found here and contains the average life expectancies of men and women by country (in years). It covers four periods: 1985-1990, 1990-1995, 1995-2000, and 2000-2005.
In this notebook, I will be using the following packages:
Picture source: worldatlas.com
R Data Visualization Data Manipulation Data Cleaning
We are going on a de·tour to determine which match and stadium had the highest attendance during the 2019 FIFA Women’s World Cup. Why? because we can, of course. We’ll bring out our data importing and cleaning skills to dig through the dirty data, clean it up, and present it in the form of informative polished graphs.
We’ll go over questions like:
In this notebook, I will be using the following packages:
R Data Visualization Data Manipulation Data Cleaning
The Nobel Prize is perhaps the world’s most well known scientific award. Every year it is given to scientists and scholars in chemistry, literature, physics, medicine, economics, and peace. The first Nobel Prize was handed out back in 1901, and at that time the prize was Eurocentric and male-focused, but nowadays it’s not biased in any way. Surely, right?
Well, let’s find out about that! We’ll go over questions like:
The dataset used in this project is from The Nobel Foundation on Kaggle.
In this notebook, I will be using the following packages:
R Data Visualization Data Manipulation Data Cleaning Case Studies
Quantitative analyses can have a significant impact on initiating change within one’s community. When analyzed responsibly, data can provide evidence to understand difficult social issues correctly. In this project, you will leverage publicly available data to interpret crime patterns within the city of San Francisco.
The dataset used in this project is hosted on Kaggle and updated daily. Note: some of the original column names are altered for adherence to a standard naming scheme.
In this notebook, I will be using the following packages:
R Data Visualization Data Manipulation Data Cleaning Case Studies Probability Statistics
Candy Crush Saga is a hit mobile game developed by King (part of Activision-Blizzard) that is played by millions of people all around the world.
In this Project, you will get to work with a real Candy Crush dataset and use this data to estimate level difficulty. This Project assumes you can manipulate data frames using dplyr and make plots using ggplot2.
This project uses data from anonymous players playing one “episode” of the game, in the year 2014.
In this notebook, I will be using the following packages:
R Data Visualization Data Manipulation Case Studies
When beginning a career in data science, one often wonders what programming tools and languages are being used in the industry, and what skills one should learn first. By exploring the 2017 Kaggle Data Science Survey results, you can learn about the tools used by 10,000+ people in the professional data science community.
This project uses a subset of the 2017 Kaggle Machine Learning and Data Science Survey dataset.
In this notebook, I will be using the following packages:
R Data Visualization Data Manipulation Case Studies
“The scientist does not study nature because it is useful; he studies it because he delights in it, and he delights in it because it is beautiful.” - Henri Poincaré
There are many examples of natural facts that can be described in mathematical terms. Nice examples are the shape of snowflakes, the fractal geometry of romanesco broccoli or how self-similarity rules the growth of plants.
In this notebook, I will be using the ggplot2 package. This package is home to many important features that will be useful not only to do art but also to represent data in real-life problems.