Skip to content

Commit

Permalink
Tableau part 3 (#106)
Browse files Browse the repository at this point in the history
* beginning work on part 3 exercises and studio

* finished exercise for part 3

* finished edits to part 3 studio
  • Loading branch information
johncfrauen-lc101 authored Oct 13, 2024
1 parent dda0352 commit 7554106
Show file tree
Hide file tree
Showing 2 changed files with 55 additions and 53 deletions.
83 changes: 41 additions & 42 deletions content/tableau-part-three/exercises/_index.md
Original file line number Diff line number Diff line change
Expand Up @@ -10,68 +10,67 @@ weight = 2
1. Download this [Bike Sharing in Washington D.C. Dataset](https://www.kaggle.com/marklvl/bike-sharing-dataset).
- 2 CSV files total (`day.csv` and `hour.csv`).

1. Be sure to read the context and content about this dataset to better understand how and why data was collected.
1. Open the downloaded dataset in Tableau Public.
- Create a relationship between the CSV files.
- `Dteday` is a shared column between the two tables.
1. Please read the context and content about this dataset to better understand how and why data was collected.
1. Open the downloaded dataset in Tableau Public. Create a relationship between the CSV files using the `Dteday` field.

1. Create a new Tableau Public project to answer the below questions.
1. Create a new Tableau Public project to answer the questions in this exercise.
1. We are going to explore dates and create calculations and parameters with our data that answer our business issue.
1. Your final output will be a dashboard with at least 3 of your charts that provides data useful to the business issue.
1. Your final output will be a dashboard with at least 3 of your charts that provides useful data that addresses the business issue.
1. Here are some [other fun facts about the DC Bike sharing program](https://en.wikipedia.org/wiki/Capital_Bikeshare)

### Business Issue:

The Bike Sharing program did very well in DC and is still popular today. Your city is interested in
creating its own bike sharing program. You have been tasked to see how the first 2 years went in DC to
anticipate any factors that might affect usability such as weather, seasons, or holidays.
anticipate any factors that might affect usability such as weather, seasons, or holidays when implementing the program.

A few questions have been provided to help guide your thoughts in this exercise.
A few questions have been provided to help guide your thoughts in this exercise.

### EDA

1. What hours are these bikes most popular with casual riders?
- What about registered riders?
Remember, sometimes we may want to report on a field as a Measure AND a Dimension.

1. What daily trends do you see between casual riders and registered riders from 2011-2012.
- Use `Dteday` and drill down.

### Basic Calculation:
**We can do this by creating a duplicate of the original field and renaming it to be either `field name - measure`, `field name - dimension`, or your own custom name depending on your use case.**

1. Do holidays impact casual rentals?
1. What about registered rentals?
- Rename `Holidays` from 0 and 1 to “Holiday” and “Non-Holiday” using a calculated field.

1. Does the season increase or decrease usage?
- How does that compare with weather?
### EDA

1. Use a calculation to rename the `Seasons` that is more useful than a number.
- 1 = Spring
- 2 = Summer
- 3 = Fall
- 4 = Winter
1. What hours are these bikes most popular with casual riders? What about registered riders?

1. Use a calculation to rename the `Weather` so that it is useful to us humans.
- 1 = Clear
- 2 = Fog or Mist
- 3 = Light precipitation
- 4 = Heavy precipitation
1. What daily trends do you see between casual riders and registered riders from 2011-2012? Feel free to create multiple sheets to investigate different aspects of the data. Use the `Dteday` field and drill down to investigate.

1. Does the average temp (°F) impact all riders? 1. Use calculations to convert the `Temp` from a “Normalized” scale of 0-1, to °C then finally to °F.
- Convert from 0-1 to Celsius using the following formula: **temp = 47 * temp - 8**
- [Formula source](https://www.andrew.cmu.edu/user/achoulde/94842/homework/homework5.html).
- Quick Check: The average temp of Jan 1, 2011 is 46.72°F and the average temp of Feb 1, 2011 is 33.86°F.
### Basic Calculation:

1. Create these calculated fields and give them relevant names:
1. Use a calculation to rename `Holiday` field values from numbers to the strings below:
- 0 = Holiday
- 1 = Non-Holiday
1. Use a calculation to rename the `Season` field values from numbers to the strings below:
- 1 = Spring
- 2 = Summer
- 3 = Fall
- 4 = Winter
1. Use a calculation to rename the `Weathersit` field values from numbers to the strings below:
- 1 = Clear
- 2 = Fog or Mist
- 3 = Light precipitation
- 4 = Heavy precipitation


2. Then answer these questions:
1. Do holidays impact casual rentals? What about registered rentals?
1. How does the season affect the usage of the program? How is usage affected by the day's average weather?
1. Does the average daily temperature (°F) impact average riders? Use a single calculated field to convert the `Temp` from a “Normalized” scale of 0-1 to °C, then finally to °F.
- Convert from 0-1 to Celsius using the following formula: **celsius = 47 * temp - 8**
- Then, convert the new Celsius value to Fahrenheit using this formula: **(celsius × 9/5) + 32 = fahrenheit**
- [Formula source](https://www.andrew.cmu.edu/user/achoulde/94842/homework/homework5.html).
- Quick Check: The average temp of Jan 1, 2011 is 46.72°F and the average temp of Feb 1, 2011 is 33.86°F

### Table Calculation:

1. How many total bike rentals occurred in the city between Jan 2011 to Dec 2012?
- What are the running totals of bike rides between 2011-2012?
- Hint: You can use a quick calculation for this.
1. How many total bike rentals (`cnt` field) occurred in the city between Jan 2011 to Dec 2012? Then, what is the running total for bike rides between 2011-2012?
- Hint: You can use a table calculation for this.

### Parameter:

1. Duplicate EDA question 1 and create a parameter that will allow a user to choose between hourly casual or registered user usage.
1. Duplicate EDA question 1 and create a parameter that will allow a user to choose between the total of casual or registered user usage, grouped by the `Hr` field.
- Hint: You are changing a field with a parameter.

### Dashboard & Finesse:
Expand All @@ -80,9 +79,9 @@ A few questions have been provided to help guide your thoughts in this exercise.
- Informative titles, axes, and other text elements.

1. Before submitting your work, reflect on these questions. You don't need to submit any answers for these questions.
- Would there be any other data that you wish you had as you worked with this data?
- Is there any other data that you wish you had as you worked with this dataset?
- Are you happy with your data visualizations?
- Is there anything you would like to learn more about for your future projects?
- Is there any Tableau features you would like to learn more about for your future projects?

## Submitting Your Work

Expand Down
25 changes: 14 additions & 11 deletions content/tableau-part-three/studio/_index.md
Original file line number Diff line number Diff line change
Expand Up @@ -9,14 +9,20 @@ weight = 3

### Business Issue:

You work for TarMart corporate. TarMart has 45 department stores across the country, of various
You work for TarMart corporate. TarMart has 45 department stores across the country of various
sizes. Between 2010 and 2013, TarMart has tracked weekly sales of each store by department. Your
supervisors want some insights on which stores and departments are performing the best in weekly
sales.

**Due to the size of the dataset we are using, please use the Desktop version of Tableau Public for these exercises.**

**You can publish a local Tableau workbook to your Tableau Public account by selecting the 'File' dropdown menu in the app toolbar, and then selecting the 'Save to Tableau Public' option.**

**Please make sure to save your local workbook often when using the Desktop version of Tableau Public. You only need to 'Save to Tableau Public' when you finish the exercises and are ready to submit your work.**

### Setting Up the Studio:

Download [Retail Data Analytics data set](https://www.kaggle.com/datasets/manjeetsingh/retaildataset?select=Features+data+set.csv). There are 3 CSV files. To download all CSV files at once, click on the `Download` bubble next to the `New Notebook` bubble. An orange arrow is pointing to the `Download` bubble in the image below.
Download the [Retail Data Analytics data set](https://www.kaggle.com/datasets/manjeetsingh/retaildataset?select=Features+data+set.csv). There are 3 CSV files. To download all CSV files at once, click on the `Download` bubble next to the `New Notebook` bubble.

1. Open the CSV file in Tableau Public.
1. When you open this data set, you will want to create a relationship between the 3 CSV files.
Expand All @@ -28,20 +34,18 @@ This dataset has a large amount of rows that Tableau Public might have trouble q

### EDA

1. What do weekly sales over time look like?
1. What do weekly sales look like over time?
1. Weekly sales by store type?
1. Weekly sales by store?
1. Weekly sales by department?
1. How do sales by store and department change by quarter?

### Calculations

1. Determine a cutoff for constituting a high sale amount.

- Use your discretion from what you observe in your vizzes.
- Use a basic calculation to create a new field indicating your measure for high sales.

- Use this field in a new chart to display which stores return higher sales than others.
1. Determine a cutoff value for constituting a high sale amount.
Use your discretion from what you observe in your vizzes.
- Then, use a basic calculation to create a new field indicating your measure for high sales.
Use this field in a new chart to display which stores return higher sales than others.

1. TarMart wants to know which of the stores included in this dataset have contributed the most to overall sales in the company.

Expand Down Expand Up @@ -74,8 +78,7 @@ This dataset has a large amount of rows that Tableau Public might have trouble q

### Dashboard Clarity

1. Present your charts in a dashboard that summarizes your findings.
1. Now is a good time to update labels and colors for an easily digestible report.
1. Present your charts in a dashboard that summarizes your findings. Now is a good time to update labels and colors for an easily digestible report.

### Finishing Touches

Expand Down

0 comments on commit 7554106

Please sign in to comment.