In this project, you’ll have the opportunity to combine a lot of what you’ve learned throughout this entire program. This project may seem challenging, and there isn’t a lot of guidance provided, as we want you to have the freedom to be creative. That dataset that you are working with is real, and it is pretty big. When you are an analyst for a company, there often isn’t guidance on how to correctly answer a particular question. And there might not really be one right answer. Leverage your skills. Make sure you look through the directions carefully.
Project Description
Your goal in this project is to create visualizations to reveal insights from a data set. You will create data visualizations that tell a story or highlight patterns in the data set. Your work should be a reflection of the theory and practice of data visualization, such as visual encodings, design principles, and effective communication.
There are 3 different data sets you can choose from. (CHOOSE ONE ONLY)
- Flight Delays and Cancellations
- US Census Demographic Data
- Youtube Data from the US
1) Flight Delays and Cancellations
This data comes from a Kaggle dataset, it tracks the on-time performance of US domestic flights operated by large air carriers in 2015. You can find the dataset in supporting materials at the bottom of this page.
The file you must use in creating your data visualizations is the flights.csv
file. The other two provided files may be used in conjunction with the flights.csv file
, but should not be used alone.
You are required to create three visualizations. Some questions you may attempt to answer include those pertaining to the following areas:
- Which airlines or airports have the worst delays? Determine which destinations and arrival destinations have the most delays? Doing this using maps is actually pretty difficult, but you may choose an alternative visual to provide this information. Think about what kind of aggregates might work best to determine which airlines and airports are the best and worst in terms of delays.
- What causes delays? Think about if you work at an airline and you want to decrease delays. What part of the flight causes the most delays? Do these causes vary by airport or time of year?
- You can also come up with your own question! As you work with the data, come up with a question you’re curious about and can be answered from the data. Build a dashboard or story to answer your question and lead viewers to that answer.
2) US Census Demographic Data
This data comes from a Kaggle dataset, it includes the census data for all counties in 2015. You can find the dataset in supporting materials at the bottom of this page. Required dashboards
You are required to create three visualizations. Some questions you may attempt to answer include those pertaining to the following areas:
- Which states have the best transportation? This is a fairly subjective question, so your first job is to define what the best transportation is. Is it highest percentage of transit use? Is it lowest mean commute times. Then you need to determine how to aggregate the data from the county level to the state. Are there outlier counties affecting the data? How should you aggregate all the data from the counties to represent the state effectively? Please provide your reasoning in your report.
- How does income and poverty look across America? Think about how best to contrast this data to show an interesting finding. You can look across many of the different fields to show interesting findings. Do counties with more construction experience more or less poverty? Do counties near the coast experiene more or less income? Remember this is all correlation and not causation so we cannot say any one thing causes it but we can report descriptive statistics.
- You can also come up with your own question! As you work with the data, come up with a question you’re curious about and can be answered from the data. Build a dashboard or story to answer your question and lead viewers to that answer.
3) Youtube Data US
This data comes from a Kaggle dataset, it includes a bunch of information for videos that were trending for at least one day. This data set will require some cleaning in excel prior to Tableau. You can find the dataset in supporting materials at the bottom of this page.
You are required to create three visualizations. Some questions you may attempt to answer include those pertaining to the following areas:
- What tags have grow in popularity over time? (The tag field is very hard to work with, consider asking a different question. Reminder all of the questions here are suggestions, they are not required. There is an additional file with the tags transposed into a a single column that you can make use of as well. To join it in you will want to join it on channel title and title of the video) You will need to first figure out how to clean the data from having bunch of tag words all in one cell to a format where you can count the number of times the individual tag word is used each day, month, or year depending on how you are answering this question
- What categories are the most liked and disliked? To answer this question you will need to replace the numeric category IDs with the actual names which are in a separate file. Then you can looks at this question deeper. Perhaps there is a time of year where one category is preferred? Are there certain channels with an outlier amount of likes or dislikes?
- You can also come up with your own question! As you work with the data, come up with a question you’re curious about and can be answered from the data. Build a dashboard or story to answer your question and lead viewers to that answer.
REQUIREMENTS
Use the Project Rubric to review your project. If you are happy with your submission, then you are ready to submit! If you see room for improvement in any category in which you do not meet specifications, keep working!
Your project will be evaluated by a Udacity reviewer according to the same Project Rubric. Your project must “Meet Specifications” in each category in order for your submission to pass.
REMINDERS
Your visualization work should use Tableau: other visualization tools may not be evaluated by reviewers. Host your dashboards/stories on Tableau Public, you’ll be submitting them to reviewers and this makes them easily accessible.
Remember that your visualization should be explanatory in nature and communicate specific results that you want to show.
SUBMISSION
Ready to submit your project? Click on the “Submit Project” button or go back to your Udacity Home and click on the project, then follow the instructions to submit! Make sure you include the following files and information:
- A PDF or Markdown report that includes the following sections:
- Links to your dashboards or story
- You must submit url links for each of your visuals from Tableau Public. If you need a reminder on how to save to Tableau Public, please see the next concept.
- Summary: brief description of the visualization and the main story or findings conveyed
- Design: explain any design choices you made including changes to the visualization after collecting feedback
- Resources: list of Web sites, books, forums, blog posts, GitHub repositories etc that you referred to or used in this submission (Add N/A if you did not use such resources). Project rubric as follows :
Project: Telling Stories with Data
Visualization is Explanatory
Success Criteria Specifications Does the visualization have a clear finding? The visualization centers on a specific, clear finding in the data. Does the visualization focus on its finding? The selected finding is clearly communicated. Design choices foster communication between the reader and the visualization.Visualization does not add additional colors, shapes, or other design elements in an unnecessary way. Rather, each additional element should add to the insight being made. Design
Success Criteria Specifications Does the written summary reflect what a reader would interpret from the graphic? The written summary should include a brief description of the visualization and state at least one finding.A reader’s summary of the graphic would closely match the written summary in the writeup, and a reader is able to identify at least one main point or relationship that the graphic attempts to convey.To reiterate your report should include at least 3 sets of
We offer the best custom writing paper services. We have answered this question before and we can also do it for you.
GET STARTED TODAY AND GET A 20% DISCOUNT coupon code DISC20