M211
Data Management and Visualization
About This Badge
How might a social media company’s data about teen social media use differ from teens’ self-reported hours of social media use? What part of a story is told by data? How is data generated or collected? What kind of measurement can give you insight into an unanswered question? Data can be messy, and its collection requires careful consideration. In M211 Data Management and Visualization, you will explore the first steps a data scientist takes when dealing with univariate, bivariate, and multivariate data. You will organize data into rows and columns and learn how to deal with missing values. You will practice representing and describing data, and transforming it as needed for a chosen predictive modeling procedure. You will decide which visual representation best fits the type of data you have. Using data visualization, you will consider what story you can tell from the data. After gathering data ethically, you can expect to use technology to organize, clean, and represent it in a meaningful way. Data management and visualization are useful for careers in a variety of fields, such as data analytics, marketing, programming, and investigative journalism.
Suggested prerequisites for this badge: concepts of addition, subtraction, multiplication, and division; ratio concepts; solving problems involving percentages; M113 Modeling with Probability.
This badge is suggested as a prerequisite for: M212 Predictive Modeling; M213 Statistical Error and Predictive Model Validation.