The environment that enables you to work with R, RStudio, will be explained to you. The software tools and programmes that are exclusive to R, including R packages, will also be covered in this course.
You’ll learn how R enables you to more effectively clean, organise, analyse, visualise, and report data. Current Google data analysts will continue to guide you and give you practical examples of how to carry out typical data analyst jobs using the best instruments and resources.Join Now
Data Analysis with R Programming Coursera Quiz Answers
Data Analysis with R Programming Week 01 Quiz Answers
Quiz- 1
L2 Programming languages:
Q1. Fill in the blank: Programming involves _____ a computer to perform an action or set of actions.
- updating
- instructing
- training
- filtering
Q2. What are Python, JavaScript, SAS, Scala, and Julia?
- Integrated development environments
- Databases
- Programming languages
- Web applications
Q3. What are the benefits of using a programming language to work with your data? Select all that apply.
- Clarify the steps of your analysis
- Easily reproduce and share your work
- Save time
- Choose a business task for analysis
Quiz- 2
L3 R programming language:
Q1. Open-source code is only available to people who pay a subscription fee.
- True
- False
Q2. The R programming language can be used for which of the following tasks? Select all that apply.
- Data analysis
- Visualization
- Statistical analysis
- Gaming
Q3. Which of the following terms best describes the R programming language?
- Open-data
- Data-centric
- Closed-source
- Open-ended
Quiz- 3
L4 Programming with RStudio:
Q1. What type of software application is RStudio?
- Data visualization tool
- Source editor
- Database
- Integrated development environment
Q2. RStudio includes which of the following panes? Select all that apply.
- Environment pane
- Source editor pane
- Command pane
- R console pane
Q3. If you write code directly in the R console, RStudio will automatically save your code when you close your current session.
- True
- False
Data Analysis with R Programming Weekly challenge 1 Answers
Q1. A data analyst uses words and symbols to give instructions to a computer. What are the words and symbols known as?
- Syntax language
- Function language
- Programming language
- Coded language
Q2. Many data analysts prefer to use a programming language for which of the following reasons? Select all that apply.
- To choose a topic for analysis
- To easily reproduce and share an analysis
- To clarify the steps of an analysis
- To save time
Q3. Which of the following are benefits of open-source code? Select all that apply.
- Anyone can fix bugs in the code
- Anyone can create an add-on package for the code
- Anyone can pay a fee for access to the code
- Anyone can use the code for free
Q4. Fill in the blank: The benefits of using _____ for data analysis include the ability to quickly process lots of data and create high quality visualizations.
- the R programming language
- a dashboard
- a spreadsheet
- structured query language
Q5. A data analyst needs to quickly create a series of scatterplots to visualize a very large dataset. What should they use for the analysis?
- Structured query language
- A slide presentation
- A dashboard
- R programming language
Q6. RStudio’s integrated development environment lets you perform which of the following actions? Select all that apply.
- Install R packages
- Create data visualizations
- Import data from spreadsheets
- Stream online videos
Q7. In which two parts of RStudio can you execute code? Select all that apply.
- The environment pane
- The plots pane
- The source editor pane
- The R console pane
Q8. Fill in the blank: In RStudio, the _____ is where you can find all the data you currently have loaded, and can easily organize and save it.
- environment pane
- plots pane
- R console pane
- source editor pane;
Data Analysis with R Programming Week 02 Quiz Answers
Quiz- 1
L2 Programming concepts:
Q1. Why do analysts use comments In R programming? Select all that apply.
- To make an R Script more readable
- To explain their code
- To act as functions
- To provide names for variables
Q2. What should you use to assign a value to a variable in R?
- A vector
- An operator
- A comment
- An argument
Q3. Which of the following examples is the proper syntax for calling a function in R? 1 point
- <- 20
- print()
- data_1
- #first
Q4. Which of the following examples can you use in R for date/time data? Select all that apply. 1 point
- 2018-12-21 16:35:28 UTC
- 2019-04-16
- 06:11:13 UTC
- 07/24-2018
Quiz- 2
L3 Coding in R:
Q1. An analyst includes the following calculation in their R programming: midyear_sales <- (quarter_1_sales + quarter_2_sales) – overhead_costs Which variable will the total from this calculation be assigned to?
- midyear_sales
- quarter_1_sales
- quarter_2_sales
- overhead_costs
Q2. An analyst is checking the value of the variable x using a logical operator, so they run the following code:
x > 35 & x < 65
Which values of x would return TRUE when the analyst runs the code? Select all that apply.
- 35
- 50
- 60
- 70
Q3. Which of the following functions can analysts use to create conditional statements in their R programming? Select all that apply.
- print()
- else()
- if()
- c()
Quiz- 3
L4 R Packages:
Q1. When using RStudio, what does the installed.packages() function do?
- Presents a list of packages currently installed in an RStudio session
- Selects the best packages to use based on an analyst’s current needs
- Creates code for analysts to use to edit their packages
- Installs all available packages for use in an RStudio session
Q2. In data analytics, what is CRAN?
- A commonly used online archive with R packages and other R resources
- A collection of packages that function together to make analysis in R more efficient
- An R interface that has many of the same functions as RStudio
- A function for finding packages to use for analysis in RStudio
Q3. What are ggplot2, tidyr, dplyr, and forcats all a part of?
- A list of functions that clean data efficiently
- A list of variables for use in programming in RStudio
- A collection of core tidyverse packages
- A collection of commonly used, CRAN-based data sets;
Quiz- 4
L5 Explore the tidyverse:
Q1. When working in R, for which part of the data analysis process do analysts use the tidyr package?
- Data security
- Data visualization
- Data cleaning
- Data calculations
Q2. Which tidyverse package contains a set of functions, such as select(), that help with data manipulation?
- forcats
- ggplot2
- readr
- dplyr
Q3. An analyst is organizing a dataset in RStudio using the following code:
arrange(filter(Storage_1, inventory >= 40), count)
Which of the following examples is a nested function in the code?
- filter
- arrange
- inventory
- count
Data Analysis with R Programming Weekly challenge 2 Answers
Q1. Which of the following is an example of a piece of R code that contains both a function and an argument?
- print(“peaches”)
- weekly_sales <- 7450
- #filter
- mass > 1000
Q2. A data analyst is assigning a variable to a value in their company’s sales dataset for 2020. Which variable name uses the correct syntax?
- _2020sales
- sales_2020
- -sales-2020
- 2020_sales
Q3. You want to create a vector with the values 12, 23, 51, in that exact order. After specifying the variable, what R code chunk allows you to create the vector?
- v(12, 23, 51)
- c(12, 23, 51)
- c(51, 23, 12)
- v(51, 23, 12)
Q4. An analyst comes across dates listed as strings in a dataset, for example December 10th, 2020. To convert the strings to a date/time data type, which function should the analyst use?
- mdy()
- now()
- datetime()
- lubridate()
Q5. A data analyst inputs the following code in RStudio:
sales_1 <- (3500.00 * 12)
Which of the following types of operators does the analyst use in the code? Select all that apply.
- Assignment
- Arithmetic
- Logical
- Relational
Q6. A data analyst is deciding on naming conventions for an analysis that they are beginning in R. Which of the following rules are widely accepted stylistic conventions that the analyst should use when naming variables? Select all that apply.
- Use single letters, such as “x” to name all variables
- Use an underscore to separate words within a variable name
- Use all lowercase letters in variable names
- Begin all variable names with an underscore
Q7. Which of the following are included in R packages? Select all that apply.
- Tests for checking your code
- Sample datasets
- Reusable R functions
- Naming conventions for R variable names
Q8. Packages installed in RStudio are called from CRAN. CRAN is an online archive with R packages and other R-related resources.
- True
- False
Q9. When programming in R, what is a pipe used as an alternative for?
- Variable
- Vector
- Nested function
- Installed package
Data Analysis with R Programming Week 03 Quiz Answers
Quiz- 1
L2 Explore data and R:
Q1. Which of the following are best practices for creating data frames? Select all that apply.
- Columns should be named
- Data can be stored as many different types
- Rows should be named
- Each column should contain the same number of data items
Q2. Why are tibbles a useful variation of data frames?
- Tibbles make changing the names of variables easier.
- Tibbles can create row names
- Tibbles make printing easier
- Tibble can change the data type of inputs
Q3. Tidy data is a way of standardizing the organization of data within R.
- True
- False
Q4. Which R function can be used to make changes to a data frame?
- colnames()
- mutate()
- str()
- head()
Quiz- 2
L3 Cleaning data:
Q1. A data analyst is cleaning their data in R. They want to be sure that their column names are unique and consistent to avoid any errors in their analysis. What R function can they use to do this automatically?
- rename()
- select()
- rename_with()
- clean_names()
Q2. A data analyst is trying to sort the penguins bill_length_mm data in descending order. They input the following code:
penguins %>%
What code does the analyst add to organize the column bill_length_mm in descending order?
- arrange(-bill_length_mm)
- arrange(=bill_length_mm)
- arrange(+bill_length_mm)
- arrange(%>%bill_length_mm)
Q3. A data analyst is working with customer information from their company’s sales data. The first and last names are in separate columns, but they want to create one column with both names instead. Which of the following functions can they use?
- separate()
- unite()
- arrange()
- select();
Quiz- 3
L4 R functions:
Q1. Which of the following functions can a data analyst use to get a statistical summary of their dataset? Select all that apply.
- cor()
- ggplot2()
- sd()
- mean()
Q2. A data analyst inputs the following command:
quartet %>% group_by(set) %>% summarize(mean(x), sd(x), mean(y), sd(y), cor(x, y)).
Which of the functions in this command can help them determine how strongly related their variables are?
- mean(y)
- sd(x)
- cor(x,y)
- sd(y)
Q3. Fill in the blank: The bias function compares the actual outcome of the data with the _____ outcome to determine whether or not the model is biased.
- probable
- desired
- predicted
- final;
Data Analysis with R Programming Weekly challenge 3 Answers
Q1. A data analyst is creating a new data frame. Their dataset has dates, currency, and text strings. What characteristic of data frames is this an instance of?
- Data stored can be many different types
- Columns should contain the same number of items
- Columns should be named
- Variables should be named
Q2. A data analyst is considering using tibbles instead of basic data frames. What are some of the limitations of tibbles? Select all that apply.
- Tibbles can overload a console
- Tibbles can never create row names
- Tibbles won’t automatically change the names of variables
- Tibbles can never change the input type of the data
Q3. A data analyst is working with a large data frame. It contains so many columns that they don’t all fit on the screen at once. The analyst wants a quick list of all of the column names to get a better idea of what is in their data. What function should they use?
- colnames()
- head()
- str()
- mutate()
Q4. A data analyst is working with the ToothGrowth dataset in R. What code chunk will allow them to get a quick summary of the dataset?
- glimpse(ToothGrowth)
- min(ToothGrowth)
- separate(ToothGrowth)
- colnames(ToothGrowth)
Q5. A data analyst is working with the penguins dataset. What code chunk does the analyst write to make sure all the column names are unique and consistent and contain only letters, numbers, and underscores?
- drop_na(penguins)
- clean_names(penguins)
- rename(penguins)
- select(penguins)
Q6. A data analyst is working with the penguins data. They write the following code:
penguins %>%
The variable species includes three penguin species: Adelie, Chinstrap, and Gentoo. What code chunk does the analyst add to create a data frame that only includes the Gentoo species?
- filter(Gentoo == species)
- filter(species <- “Gentoo”)
- filter(species == “Gentoo”)
- filter(species == “Adelie”)
Q7. A data analyst is working with the penguins dataset. They write the following code:
penguins %>%
group_by(species) %>%
What code chunk does the analyst add to find the mean value for the variable body_mass_g?
- summarize(=body_mass_g)
- summarize(max(body_mass_g))
- summarize(mean(body_mass_g))
- summarize(body_mass_g(mean))
Q8. A data analyst is working with a data frame named salary_data. They want to create a new column named wages that includes data from the rate column multiplied by 40. What code chunk lets the analyst create the wages column?
- mutate(salary_data, rate = wages * 40)
- mutate(wages = rate * 40)
- mutate(salary_data, wages = rate * 40)
- mutate(salary_data, wages = rate + 40)
Q9. A data analyst is working with a data frame named customers. It has separate columns for area code (area_code) and phone number (phone_num). The analyst wants to combine the two columns into a single column called phone_number, with the area code and phone number separated by a hyphen. What code chunk lets the analyst create the phone_number column?
- unite(customers, area_code, phone_num, sep=”-“)
- unite(customers, “phone_number”, area_code, phone_num)
- unite(customers, “phone_number”, area_code, sep=”-“)
- unite(customers, “phone_number”, area_code, phone_num, sep=”-“)
Q10. A data analyst wants to summarize their data with the sd(), cor(), and mean(). What kind of measures are these?
- Statistical
- Numerical
- Summary
- Standard
Q11. In R, which statistical measure demonstrates how strong the relationship is between two variables?
- Standard deviation
- Correlation
- Average
- Maximum
Q12. A data analyst is studying weather data. They write the following code chunk:
bias(actual_temp, predicted_temp)
What will this code chunk calculate?
- The minimum difference between the actual and predicted values
- The maximum difference between the actual and predicted values
- The average difference between the actual and predicted values
- The total average of the values;
Data Analysis with R Programming Week 04 Quiz Answers
Quiz- 1
L2 Aesthetics in analysis:
Q1. In ggplot2, you can use the _____ function to specify the data frame to use for your plot.
- labs()
- aes()
- geom_point()
- ggplot()
Q2. In ggplot2, you use the plus sign (+) to add a layer to your plot.
- True
- False
Q3. In ggplot2, what function do you use to map variables in your data to visual features of your plot?
- The aes() function
- The geom_bar() function
- The ggplot() function
- The geom_point() function
Q4. What type of plot will the following code create?
ggplot(data = penguins) +
geom_point(mapping = aes(x = flipper_length_mm, y = body_mass_g))
- Bar chart
- Scatterplot
- Line diagram
- Boxplot;
Quiz- 2
L3 Aesthetics in analysis:
Q1. Which of the following aesthetics attributes can you map to the data in a scatterplot? Select all that apply.
- Text
- Color
- Size
- Shape
Q2. Which of the following functions let you display smaller groups, or subsets, of your data?
- ggplot()
- geom_bar()
- geom_point()
- facet_wrap()
Q3. You can use the color aesthetic to add color to the outline of each bar in a bar chart.
- True
- False
Q4. What is the role of the x argument in the following code?
ggplot(data = diamonds) +
geom_bar(mapping = aes(x = cut))
- A dataset
- A function
- A variable
- An aesthetic
Q5. A data analyst creates a scatterplot with a lot of data points. It is difficult for the analyst to distinguish the individual points on the plot because they overlap. What function could the analyst use to make the points easier to find?
- geom_line()
- geom_bar()
- geom_jitter()
- geom_point();
Quiz- 3
L4 Annotating and saving visualizations:
Q1. Which of the following are benefits of adding labels and annotations to your plot? Select all that apply.
- Indicating the main purpose of your plot
- Helping stakeholders quickly understand your plot
- Highlighting important data in your plot
- Choosing a geom for your plot
Q2. A data analyst is creating a plot for a presentation to stakeholders. The analyst wants to add a title, subtitle, and caption to the plot to help communicate important information. What function could the analyst use?
- The geom_bar() function
- The facet_wrap() function
- The geom_point() function
- The labs() function
Q3. What function can you use to put a text label inside the grid of your plot to call out specific data points?
- The annotate() function
- The labs() function
- The aes() function
- The facet_wrap() function
Q4. A data analyst wants to add the title “Penguins” to a plot that visualizes the penguins dataset. What is the correct syntax for the argument of the labs() function?
- labs(title <- “Penguins”))
- labs(title = “Penguins”)
- labs(“Penguins”)
- labs(“Penguins” = title)
Q5. Which of the following functions can you use to save your plots in ggplot2?
- The ggsave() function
- The ggplot() function
- The saveplot() function
- The ggplotsave() function;
Data Analysis with R Programming Weekly challenge 4 Answers
Q1. Which of the following are benefits of using ggplot2? Select all that apply.
- Automatically clean data before creating a plot
- Easily add layers to your plot
- Combine data manipulation and visualization
- Customize the look and feel of your plot
Q2. In ggplot2, what symbol do you use to add layers to your plot?
- The equal sign (=)
- The ampersand symbol (&)
- The pipe operator (%>%)
- The plus sign (+)
Q3. A data analyst creates a plot using the following code chunk:
ggplot(data = penguins) +
geom_point(mapping = aes(x = flipper_length_mm, y = body_mass_g))
Which of the following represents a variable in the code chunk? Select all that apply.
- body_mass_g
- x
- flipper_length_mm
- y
Q4. A data analyst uses the aes() function to define the connection between their data and the plots in their visualization. What argument is used to refer to matching up a specific variable in your data set with a specific aesthetic?
- Faceting
- Mapping
- Jittering
- Annotating
Q5. A data analyst is working with the penguins data. The analyst creates a scatterplot with the following code:
ggplot(data = penguins) +
geom_point(mapping = aes(x = flipper_length_mm, y = body_mass_g,alpha = species))
What does the alpha aesthetic do to the appearance of the points on the plot?
- Makes some points on the plot more transparent
- Makes the points on the plot more colorful
- Makes the points on the plot smaller
- Makes the points on the plot larger
Q6. You are working with the penguins dataset. You create a scatterplot with the following code chunk:
ggplot(data = penguins) +
geom_point(mapping = aes(x = flipper_length_mm, y = body_mass_g))
How do you change the second line of code to map the aesthetic size to the variable species?
- geom_point(mapping = aes(x = flipper_length_mm, y = body_mass_g, species = size)
- geom_point(mapping = aes(x = flipper_length_mm, y = body_mass_g, size = species))
- geom_point(mapping = aes(x = flipper_length_mm, y = body_mass_g, species + size)
- geom_point(mapping = aes(x = flipper_length_mm, y = body_mass_g, size + species))
Q7. Fill in the blank: The _____ creates a scatterplot and then adds a small amount of random noise to each point in the plot to make the points easier to find.
- geom_bar() function
- geom_jitter() function
- geom_smooth() function
- geom_point() function
Q8. You have created a plot based on data in the diamonds dataset. What code chunk can be added to your existing plot to create wrap around facets based on the variable color?
- facet_wrap(~color)
- facet_wrap(color)
- facet_wrap(color~)
- facet(~color);
Q9. A data analyst uses the annotate() function to create a text label for a plot. Which attributes of the text can the analyst change by adding code to the argument of the annotate() function? Select all that apply.
- Change the size of the text
- Change the font style of the text
- Change the color of the text
- Change the text into a title for the plot
Q10. You are working with the penguins dataset. You create a scatterplot with the following lines of code:
ggplot(data = penguins) +
geom_point(mapping = aes(x = flipper_length_mm, y = body_mass_g)) +
What code chunk do you add to the third line to save your plot as a jpeg file with “penguins” as the file name?
- ggsave(penguins)
- ggsave(“penguins.jpeg”)
- ggsave(penguins.jpeg)
- ggsave(“jpeg.penguins”)
Data Analysis with R Programming Week 05 Quiz Answers
Quiz- 1
L2 Documentation and reports:
Q1. R Markdown allows you to create a record of the steps you took to complete your analysis directly in RStudio.
- True
- False
Q2. Fill in the blank: Markdown is a _____ for formatting plain text files.
- file application
- coding language
- guide
- syntax
Q3. A data analyst creates an interactive version of their R Markdown document to share with other users that allows them to execute code the analyst wrote. What did they create?
- A markdown
- An R notebook
- A code chunk
- An HTML report
Q4. A data analyst wants to convert their R Markdown file into another format. What are their options? Select all that apply.
- HTML, PDF, and Word
- Slide presentation
- JPEG, PNG, and GIF
- Dashboard
Q5. A data analyst has finished editing their R Markdown file and wants to save it as an HTML report. What tool will they use?
- Knit
- Output
- Save
- Hashtags
Quiz- 2
L3 Creating R Markdown documents:
Q1. What information does a data analyst usually find in the header section of an RMarkdown document? Select all that apply.
- Title and author
- Conclusions
- File type
- Date
Q2. While formatting their R Markdown document, a data analyst decides to make one of the headers smaller. What do they type into the document to do this?
- Brackets
- Parentheses
- Hashtags
- Backticks
Q3. Inline code can be inserted directly into a .rmd file.
- True
- False
Q4. To create bullet points to their output document, a data analyst adds _____ to their RMarkdown document.
- brackets
- hashtags
- asterisks
- spaces
Q5. A data analyst wants to embed a link in their RMarkdown document. They write (click here!)(www.rstudio.com) but it doesn’t work. What should they write instead?
- [click here!](www.rstudio.com)
- <click here!>(www.rstudio.com)
- “click here!”(www.rstudio.com)
- click here!(www.rstudio.com) Quiz- 3
L4 Code chunks:
Q1. A data analyst includes a section of code in their RMarkdown file so they can add comments and allow stakeholders to run it. What is this the term for this section of code?
- Template
- Markdown
- YAML
- Code chunk
Q2. Fill in the blank: A delimiter is a character that marks the beginning and end of _____.
- a data item
- an HTML report
- a command line
- an .rmd file
Q3. Data analysts put three backticks at the end of their code chunks to act as a delimiter.
- True
- False
Q4. A data analyst has to create a monthly report for their stakeholders. What can they create to help them save time generating these reports?
- HTML report
- .rmd file
- Template
- R notebook
Q5. A data analyst wants to mark the beginning of their code chunk. What delimiter should they type in their .rmd file?
- +++{r }
- “`{r }
- ==={r }
- ***{r }
Data Analysis with R Programming Weekly challenge 5 Answers
Q1. A data analyst wants to create a shareable report of their analysis with documentation of their process and notes explaining their code to stakeholders. What tool can they use to generate this?
- Code chunks
- Filters
- Dashboards
- R Markdown
Q2. Fill in the blank: R Markdown notebooks can be converted into HTML, PDF, and Word documents, slide presentations, and _____.
- dashboards
- spreadsheets
- tables
- YAML
Q3. A data analyst notices that their header is much smaller than they wanted it to be. What happened?
- They have too few hashtags
- They have too few asterisks
- They have too many hashtags
- They have too many asterisks
Q4. A data analyst wants to include a line of code directly in their .rmd file in order to explain their process more clearly. What is this code called?
- Inline code
- YAML
- Documented
- Markdown
Q5. What symbol can be used to add bullet points in R Markdown?
- Backticks
- Asterisks
- Brackets
- Exclamation marks
Q6. A data analyst adds a section of executable code to their .rmd file so users can execute it and generate the correct output. What is this section of code called?
- Data plot
- YAML
- Documentation
- Code chunk
Q7. A data analyst is inserting a line of code directly into their .rmd file. What will they use to mark the beginning and end of the code?
- Hashtags
- Delimiters
- Asterisks
- Markdown
Q8. If an analyst creates the same kind of document over and over or customizes the appearance of a final report, they can use _____ to save them time.
- a filter
- a template
- an .rmd file
- a code chunk
.
Review:
Based on our knowledge, we urge you to enroll in this course so you can pick up new skills from specialists. It will be worthwhile, we trust.