D465 - Data Applications | STUDY GUIDE
R and Python similarities Both widely used in data science with extensive libraries.
R unique challenges Steeper learning curve, limited web dev capabilities.
Advantage of storing code in Allows reproducibility and collaboration among analysts.
R
Programming languages and Python: web dev, data science, ML, automation.
use cases
MIN function in spreadsheets Returns the smallest value in a cell range.
COUNTIF function in Counts cells meeting a specified condition.
spreadsheets
Pivot table elements Rows, columns, values, filters for data aggregation.
SELECT command in SQL Retrieves data from one or more database tables.
JOIN commands in SQL INNER, LEFT, RIGHT, FULL OUTER JOIN types.
Delimiter for code chunks Triple backticks or markup to define code sections.
Output formats for HTML, PDF, Word (docx), Markdown.
documents
Presentation formats Slides, Dashboards, Interactive web apps.
Knit button in R Compiles R Markdown into desired output formats.
Symbol for comments in R Pound sign (`#`) precedes comments in R.
Nested function usage Simplifies operations, improves code readability.
Logical operators AND (&&), OR (||), NOT (!).
Advantage of tidyverse Cohesive data manipulation packages in R.
Functions in ggplot2 ggplot(), geom_point(), geom_line(), aes().
Plus sign in ggplot2 Adds layers to ggplot objects for customization.
,Common errors in ggplot2 Incorrect aesthetic mappings, syntax misunderstanding.
Basic aesthetic attributes in x-axis, y-axis, color for plot customization.
ggplot2
Smoothing line usage Visual representation of trends in data.
, dplyr filter() function Subset rows based on specific conditions in R.
VLOOKUP function in Searches for values in a vertical column.
spreadsheets
Locking table array in VLOOKUP Prevents range changes for formula accuracy.
Different JOIN functions in SQL INNER, LEFT, RIGHT, FULL OUTER JOIN types.
COUNT vs. COUNT DISTINCT in COUNT: total rows, COUNT DISTINCT: unique values.
SQL
SELECT statement usage in SQL Retrieving data from one or more tables.
FROM statement in SQL Specifies tables for data retrieval in SQL queries.
Tibbles vs. data frames Modernized data frames with improved features.
Main operators in R Arithmetic, relational, logical, assignment operators.
sample() function for biased data Creates random unbiased data samples.
Fill in the blank: The spreadsheet COUNTIF
function
___returns the number of cells
within a
range that match a specified
value.
COUNT
IF
ARRAY
COUNT
DISTINCT
VALUE
What is an example of an array The values in cells B2 through B31
in a spreadsheet?
R and Python similarities Both widely used in data science with extensive libraries.
R unique challenges Steeper learning curve, limited web dev capabilities.
Advantage of storing code in Allows reproducibility and collaboration among analysts.
R
Programming languages and Python: web dev, data science, ML, automation.
use cases
MIN function in spreadsheets Returns the smallest value in a cell range.
COUNTIF function in Counts cells meeting a specified condition.
spreadsheets
Pivot table elements Rows, columns, values, filters for data aggregation.
SELECT command in SQL Retrieves data from one or more database tables.
JOIN commands in SQL INNER, LEFT, RIGHT, FULL OUTER JOIN types.
Delimiter for code chunks Triple backticks or markup to define code sections.
Output formats for HTML, PDF, Word (docx), Markdown.
documents
Presentation formats Slides, Dashboards, Interactive web apps.
Knit button in R Compiles R Markdown into desired output formats.
Symbol for comments in R Pound sign (`#`) precedes comments in R.
Nested function usage Simplifies operations, improves code readability.
Logical operators AND (&&), OR (||), NOT (!).
Advantage of tidyverse Cohesive data manipulation packages in R.
Functions in ggplot2 ggplot(), geom_point(), geom_line(), aes().
Plus sign in ggplot2 Adds layers to ggplot objects for customization.
,Common errors in ggplot2 Incorrect aesthetic mappings, syntax misunderstanding.
Basic aesthetic attributes in x-axis, y-axis, color for plot customization.
ggplot2
Smoothing line usage Visual representation of trends in data.
, dplyr filter() function Subset rows based on specific conditions in R.
VLOOKUP function in Searches for values in a vertical column.
spreadsheets
Locking table array in VLOOKUP Prevents range changes for formula accuracy.
Different JOIN functions in SQL INNER, LEFT, RIGHT, FULL OUTER JOIN types.
COUNT vs. COUNT DISTINCT in COUNT: total rows, COUNT DISTINCT: unique values.
SQL
SELECT statement usage in SQL Retrieving data from one or more tables.
FROM statement in SQL Specifies tables for data retrieval in SQL queries.
Tibbles vs. data frames Modernized data frames with improved features.
Main operators in R Arithmetic, relational, logical, assignment operators.
sample() function for biased data Creates random unbiased data samples.
Fill in the blank: The spreadsheet COUNTIF
function
___returns the number of cells
within a
range that match a specified
value.
COUNT
IF
ARRAY
COUNT
DISTINCT
VALUE
What is an example of an array The values in cells B2 through B31
in a spreadsheet?