18 - Case Study 1 - Classify Personal
Income (part 2)
ProgrammingKnowledge
Cross Tabulation and Data Visualization
In this video, we'll explore cross tabulation and data
visualization using Pandas and Seaborn libraries.
Column Count
We start by checking the number of columns in our
data frame. We use the data.columns function to
get all the columns.
Gender Proportion
We use the pd.crosstab function to get the count
and proportion of males and females in the
population.
Bivariate Analysis
We explore the relationship between gender and
salary status using the pd.crosstab function. We
create a new variable, gender_salary_stat, to get
the count and proportion of males and females in
di erent salary categories.
Frequency Distribution of Salary Status
We use the Seaborn library to draw a count plot of
the salary status column.
ff