ASSIGNMENT / REPORT COVER SHEET
This sheet must be attached to all material being submitted for
marking.
Student name: Student Student name:
number: Student number:
Sections completed Sections completed
individually individually
Introduction to Data Science COMP2025
Unit name & number:
Monday 11-1
Tutorial day and time:
Assignment 1
Title of Assignment:
Student Submitting the
Assignment:
11/09/2022
Date submitted:
Student Declaration (must be signed)
Declaration:
I hold a copy of this assignment if the original is lost or damaged.
I hereby certify that no part of this assignment or product has been copied
from any other student’s work or from any other source except where due
acknowledgement is made in the assignment.
No part of the assignment/product has been written / produced for me by
any other person except where collaboration has been authorised by the
subject lecturer/tutor concerned q I am aware that this work may be
reproduced and submitted to plagiarism detection software programs for the
purpose of detecting possible plagiarism (which may retain a copy on its
database for future plagiarism checking))
, I hereby certify that no part of this assignment or product has been
submitted by me in another (previous or current) assessment, except where
appropriately referenced, and with prior permission from the
Lecturer/Tutor/ Unit Co-ordinator for this unit.
q
Student signature and date: Vishwapriya pavate 11/09/2022
Student signature and date
, ITDS_Assignment
2022-09-02
Housing = read.csv("kc_house.csv")
attach(Housing)
head(Housing)
## id price bedrooms bathrooms sqft_living sqft_lot floors
waterfront
## 1 7922800400 95.10 5 3.25 3.25 14.342 2
0
## 2 1516000055 65.00 3 2.25 2.15 21.235 1
0
## 3 2123039032 36.99 1 0.75 0.76 10.079 1
1
## 4 9297300045 55.00 3 2.00 1.97 4.166 2
, 0
## 5 1860600135 238.40 5 2.50 3.65 9.050 2
0
## 6 1560930070 84.00 4 3.50 2.84 40.139 1
0
## sqft_living15 sqft_lot15
## 1 2.96 11.044
## 2 2.57 18.900
## 3 1.23 14.267
## 4 2.39 4.166
## 5 2.88 5.400
## 6 3.18 36.852
#1) Construct the matrix plot and correlation matrix (consider only relevant
variables). Comment on the relationship among variables
pairs(Housing, panel=panel.smooth)
cor(Housing)
## id price bedrooms bathrooms
sqft_living
## id 1.000000000 -0.02353996 0.108453574 0.096787228
0.04223220
## price -0.023539962 1.00000000 0.365855613 0.649982920
0.78807930
## bedrooms 0.108453574 0.36585561 1.000000000 0.554497306