Data science is the process of extracting insights from data using technology. It involves using
advanced tools and programming languages such as Python and R to analyze big data, which is
not feasible with traditional tools like Microsoft Excel. A typical data science project involves a
business problem, data collection, data cleaning and exploration, building a model, and
deploying the model to production.
Example: Restaurant Business
For instance, let's consider a restaurant business with locations in New York and San Diego. The
local managers send monthly sales data in an Excel file to the business owner. By plotting a side-
by-side bar chart of the monthly sales numbers, the business owner can identify that the New
York restaurant is not performing well during summer months. This visualization can help the
owner make a business decision to run a special promotion during the summer in the New York
restaurant.
Tools Used in Data Science
For data storage and distributed computing, data scientists use technologies like Apache Hadoop
and Apache Spark. For visualization, they use tools like Tableau, Power BI, Matplotlib, and
Jupiter Notebook. For deep learning, they might use Tensorflow, PyTorch, or Scikit-learn.
Real-Life Examples
Data science is used heavily in various industries, including:
Amazon: product recommendations and video recommendations
Healthcare: analyzing data from wearable devices to predict heart disease or respiratory
problems
Finance: fraud detection using advanced algorithms
Shipping: finding the best route between two destinations
1. How to Install Python 3.5 on Windows
To begin, go to the Python.org website. Click on the "Downloads" button and select "Python
3.5".
If you prefer to install previous versions, such as 2.7.11, you can click on the corresponding
download button. Once you have selected Python 3.5, click on "Download".
, If you are using Firefox or Internet Explorer, the download will be saved in a different location.
After downloading, click on the setup file and select "Add Python 3.5 to PATH". This is an
important step.
Choose "Customized Installation" and select "Install for all users". This will install Python to the
default location.
After a few minutes, Python should be successfully installed. You can verify this by running
IDLE or the Windows Command Prompt. To open IDLE, click on the "Start" button and type
"IDLE". To increase font size, go to "Options" and select "Configure IDLE".
To write a simple Python program, open IDLE and type in your code. For example, you can use
Python as a calculator or print a statement. IDLE is a good editor for beginners and allows you to
write simple Python programs.
If you are using Windows Command Prompt, type "python" to access the Python shell.
However, IDLE is preferred as it has syntax highlighting and other cool features.
If you encounter issues with Windows not recognizing the Python command, check that the
Python installation directory is present in the system path. You can manually add the path to the
system environment variables.
2. Variables in python [Python 3 Programming]
Variables in Python
Variables are containers that can hold any type of data such as numerical data, tax data, etc. They
are used to store and manipulate data in Python. For example, you can use them to track your
monthly expenses in Python programming language.
Creating Variables
To create a variable, you need to use an assignment operator. The left-hand side of the operator
is the variable name, and the right-hand side is the value. For instance, you can create variables
for your monthly expenses:
Rent: $1220