Written by students who passed Immediately available after payment Read online or as PDF Wrong document? Swap it for free 4.6 TrustPilot
logo-home
Class notes

big data

Rating
-
Sold
-
Pages
5
Uploaded on
20-11-2024
Written in
2024/2025

useful for college students for there exams

Institution
Course

Content preview

FUNDAMENTALS OD BIG DATA:

Big data is a term used to describe the large volume of structured and unstructured data that inundates
businesses daily. But it's not the amount of data that's important, it's what organizations do with the
data that matters. Big data can be analyzed for insights that can lead to better decisions and strategic
moves.

The term "big data" refers to data that is too large, too complex, and too diverse to be processed and
analyzed by traditional data-processing software. The concept of big data involves a new approach to
data analytics that allows for data to be analyzed in its raw form, without having to first structure or
categorize it.

There are three main characteristics of big data, often referred to as the "three V's":

1. Volume: Big data involves large amounts of data. This can range from a few terabytes to
petabytes or even exabytes of data.

For example, a social media platform like Facebook generates billions of data points every day, such as
likes, shares, comments, and more. This data can be analyzed to gain insights into user behavior and
preferences, which can then be used to improve the user experience and target ads more effectively.

2. Velocity: Big data is often generated at high speed, making it difficult to process and analyze in
real-time using traditional data-processing software.

For instance, high-frequency stock trading systems generate millions of data points per second,
requiring real-time data processing and analysis to make informed trading decisions.

3. Variety: Big data comes in a wide variety of formats and types, including structured, semi-
structured, and unstructured data.

For example, a healthcare organization might collect data from a variety of sources, including electronic
health records, medical images, lab results, and patient-generated data from wearable devices. This
data can be analyzed to improve patient outcomes, reduce costs, and develop new treatments.

To process and analyze big data, organizations often use distributed computing systems, such as Hadoop
and Spark. These systems enable data to be processed and analyzed in parallel, allowing for faster and
more efficient processing of large volumes of data.

Additionally, machine learning and artificial intelligence algorithms can be used to analyze big data and
uncover insights that would be difficult or impossible for humans to identify on their own.

In summary, big data is a term used to describe the large volume of structured and unstructured data
that inundates businesses daily. It's characterized by its volume, velocity, and variety, and requires
distributed computing systems and machine learning algorithms to process and analyze. With the right
tools and approach, big data can provide valuable insights that can lead to better decisions and strategic
moves.

, Smart devices and the Internet of Things (IoT):



Smart devices and the Internet of Things (IoT) are changing the way we live and work. By connecting
everyday objects to the internet, we can collect and analyze data to gain insights and automate tasks.

For example, take a look at the video about the smart city. Sensors installed throughout the city collect
data on traffic, weather, and pollution. This data is then analyzed and used to optimize traffic flow,
reduce pollution, and improve the overall quality of life for city residents.

Another example is the video about the smart home. A smart thermostat, such as Nest, can learn your
daily routine and adjust the temperature in your home to save energy and keep you comfortable. You
can also control the thermostat remotely using a smartphone app.

In the video about the smart factory, we see how IoT is used to increase efficiency and productivity.
Sensors and machines are connected to the internet, allowing for real-time monitoring and analysis of
production data. This allows for predictive maintenance and reduced downtime.

In the video about the smart farm, we see how IoT is used to optimize crop yields and conserve
resources. Sensors installed in the fields measure soil moisture, temperature, and other environmental
factors. This data is then used to optimize irrigation and fertilization, resulting in higher crop yields and
reduced water usage.

In the video about the smart car, we see how IoT is used to improve safety and convenience. The car is
connected to the internet, allowing for real-time traffic updates, navigation, and even remote
diagnostics.

In the video about the smart health, we see how IoT is used to improve healthcare. Wearable devices
and implantable sensors monitor vital signs, such as heart rate and blood sugar, allowing for real-time
monitoring and early detection of potential health issues.

In summary, Smart devices and the Internet of Things are revolutionizing various industries and aspects
of life by connecting everyday objects to the internet, allowing for the collection and analysis of data,
automation of tasks and improving the overall quality of life, efficiency, productivity, optimizing crop
yields, conserving resources, improving safety and convenience in healthcare.

MapReduce Programming and Hadoop: A Powerful Duo for Big Data Processing

MapReduce and Hadoop are two fundamental concepts in the world of big data processing. In a
nutshell, MapReduce is a programming model for processing large datasets, while Hadoop is a
framework that implements the MapReduce programming model. Together, they allow for the
distributed processing of large datasets across a cluster of computers, making it possible to handle data
too big for a single machine.

Key Concepts in MapReduce

Written for

Institution
Course

Document information

Uploaded on
November 20, 2024
Number of pages
5
Written in
2024/2025
Type
Class notes
Professor(s)
Karthick
Contains
18-04-24

Subjects

$26.39
Get access to the full document:

Wrong document? Swap it for free Within 14 days of purchase and before downloading, you can choose a different document. You can simply spend the amount again.
Written by students who passed
Immediately available after payment
Read online or as PDF

Get to know the seller
Seller avatar
princyleema

Get to know the seller

Seller avatar
princyleema sherine
Follow You need to be logged in order to follow users or courses
Sold
-
Member since
1 year
Number of followers
0
Documents
4
Last sold
-

0.0

0 reviews

5
0
4
0
3
0
2
0
1
0

Recently viewed by you

Why students choose Stuvia

Created by fellow students, verified by reviews

Quality you can trust: written by students who passed their tests and reviewed by others who've used these notes.

Didn't get what you expected? Choose another document

No worries! You can instantly pick a different document that better fits what you're looking for.

Pay as you like, start learning right away

No subscription, no commitments. Pay the way you're used to via credit card and download your PDF document instantly.

Student with book image

“Bought, downloaded, and aced it. It really can be that simple.”

Alisha Student

Working on your references?

Create accurate citations in APA, MLA and Harvard with our free citation generator.

Working on your references?

Frequently asked questions