Class notes

Big Data Basic Concepts

Pages
60
Uploaded on
06-07-2024
Written in
2023/2024

Covers basic big data concepts such as data science, ML, AI, and big data analytics.

BIG DATA
What is Big Data?
• Big Data is a collection of data that is huge in volume and still growing
exponentially over time.
• Its size and complexity are so great that traditional data management
tools cannot store or process it efficiently.
• In short, big data is ordinary data, but at a very large scale.

Characteristics of Big Data
• Big Data involves a volume of data that traditional data storage and
processing systems cannot handle.
• Many multinational companies use it to process the data and business
of many organizations.
• The data flow is estimated to exceed 150 exabytes per day before replication.
Five V's describe the characteristics of Big Data.
5 V's of Big Data
o Volume
o Veracity
o Variety
o Value
o Velocity

Volume
• The name Big Data itself refers to enormous size. Big Data means vast
'volumes' of data generated daily from many sources, such as business
processes, machines, social media platforms, networks, human
interactions, and many more.
• For example, Facebook generates approximately a billion messages,
records about 4.5 billion clicks of the "Like" button, and receives more than
350 million new posts each day. Big data technologies are designed to
handle data at this scale.
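To make the daily figures above easier to picture, the short sketch below converts them into average per-second rates. The numbers are the approximate ones quoted in these notes; the names `daily_counts` and `per_second` are chosen here for illustration only.

```python
SECONDS_PER_DAY = 24 * 60 * 60  # 86,400 seconds in a day

# Approximate daily figures quoted in the notes (not measured values).
daily_counts = {
    "messages": 1_000_000_000,
    "likes": 4_500_000_000,
    "posts": 350_000_000,
}


def per_second(daily_total: int) -> float:
    """Average number of events per second for a given daily total."""
    return daily_total / SECONDS_PER_DAY


rates = {name: per_second(count) for name, count in daily_counts.items()}

for name, rate in rates.items():
    print(f"{name}: about {rate:,.0f} per second")
```

Even as a rough average, this works out to tens of thousands of events every second, which is the kind of load that a single traditional database struggles to absorb.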

Variety
• Big Data can be structured, unstructured, or semi-structured, and it is
collected from many different sources.
• In the past, data was collected only from databases and spreadsheets, but
today it arrives in an array of forms, such as PDFs, emails, audio, social
media posts, photos, videos, etc.




The data is categorized as below:
a) Structured data: Structured data follows a fixed schema with all the
required columns and is in tabular form. It is stored in a relational database
management system, i.e., in relations (tables). OLTP (Online Transaction
Processing) systems are built to work with structured data.

b) Semi-structured: In semi-structured data, the schema is not strictly
defined, e.g., JSON, XML, CSV, TSV, and email.

c) Unstructured Data: Files with no predefined schema, such as log files,
audio files, and image files, are unstructured data. Some organizations
have a great deal of data available but do not know how to derive value
from it because the data is raw.
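The three categories above can be illustrated with a small sketch using only the Python standard library. The table name, field names, and sample values are made up for illustration; they are not taken from the notes.

```python
import json
import sqlite3

# Structured: a fixed schema, stored as rows in a relational table.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT, city TEXT)")
conn.executemany(
    "INSERT INTO customers VALUES (?, ?, ?)",
    [(1, "Asha", "Pune"), (2, "Ravi", "Mumbai")],
)
rows = conn.execute("SELECT name, city FROM customers ORDER BY id").fetchall()
conn.close()

# Semi-structured: records describe themselves with keys,
# but different records may carry different fields.
semi_text = '[{"id": 1, "tags": ["sale"]}, {"id": 2, "email": "r@example.com"}]'
records = json.loads(semi_text)
all_keys = sorted({key for record in records for key in record})

# Unstructured: raw text with no schema; any structure has to be derived.
log_line = "user 42 failed login from 10.0.0.7"
word_count = len(log_line.split())

print(rows)
print(all_keys)
print(word_count)
```

The structured rows can be queried directly with SQL, the JSON records need their fields inspected first, and the log line has to be parsed before it yields anything useful.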

Veracity
• Veracity refers to how reliable the data is. There are many ways to filter
or translate data to improve its reliability.
• Veracity also covers the ability to handle and manage data efficiently,
which is essential in business development.
• For example, Facebook posts with hashtags.
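As a minimal sketch of filtering data for reliability, the example below drops posts that fail a simple check and then collects hashtags from the rest. The sample posts and the rule in `is_reliable` are assumptions made for illustration; real systems use far richer checks.

```python
# Hypothetical sample posts; fields and values are invented for this sketch.
posts = [
    {"user": "a", "text": "Big sale today #offer"},
    {"user": "", "text": "#offer #offer #offer"},   # no author: unreliable
    {"user": "b", "text": ""},                      # empty text: unreliable
    {"user": "c", "text": "Learning about #BigData"},
]


def is_reliable(post: dict) -> bool:
    """Assumed rule: a post needs a non-empty user and non-empty text."""
    return bool(post["user"].strip()) and bool(post["text"].strip())


def hashtags(text: str) -> list:
    """Return lower-cased hashtags found in the text."""
    return [w.lower() for w in text.split() if w.startswith("#") and len(w) > 1]


clean = [p for p in posts if is_reliable(p)]
tags = [tag for p in clean for tag in hashtags(p["text"])]

print(len(clean), tags)
```

Only the posts that pass the check contribute hashtags, so the result reflects the more trustworthy part of the data.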
Value
• Value is an essential characteristic of big data. What matters is not
merely the data that we process or store, but the valuable and reliable data
that we store, process, and analyze.




Velocity
• Velocity plays an important role alongside the other characteristics. It
refers to the speed at which data is created in real time. It covers the
speed of incoming data sets, their rate of change, and bursts of activity. A
primary aim of Big Data is to deliver data that is in demand rapidly.
• Big data velocity deals with the speed at which data flows from sources
like application logs, business processes, networks, social media sites,
sensors, mobile devices, etc.
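One simple way to reason about velocity and activity bursts is to count how many events arrive within a trailing time window. The sketch below assumes event timestamps in seconds, a one-second window, and a burst threshold of three events; all of these are illustrative choices, not values from the notes.

```python
from collections import deque


def rates_per_window(timestamps, window=1.0):
    """For each event, count the events in the trailing `window` seconds.

    The count includes the event itself. Timestamps must be sorted.
    """
    recent = deque()
    counts = []
    for t in timestamps:
        recent.append(t)
        # Discard events that fell out of the trailing window.
        while recent[0] <= t - window:
            recent.popleft()
        counts.append(len(recent))
    return counts


# Hypothetical arrival times (seconds) with a burst around t = 2.
timestamps = [0.0, 0.5, 2.0, 2.1, 2.2, 2.3, 5.0]
counts = rates_per_window(timestamps)

BURST_THRESHOLD = 3  # assumed threshold for calling something a burst
burst_times = [t for t, c in zip(timestamps, counts) if c >= BURST_THRESHOLD]

print(counts)
print(burst_times)
```

A deque keeps the window update cheap, since each event is added once and removed at most once as the stream moves forward.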

Document information

Uploaded on
July 6, 2024
Number of pages
60
Written in
2023/2024
Type
Class notes
Professor(s)
Sujata
Contains
All classes

Seller: siddhichavan, Pratibha College of Commerce & Computer Studies