Introduction and Overview of Big Data
Big data is exactly what the name suggests, a “big” amount of data.
Big data is a collection of massive and complex data sets and data
volume that include the huge quantities of data, data management
capabilities, social media analytics and real-time data.
Big data analytics is the process of examining large amounts of
data. There exist large amounts of heterogeneous digital data.
Big data is about data volume and large data set's measured in terms
of terabytes. This phenomenon is called Bigdata. After examining
of Bigdata, the data has been launched as Big Data analytics.
, With such a massive amount of data being collected, it only makes
sense for companies to use this data to understand their customers and
their behaviour better.
, There are different kinds of data
1. Structured data
2. Semi structured data
3. Unstructured data
1. Structured data includes quantitative data that is stored in
an organized manner. It consists of numerical and text data. It
is easy to analyse and process structured data. It is generally
stored in a relational database and can be queried using
Structured Query Language (SQL). e.g. Relational data
2. Semi Structured data which is partially structured. e.g.
XML data
, 3. Unstructured data includes qualitative data that lacks any
predefined structure and can come in a variety of formats (images,
mp3 files, wav files, etc.). Unstructured data is said to lack
“structure”. It is stored in a non-relational database and can be
queried using NoSQL. E.g. Word, Pdf, Text, Media Logs.
Big data is exactly what the name suggests, a “big” amount of data.
Big data is a collection of massive and complex data sets and data
volume that include the huge quantities of data, data management
capabilities, social media analytics and real-time data.
Big data analytics is the process of examining large amounts of
data. There exist large amounts of heterogeneous digital data.
Big data is about data volume and large data set's measured in terms
of terabytes. This phenomenon is called Bigdata. After examining
of Bigdata, the data has been launched as Big Data analytics.
, With such a massive amount of data being collected, it only makes
sense for companies to use this data to understand their customers and
their behaviour better.
, There are different kinds of data
1. Structured data
2. Semi structured data
3. Unstructured data
1. Structured data includes quantitative data that is stored in
an organized manner. It consists of numerical and text data. It
is easy to analyse and process structured data. It is generally
stored in a relational database and can be queried using
Structured Query Language (SQL). e.g. Relational data
2. Semi Structured data which is partially structured. e.g.
XML data
, 3. Unstructured data includes qualitative data that lacks any
predefined structure and can come in a variety of formats (images,
mp3 files, wav files, etc.). Unstructured data is said to lack
“structure”. It is stored in a non-relational database and can be
queried using NoSQL. E.g. Word, Pdf, Text, Media Logs.