Written by students who passed Immediately available after payment Read online or as PDF Wrong document? Swap it for free 4.6 TrustPilot
logo-home
Summary

Summary Big Data Analytics

Rating
-
Sold
-
Pages
214
Uploaded on
22-08-2023
Written in
2023/2024

It contains notes of Big data Analytics topics covering pig,sqoop,flume,hadoop.

Institution
Course

Content preview

UNIT – III
Introduction to PIG

0...........................00000
0000000000000000000
000000000000000000P
repared by:
` Aditya Sharma

, PIG
Pig’s are omnivores animals which means they
can consume both plants and animals.

The PIG consumes any type of data whether
Structured or unStructured or any other machine
data & helps processing the same.

,What is PIG?
Pig is a high-level programming language useful for analysing large data sets. A pig was a
result of development effort at Yahoo!. It is an open source data flow language.

In a MapReduce framework, programs need to be translated into a series of Map and Reduce
stages. However, this is not a programming model which data analysts are familiar with. So,
in order to bridge this gap, an abstraction called PIG was built on top of Hadoop.

Apache PIG enables people to focus more on analyzing bulk data sets and to spend less
time writing Map-Reduce programs. Similar to Pigs, who eat anything, the PIG
programming language is designed to work upon any kind of data. That's why the name, PIG!

Apache Pig is an abstraction over MapReduce.

• It is a tool/platform which is used to analyze larger sets of data representing them as data
flows.
• Pig is generally used with Hadoop; we can perform all the data manipulation operations in
Hadoop using Apache Pig.
• To write data analysis programs, Pig provides a high-level language known as Pig Latin.
• This language provides various operators using which programmers can develop their own
functions for reading, writing, and processing data.

, Features of Pig:

• Rich set of operators: It provides many operators to perform operations like join, sort,
filer, etc.

• Ease of programming: Pig Latin is similar to SQL and it is easy to write a Pig script if
you are good at SQL.

• UDF’s: Pig provides the facility to create User-defined Functions in other programming
languages such as Java and invoke or embed them in Pig Scripts.

• Handles all kinds of data: Apache Pig analyzes all kinds of data, both structured as well
as unstructured. It stores the results in HDFS.

Written for

Course

Document information

Uploaded on
August 22, 2023
Number of pages
214
Written in
2023/2024
Type
SUMMARY

Subjects

$8.49
Get access to the full document:

Wrong document? Swap it for free Within 14 days of purchase and before downloading, you can choose a different document. You can simply spend the amount again.
Written by students who passed
Immediately available after payment
Read online or as PDF

Get to know the seller
Seller avatar
kondaharshitha11

Also available in package deal

Get to know the seller

Seller avatar
kondaharshitha11 Mlrit
Follow You need to be logged in order to follow users or courses
Sold
-
Member since
2 year
Number of followers
0
Documents
3
Last sold
-

0.0

0 reviews

5
0
4
0
3
0
2
0
1
0

Recently viewed by you

Why students choose Stuvia

Created by fellow students, verified by reviews

Quality you can trust: written by students who passed their tests and reviewed by others who've used these notes.

Didn't get what you expected? Choose another document

No worries! You can instantly pick a different document that better fits what you're looking for.

Pay as you like, start learning right away

No subscription, no commitments. Pay the way you're used to via credit card and download your PDF document instantly.

Student with book image

“Bought, downloaded, and aced it. It really can be that simple.”

Alisha Student

Working on your references?

Create accurate citations in APA, MLA and Harvard with our free citation generator.

Working on your references?

Frequently asked questions