Introduction to Hortonworks Data Platform (HDP)
A platform for data in rest is called HDP.
It is an open-source Apache Hadoop distribution that is safe, business-ready, and
built on a centralised design (YARN).
The following characteristics of HDP:
Open
Central
Interoperable
Enterprise-ready
• Data flow
Kafka :
The publish-subscribe messaging system Apache Kafka is quick, scalable,
reliable, and fault-tolerant.
utilised for creating streaming applications and real-time data pipelines
Since it has a higher throughput, dependability, and replication than
conventional message brokers like JMS and AMQP, it is frequently
employed in their place.
• Kafka works in combination with variety of Hadoop tools:
Apache Storm
Apache HBase
Apache Spark
Sqoop :
a tool for quickly importing data into your Hadoop cluster from related Hadoop
systems (like Hive and HBase) and structured databases (like Db2, MySQL,
Netezza, Oracle, and Mode).
enables the export of data from Hadoop to relational databases and business data
warehouses.
A platform for data in rest is called HDP.
It is an open-source Apache Hadoop distribution that is safe, business-ready, and
built on a centralised design (YARN).
The following characteristics of HDP:
Open
Central
Interoperable
Enterprise-ready
• Data flow
Kafka :
The publish-subscribe messaging system Apache Kafka is quick, scalable,
reliable, and fault-tolerant.
utilised for creating streaming applications and real-time data pipelines
Since it has a higher throughput, dependability, and replication than
conventional message brokers like JMS and AMQP, it is frequently
employed in their place.
• Kafka works in combination with variety of Hadoop tools:
Apache Storm
Apache HBase
Apache Spark
Sqoop :
a tool for quickly importing data into your Hadoop cluster from related Hadoop
systems (like Hive and HBase) and structured databases (like Db2, MySQL,
Netezza, Oracle, and Mode).
enables the export of data from Hadoop to relational databases and business data
warehouses.