Tips for setting up Apache Hive
- Last Updated: May 21, 2024
For Apache Hive to work with PI Integrator for Business Analytics, you must complete the following tasks:
- Create the HDFS user.
- Create the HCat user.
- Create the user directory and grant permissions.
- Disable SASL.
The following procedure describes how to complete these tasks.
- Create the HDFS user.
  - Create the HDFS user on the node where WebHDFS is installed.
  - Add the HDFS user to a group.
    Note: The HDFS user can be added to any group. In this procedure, that group is referred to as the HDFS group.
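As a rough sketch, creating the HDFS user on a typical Linux node might look like the following. The user name `piintegrator` and the group name `hdfsgroup` are hypothetical placeholders, not names taken from this procedure, and the script assumes the standard `groupadd`, `useradd`, and `usermod` utilities are available. By default it only prints the commands (`DRY_RUN=1`); set `DRY_RUN=0` and run it with root privileges to execute them.

```shell
#!/bin/sh
# Hypothetical names (assumptions); substitute the values for your environment.
HDFS_USER="${HDFS_USER:-piintegrator}"
HDFS_GROUP="${HDFS_GROUP:-hdfsgroup}"

# Print each command in dry-run mode (the default); execute it otherwise.
run() {
    if [ "${DRY_RUN:-1}" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# Run on the node where WebHDFS is installed.
create_hdfs_user() {
    run groupadd "$HDFS_GROUP"                      # the "HDFS group"
    run useradd "$HDFS_USER"                        # create the HDFS user
    run usermod -a -G "$HDFS_GROUP" "$HDFS_USER"    # add the user to the group
}

create_hdfs_user
```

If the group already exists on the node, the `groupadd` line can be dropped.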
- Create the HCat user.
  (Optional) Complete this step only if HCat is installed on a different node than WebHDFS.
  - Create the HCat user on the node where HCat is installed.
  - Add the HCat user to the HDFS group.
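When HCat runs on a separate node, the HCat user step could be sketched as follows. The user name `hcat` and the group name `hdfsgroup` are hypothetical placeholders, and the sketch assumes that the HDFS group already exists on the HCat node. The script prints the commands by default (`DRY_RUN=1`); set `DRY_RUN=0` and run it with root privileges to execute them.

```shell
#!/bin/sh
# Hypothetical names (assumptions); use the same group the HDFS user belongs to.
HCAT_USER="${HCAT_USER:-hcat}"
HDFS_GROUP="${HDFS_GROUP:-hdfsgroup}"

# Print each command in dry-run mode (the default); execute it otherwise.
run() {
    if [ "${DRY_RUN:-1}" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# Run on the node where HCat is installed.
create_hcat_user() {
    run useradd "$HCAT_USER"                        # create the HCat user
    run usermod -a -G "$HDFS_GROUP" "$HCAT_USER"    # add it to the HDFS group
}

create_hcat_user
```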
- Create the user directory and grant permissions.
  In this step, you create a user directory and set it up so that the HDFS user can write to it.
  - Create a user directory on HDFS.
  - Change the owner of the directory to the HDFS user.
  - Change the group of the directory to the HDFS group.
  - Grant read and write permissions to the owner of the directory (that is, the HDFS user).
  - Grant read and write permissions to the group to which the directory belongs.
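The directory setup maps fairly directly onto the Hadoop file system shell. The following is a minimal sketch, assuming the `hdfs` command is on the path and the script is run by an account with HDFS superuser privileges. The user name `piintegrator`, the group name `hdfsgroup`, and the path `/user/piintegrator` are hypothetical placeholders. The script prints the commands by default (`DRY_RUN=1`); set `DRY_RUN=0` to execute them.

```shell
#!/bin/sh
# Hypothetical names and path (assumptions); substitute your own values.
HDFS_USER="${HDFS_USER:-piintegrator}"
HDFS_GROUP="${HDFS_GROUP:-hdfsgroup}"
USER_DIR="${USER_DIR:-/user/$HDFS_USER}"

# Print each command in dry-run mode (the default); execute it otherwise.
run() {
    if [ "${DRY_RUN:-1}" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

setup_user_dir() {
    run hdfs dfs -mkdir -p "$USER_DIR"              # create the user directory
    run hdfs dfs -chown "$HDFS_USER" "$USER_DIR"    # owner: the HDFS user
    run hdfs dfs -chgrp "$HDFS_GROUP" "$USER_DIR"   # group: the HDFS group
    run hdfs dfs -chmod u+rw "$USER_DIR"            # owner may read and write
    run hdfs dfs -chmod g+rw "$USER_DIR"            # group may read and write
}

setup_user_dir
```

Afterward, `hdfs dfs -ls -d` on the directory shows the resulting owner, group, and permission bits.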
- Disable SASL.
  Note: PI Integrator for Business Analytics supports only Apache Hive targets that are configured with NOSASL authentication. For more information about Apache Hive authentication options, see Setting Up HiveServer2.
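In HiveServer2, the authentication mode is controlled by the `hive.server2.authentication` property, which is normally set in `hive-site.xml`. As an illustration only, the sketch below passes the same property on the command line when starting HiveServer2 and then checks a NOSASL connection with Beeline. The host `localhost`, port `10000`, and database `default` are assumptions, and how HiveServer2 is actually started or restarted depends on your distribution. The script prints the commands by default (`DRY_RUN=1`); set `DRY_RUN=0` to execute them.

```shell
#!/bin/sh
# Hypothetical connection details (assumptions); adjust for your cluster.
HS2_HOST="${HS2_HOST:-localhost}"
HS2_PORT="${HS2_PORT:-10000}"

# Print each command in dry-run mode (the default); execute it otherwise.
run() {
    if [ "${DRY_RUN:-1}" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# Start HiveServer2 with SASL disabled for this process only.
# When executed for real, this runs in the foreground.
start_hiveserver2_nosasl() {
    run hive --service hiveserver2 --hiveconf hive.server2.authentication=NOSASL
}

# Verify that a client can connect without SASL.
check_nosasl_connection() {
    run beeline -u "jdbc:hive2://${HS2_HOST}:${HS2_PORT}/default;auth=noSasl" -e "SHOW DATABASES;"
}

# Usage: pass "start" (default) or "check" as the first argument.
case "${1:-start}" in
    start) start_hiveserver2_nosasl ;;
    check) check_nosasl_connection ;;
esac
```

For a persistent configuration, setting the property to `NOSASL` in `hive-site.xml` and restarting HiveServer2 has the same effect as the command-line override.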