You should review the Big Data Installation and Configuration guide. It would answer these questions for you. There are multiple ways to install on to the cluster, so 1-4 are all options you can use depending on your cluster set up and preferences. For #5, you are correct that the Hadoop binary installation is NOT required on the BDE server.
Thanks, regards to point 2, is there any dependency on hadoop cluster side to make automatic installation.
You will need password-less SSH for the root user. All the prerequisites are listed in that installation document.
is it mandatory to install Informatica Hadoop binaries into Name node or we can ignore and install only on Data node.
is there any impact if we ignore Name node.
It is mandatory to install Informatica hadoop binaries in all data nodes.
Even If you have a single node hadoop for PoC you need to install the hadoop binaries.
Inforamtica bde will call the functions at run time from these binaries.
hope it helps.
thanks, What about Name node / secondary name node? do I need to install?
No, you only need to install on the data nodes, but you do need to install on ALL the data nodes.
Hello imthias.kareem were you able to install and configure Informatica BDE successfully? If so is there a step-by-step document prepared as part of it. Appreciate if you can share it to me.
Thanks in advance.
What is your Hadoop distribution ? If it is CDH then there is an alternate way to setup BDE binaries on Hadoop cluster through Informatica provided parcels for CDH.
These BDE parcels for CDH can be directly depoyed on CDH cluster using Cloudera Manager.
Hello Hitesh -
Sorry for delayed response, BDE distributor is Hortonworks
I think there is no concept of parcels for Hortonworks distribution and thus you have to install Informatica hadoop binaries manually on all Hadoop data nodes.
You can do this from primary name node, download the installabe (rpm) to the primary name node,
Run the install.sh (provided by Informatica available when BDE RPM installables are downloaded, this picks up all the data node names available in "$HADOOP_HOME/conf/slaves file" and copies the RPM to these and installs them on the same.
For more information refer to BDE Installation guide.
For more details on the installation, refer to the below installation guide:
Hope this helps.
Thanks and Regards,
Can any one will provide me step by step Hortonworks configuration and installation on BDM 10.1
Thanks in advance.