Install Apache Sqoop on Ubuntu 20.04 LTS (2021)
Apache Sqoop(TM) is a tool designed for efficiently transferring bulk data between
Apache Hadoop and structured datastores such as
relational databases.
In this post, lets see how to install Apache Sqoop on Ubuntu.
Step 1: Download Apache Sqoop
Download the Apache Sqoop binary file from official download page
Note: As of 2021-06, Apache Sqoop project has been retired since there is no development after Sqoop version 1.4.7
Step 2: Untar the Sqoop binary file
Untar and rename the Sqoop binary file using the below command
tar -xvzf sqoop-1.4.7.bin__hadoop-2.6.0.tar.gz
mv sqoop-1.4.7.bin__hadoop-2.6.0 sqoop
Step 3: Configure Sqoop
Configure the Sqoop by executing the below command
cd sqoop/conf
mv sqoop-env-template.sh sqoop-env.sh
nano sqoop-env.sh
export HADOOP_COMMON_HOME=/home/karthik/hadoop
export HADOOP_MAPRED_HOME=/home/karthik/hadoop
export HIVE_HOME=/home/karthik/hive
Once you made the above mentioned changes, save it by pressing Ctrl+X then type "Y" and press "Enter"
Step 4: Add SQOOP_HOME to ~/.bashrc
Add the SQOOP_HOME environment variables to ~/.bashrc file by executing the below command
cd
nano ~/.bashrc
export SQOOP_HOME=/home/karthik/sqoop
export PATH=$PATH:$SQOOP_HOME/bin
Instantiate the changes by executing the below command
source ~/.bashrc
Step 5: Verify Sqoop installation
Verify your Sqoop Installation by executing the below command
sqoop version
You will see the below screenshot if the installation is successful.
That's it, we have successfully installed Apache Sqoop on Ubuntu. If you face any issues, let me know in the comments section. Thank you!!
Post a Comment