Let us validate Sqoop by running sample job. We typically use Sqoop to get data from remote RDBMS databases into HDFS and vice versa.
- Sqoop uses JDBC to connect to remote databases. Hence on the gateway node, where we typically run Sqoop import/export commands, we need to have JDBC jar file.
- JDBC Jar is based on the database technology used (for e.g.: mysql-connector-java.jar for MySQL and ojdbc12.jar for Oracle)
- We can validate Database Connectivity using Sqoop commands such as list-databases and list-tables.
- We can validate whether we are able to perform import using sqoop import command