Let us validate Sqoop by running sample job. We typically use Sqoop to get data from remote RDBMS databases into HDFS and vice versa.
- Sqoop uses JDBC to connect to remote databases. Hence on the gateway node, where we typically run Sqoop import/export commands, we need to have JDBC jar file.
- JDBC Jar is based on the database technology used (for e.g.: mysql-connector-java.jar for MySQL and ojdbc12.jar for Oracle)
- We can validate Database Connectivity using Sqoop commands such as list-databases and list-tables.
https://gist.github.com/dgadiraju/5d5f5d77bb68549ae117b69461a17437
- We can validate whether we are able to perform import using sqoop import command
https://gist.github.com/dgadiraju/529f1f034e5c0530bb11b576d23700a5