Commands

Sqoop Commands

To run the Sqoop commands, you need shell access on the Invariant edge servers.

Sqoop Help

To access help and see details of commands use the following command

[devuser@inv121 ~]$ Sqoop help
Output
usage: sqoop COMMAND [ARGS]

Available commands:

  codegen            Generate code to interact with database records
  create-hive-table  Import a table definition into Hive
  eval               Evaluate a SQL statement and display the results
  export             Export an HDFS directory to a database table
  help               List available commands
  import             Import a table from a database to HDFS
  import-all-tables  Import tables from a database to HDFS
  import-mainframe   Import datasets from a mainframe server to HDFS
  job                Work with saved jobs
  list-databases     List available databases on a server
  list-tables        List available tables in a database
  merge              Merge results of incremental imports
  metastore          Run a standalone Sqoop metastore
  version            Display version information

See 'sqoop help COMMAND' for information on a specific command.

Each command has specific arguments. The help command can display these arguments when you use the commands as a parameter. For example here is the syntax for displaying the arguments for the frequently used import command.

[devuser@inv121 ~]$ sqoop help list-databases

output

usage: sqoop list-databases [GENERIC-ARGS] [TOOL-ARGS]
Common arguments:
   --connect <jdbc-uri>              Specify JDBC
                                     Connect string
   --connection-manager <class-name> Specify connection manager class name
   --connection-param-file <properties-file>   Specify connection
                                               Parameters file Manually
                                               specify JDBC driver class
                                               to use
   --hadoop-home <hdir>             Override $HADOOP_MAPRED_HOME_ARG
   --hadoop-mapred-home <dir>       Override $HADOOP_MAPRED_HOME_ARG
   --help                           Print usage instructions
   --metadata-transaction-isolation-level <isolationlevel>    Defines the
                                                              transaction
                                                              isolation
                                                              level for
                                                              metadata
                                                              queries. For
                                                              more details
                                                              check
                                                              java.sql.Con
                                                              nection
                                                              javadoc or JDBC
                                                              specification
-P                                                            Read
                                                              password
                                                              from console
   --password <password>                                      Set
                                                              authenticati
                                                              on password
   --password-alias <password-alias>                          Credential
                                                              provider
                                                              password
                                                              alias
   --password-file <password-file>                            Set
                                                              authenticati
                                                              on password
                                                              file path
   --relaxed-isolation                                        Use
                                                              read-uncommi
                                                              tted
                                                              isolation
                                                              for imports
   --skip-dist-cache                                          Skip copying
                                                              jars to
                                                              distributed
                                                              cache
   --temporary-rootdir <rootdir>                              Defines the
                                                              temporary
                                                              root
                                                              directory
                                                              for the
                                                              import
   --username <username>                                      Set
                                                              authenticati
                                                              on username
   --verbose                                                  Print more
                                                              information
                                                              while
                                                              working
                                                              
Generic Hadoop command-line arguments:
(must preceed any tool-specific arguments)
Generic options supported are
-conf <configuration file>     specify an application configuration file
-D <property=value>            use value for given property
-fs <local|namenode:port>      specify a namenode
-jt <local|resourcemanager:port>    specify a ResourceManager
-files <comma separated list of files>    specify comma separated files to be copied to the map reduce cluster
-libjars <comma separated list of jars>    specify comma separated jar files to include in the classpath.
-archives <comma separated list of archives>    specify comma separated archives to be unarchived on the compute machines.

The general command line syntax is
bin/hadoop command [genericOptions] [commandOptions]

List databases

This command lists the databases that your account is allowed to access.

sqoop list-databases \
    -Dhadoop.security.credential.provider.path=jceks://hdfs/user/devuser/q1.password.jceks \ 
    --connect jdbc:db2://localhost:52330/TESTP30 \
    --username devuser --password-alias qrep.password.alias

Output
    ASN
    ASNP2R
    DSADM
    BPM
    SQLJ
    SYSCAT
    SYSFUN
    SYSIBM
    SYSIBMADM
    SYSIBMINTERNAL
    SYSIBMTS
    SYSPROC
    SYSPUBLIC
    SYSSTAT
    SYSTOOLS

List Tables

This command lists the tables available.

sqoop list-tables \
     -Dhadoop.security.credential.provider.path=jceks://hdfs/user/devuser/qr.password.jceks \
     --connect jdbc:db2://localhost:52330/DBBPM30 \
     --username devuser --password-alias qr.password.alias
     
INDEX_ASSIGNMENTSKILLS
INDEX_CORRESPONDENCE_TYPE
INDEX_ENTITIES
WORK_CATEGORY
OPERATORS
OPERATOR_SKILLS     

Again the syntax is straightforward requiring just a connect argument. But in this case you need to specify the database in the connection string as well.

Last updated