CtrlK

Sqoop

Apache Sqoop is a data management tool that can be used to extract data from a structured data store into HDFS and also export data back from HDFS to RDBMS for reporting and processing by other applications. The command line tool provides developers the ability to construct import and export scripts.

Key features

Existing data from relational databases can be loaded into an HDFS cluster
Data can be exported back to traditional data platforms for analysis and use with existing tools and applications
Can be scripted to automate data movement across systems

For more details visit the SQOOP page on Apache Sqoop website

PreviousOozie NextCommands

Last updated 4 years ago