Sqoop

Apache Sqoop is a command-line tool for bulk data transfer between structured data stores, such as relational databases (RDBMS), and HDFS. It imports data from a structured data store into HDFS and exports data from HDFS back to an RDBMS for reporting and processing by other applications. Developers use the command-line interface to build import and export scripts.
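As a rough illustration of the two directions of transfer, the sketch below builds the argument lists for a `sqoop import` and a `sqoop export` run. It only constructs and prints the commands; it does not run them. The JDBC URL, user name, password file path, table names, and HDFS directories are all made-up placeholders, and the helper function names are hypothetical. The flags used (`--connect`, `--username`, `--password-file`, `--table`, `--target-dir`, `--export-dir`, `--num-mappers`) are standard Sqoop 1 options, but check them against the version you have installed.

```python
import shlex


def sqoop_import_cmd(jdbc_url, username, password_file, table, target_dir, mappers=4):
    """Build the argv for `sqoop import` (RDBMS table -> HDFS directory)."""
    return [
        "sqoop", "import",
        "--connect", jdbc_url,              # JDBC connection string of the source database
        "--username", username,
        "--password-file", password_file,   # file holding the password, kept off the command line
        "--table", table,                   # source table to read
        "--target-dir", target_dir,         # HDFS directory to write into
        "--num-mappers", str(mappers),      # number of parallel map tasks
    ]


def sqoop_export_cmd(jdbc_url, username, password_file, table, export_dir):
    """Build the argv for `sqoop export` (HDFS directory -> RDBMS table)."""
    return [
        "sqoop", "export",
        "--connect", jdbc_url,
        "--username", username,
        "--password-file", password_file,
        "--table", table,                   # destination table, assumed to exist already
        "--export-dir", export_dir,         # HDFS directory holding the data to export
    ]


if __name__ == "__main__":
    # All values below are placeholders for illustration only.
    url = "jdbc:mysql://db.example.com/sales"
    imp = sqoop_import_cmd(url, "etl_user", "/user/etl/.db_password",
                           "orders", "/data/raw/orders")
    exp = sqoop_export_cmd(url, "etl_user", "/user/etl/.db_password",
                           "order_summary", "/data/out/order_summary")
    print(shlex.join(imp))
    print(shlex.join(exp))
```

Building the command as a list rather than a single string avoids shell quoting problems if the list is later passed to `subprocess.run`.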

Key features

  • Existing data from relational databases can be loaded into an HDFS cluster

  • Data can be exported back to traditional data platforms for analysis and use with existing tools and applications

  • Data movement across systems can be scripted and automated
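One way the scripting point above might look in practice is a small driver that runs one import per table and reports which ones failed. This is a minimal sketch under stated assumptions: the connection details, table names, and HDFS paths are placeholders, `build_import` and `import_tables` are hypothetical helpers, and `sqoop` is assumed to be on the `PATH` of the machine running the script.

```python
import subprocess

# Placeholder connection settings (assumptions, not taken from any real system).
JDBC_URL = "jdbc:postgresql://db.example.com/warehouse"
USERNAME = "etl_user"
PASSWORD_FILE = "/user/etl/.db_password"


def build_import(table):
    """Return the argv for importing one table into /data/raw/<table> on HDFS."""
    return [
        "sqoop", "import",
        "--connect", JDBC_URL,
        "--username", USERNAME,
        "--password-file", PASSWORD_FILE,
        "--table", table,
        "--target-dir", f"/data/raw/{table}",
    ]


def import_tables(tables, run=subprocess.run):
    """Run one import per table; return the tables whose import exited non-zero.

    `run` can be swapped for a stub so the loop is testable without Sqoop installed.
    """
    failed = []
    for table in tables:
        result = run(build_import(table))
        if result.returncode != 0:
            failed.append(table)
    return failed


if __name__ == "__main__":
    failures = import_tables(["customers", "orders", "products"])
    if failures:
        print("Imports failed for:", ", ".join(failures))
```

A script like this can then be triggered by cron or a workflow scheduler so that the transfers run on a regular basis.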

For more details, visit the Sqoop page on the Apache Sqoop website.
