Metadata Configuration

The inventory manager is configured through a set of YAML files in the config sub-folder of the inventory manager installation. The following sections describe the key files used to configure the data sources, the tables of interest, the target data store, and so on. Other files in that folder, such as the data type mapping, are maintained by the product and should not require changes.

The key files to be managed by administrators are:

datasources.yml

This configuration file describes the data sources that feed source data into the discovery pipeline. Multiple data sources and databases are supported. Refer to the feature section for the data sources supported by the current version.

An example datasources.yml configuration file.

datasources:
 - name: datasource1
   driverClass: com.ibm.db2.jcc.DB2Driver
   user: db2inst1
   password: aaaaaaaaaa==:bbbbbbbb/cccccc==
   schema: schema1
   url: jdbc:db2://invdb.localhost:50000/sample
 - name: datasource2
   driverClass: com.ibm.db2.jcc.DB2Driver
   user: db2inst1
   password: aaaaaaaaaa==:bbbbbbbb/cccccc==
   schema: schema2
   url: jdbc:db2://invdb.localhost:50000/sample2
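
Each entry in the example above carries the same six properties: name, driverClass, user, password, schema and url. As a minimal sketch, the Python function below checks a configuration that has already been parsed into dictionaries (for instance by a YAML library of your choice). Treating all six properties as mandatory, requiring unique names, and expecting the URL to start with "jdbc:" are assumptions made for illustration. The function name validate_datasources is hypothetical and is not part of the product.

```python
# Properties seen in the example file; assumed (not confirmed) to be mandatory.
REQUIRED_KEYS = ("name", "driverClass", "user", "password", "schema", "url")


def validate_datasources(config):
    """Return a list of problems found in a parsed datasources configuration."""
    problems = []
    seen_names = set()
    for index, entry in enumerate(config.get("datasources", [])):
        name = entry.get("name")
        label = name or f"entry #{index}"
        # Report every property that is absent or empty.
        for key in REQUIRED_KEYS:
            if not entry.get(key):
                problems.append(f"{label}: missing '{key}'")
        # Assumption: every data source is reached through a JDBC URL.
        url = entry.get("url")
        if url and not url.startswith("jdbc:"):
            problems.append(f"{label}: url does not look like a JDBC URL")
        # Assumption: data source names must be unique within the file.
        if name:
            if name in seen_names:
                problems.append(f"{label}: duplicate data source name")
            seen_names.add(name)
    return problems


# The two data sources from the example file, as a YAML parser would return them.
example = {
    "datasources": [
        {
            "name": "datasource1",
            "driverClass": "com.ibm.db2.jcc.DB2Driver",
            "user": "db2inst1",
            "password": "aaaaaaaaaa==:bbbbbbbb/cccccc==",
            "schema": "schema1",
            "url": "jdbc:db2://invdb.localhost:50000/sample",
        },
        {
            "name": "datasource2",
            "driverClass": "com.ibm.db2.jcc.DB2Driver",
            "user": "db2inst1",
            "password": "aaaaaaaaaa==:bbbbbbbb/cccccc==",
            "schema": "schema2",
            "url": "jdbc:db2://invdb.localhost:50000/sample2",
        },
    ]
}

problems = validate_datasources(example)
print(problems if problems else "no problems found")
```

A check of this kind can be run before the file is deployed, so that a missing property is caught before the discovery pipeline starts.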

The datasources YAML file supports multiple data sources, each of which can be configured to source data for the discovery pipeline. Each data source is configured within its own section with the following properties

Sample for DB2 based data source

datasource_tables.yml

The datasource_tables YAML file configures the tables used to source data from each of the data sources. It also marks the audit columns that support inventory tracking.

An example datasource_tables.yml configuration file.

The datasource_tables YAML file supports multiple data sources, each of which can be configured to source data for the discovery pipeline. Each data source is configured within its own section with the following properties

Sample of a data source with tables, a set of audit columns defined at the data source level and overridden for “tablc”
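
The override rule mentioned here, where audit columns set at the data source level apply to every table unless a table defines its own, can be sketched in Python. This is an illustration only and not the product's file format. The key names auditColumns and tables, the table names, and the column names are all hypothetical. The sketch also assumes that a table-level list replaces the data source list rather than being merged with it.

```python
def effective_audit_columns(datasource):
    """Map each table name to the audit columns that apply to it."""
    # Columns declared once at the data source level act as the default.
    default_columns = datasource.get("auditColumns", [])
    resolved = {}
    for table in datasource.get("tables", []):
        # Assumption: a table-level list fully replaces the default list.
        resolved[table["name"]] = list(table.get("auditColumns", default_columns))
    return resolved


# Hypothetical parsed entry: two tables inherit the defaults, one overrides them.
datasource = {
    "name": "datasource1",
    "auditColumns": ["created_ts", "updated_ts"],
    "tables": [
        {"name": "orders"},
        {"name": "customers"},
        {"name": "payments", "auditColumns": ["posted_ts"]},
    ],
}

resolved = effective_audit_columns(datasource)
for table_name, columns in resolved.items():
    print(table_name, columns)
```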

target_datastore.yml

The target store YAML file configures the discovery HDFS datastores. It includes the Hive configuration and the HDFS locations used for the data inventory and to support data merge processes.

An example target_datastore.yml configuration file.

The target_datastore configuration consists of two sections. The first section contains the HDFS connectivity details, such as the home directory, URL, and user authentication details. This information should be provided by the Invariant platform administrator. See also the installation instructions.

The second section relates the source tables to the locations where their data will be persisted in the HDFS store. One entry is expected for each configured data source.
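
Because one target entry is expected per data source, it can help to compare the two sets of names before running the pipeline. The Python sketch below works only on two lists of names, since the exact layout of target_datastore.yml is not shown here. How the names are read from each file is left open, and the function name missing_target_entries is hypothetical.

```python
def missing_target_entries(datasource_names, target_entry_names):
    """Return the data sources that have no matching target entry."""
    targets = set(target_entry_names)
    # Keep the original order so the report follows datasources.yml.
    return [name for name in datasource_names if name not in targets]


# Names as they might be collected from the two configuration files.
datasource_names = ["datasource1", "datasource2"]
target_entry_names = ["datasource1"]

missing = missing_target_entries(datasource_names, target_entry_names)
print(missing)
```

An empty result means that every data source has a matching entry in the second section.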
