This connector reads custom dataset lineages from a config file.
No special setup is required.
Create a YAML config file based on the following template.
lineages:
  - dataset:
      platform: <data_platform>
      name: <dataset_name>
    upstreams:
      - platform: <dataset_platform>
        name: <dataset_name>
        account: <snowflake_account> # only for Snowflake
      ...
  ...
Note: You only need to specify account if the platform is SNOWFLAKE.
See Output Config for more information.
Here's an example specifying that BigQuery table project.db.table1 sources its data from BigQuery table project.db.table2 and Snowflake table db.schema.table3:
lineages:
  - dataset:
      platform: BIGQUERY
      name: project.db.table1
    upstreams:
      - platform: BIGQUERY
        name: project.db.table2
      - platform: SNOWFLAKE
        name: db.schema.table3
        account: snowflake_account
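
Before running the connector, it can help to sanity-check a config against the template. The following is a minimal sketch, not part of metaphor-connectors: check_lineages is a hypothetical helper, and it works on a Python dict that mirrors the structure of the YAML file after parsing (loading the file itself is left out). It assumes that every dataset and upstream needs platform and name, and, reading the note about account, that a SNOWFLAKE entry without account is worth flagging.

```python
from typing import Any


def check_lineages(config: dict[str, Any]) -> list[str]:
    """Return human-readable problems found in a parsed lineage config.

    Hypothetical helper for illustration; the connector may apply
    different or stricter rules.
    """
    problems: list[str] = []

    def check_dataset(entry: Any, where: str) -> None:
        if not isinstance(entry, dict):
            problems.append(f"{where}: expected a mapping")
            return
        # Assumption: platform and name are needed on every entry.
        for key in ("platform", "name"):
            if not entry.get(key):
                problems.append(f"{where}: missing '{key}'")
        # Assumption: account is expected whenever the platform is SNOWFLAKE.
        is_snowflake = str(entry.get("platform", "")).upper() == "SNOWFLAKE"
        if is_snowflake and not entry.get("account"):
            problems.append(f"{where}: SNOWFLAKE entry has no 'account'")

    for i, lineage in enumerate(config.get("lineages") or []):
        if not isinstance(lineage, dict):
            problems.append(f"lineages[{i}]: expected a mapping")
            continue
        check_dataset(lineage.get("dataset"), f"lineages[{i}].dataset")
        for j, upstream in enumerate(lineage.get("upstreams") or []):
            check_dataset(upstream, f"lineages[{i}].upstreams[{j}]")
    return problems


# The example config above, written as the dict a YAML parser would produce.
example = {
    "lineages": [
        {
            "dataset": {"platform": "BIGQUERY", "name": "project.db.table1"},
            "upstreams": [
                {"platform": "BIGQUERY", "name": "project.db.table2"},
                {
                    "platform": "SNOWFLAKE",
                    "name": "db.schema.table3",
                    "account": "snowflake_account",
                },
            ],
        }
    ]
}

print(check_lineages(example))  # prints []
```

An empty list means the sketch found nothing to flag; each string in a non-empty list names the offending entry by its position in the config.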
Follow the Installation instructions to install metaphor-connectors in your environment (or virtualenv).
Run the following command to test the connector locally:
metaphor custom.lineage <config_file>
Manually verify the output after the run finishes.