55 questions
1
vote
0
answers
86
views
Wrong REST response for retrieving Column Level Lineage in GCP BigQuery
I have got two BigQuery tables:
bigquery:ssh-test-project-01.SSh_Dataset_03.SSh_BgQ_Src_01 and
bigquery:ssh-test-project-01.SSh_Dataset_03.SSh_BgQ_Dst_BgQ_01.
Using Data Transfer with SQL:
"...
0
votes
0
answers
135
views
Fetch data containing the Tags and Business terms using BigQuery with the table/column names
I attached some "Column Tags" and "Business terms" using Dataplex and my goal is to fetch the table/column data with the tags information and business terms, is there a way to do ...
0
votes
1
answer
207
views
How to manage create tags in GCP Data Catalog on row level
I want to create Data Catalog tags for specific row which I process in Data Fusion - e.g. Warning tag to underline that specific row not completely passed all validations of pipeline, but it's said ...
0
votes
1
answer
290
views
What is the difference between a data catalog policy tag and a "normal" data catalog tag?
I have actually gotten things to work properly (using the policy tag), yet I still don't understand the difference between the data catalog policy tag and the data catalog tag.
To be specific, I'm ...
1
vote
1
answer
170
views
How to share Google Dataplex/Data Catalog metadata outside the organization
We are sharing some BQ datasets with a third party (outside our org) using analytics hub. So, the subscriber is able to create a linked dataset in their project and any queries they run are billed to ...
0
votes
1
answer
1k
views
Confusion about metadata in Google Dataplex and Data Catalog
I have a dataset and a table in google big query (BQ). For the dataset, I can add description, and for the table I can add description and column policy tags to control column level access (I am ...
1
vote
0
answers
310
views
How to attach a tag template to a dataset in GCP using terraform
I have generated tag templates with Terraform, intending to associate them with various datasets in GCP. Despite my efforts to identify a connection between the datasets and the tag templates, I was ...
1
vote
0
answers
207
views
Retrieve columns assigned with policy tag
We have a policy tag applied to many columns in BigQuery. Is there anyway to retrieve all the columns that tag applies to?
1
vote
1
answer
226
views
Google Cloud Data Catalog - Offerings and Flexibility
Planning to build a data platform with compute as Google Cloud Dataproc storing the data in delta tables (Deltalake).
Currently exploring the data catalog available in GCP stack along with open source ...
0
votes
1
answer
381
views
Manage Google Data Catalog in Terraform - Set Tag Template's visibility
I'm trying to create a tag template in Google Data Catalog using Terraform.
Once created - the tag template's visibility is set to "Private".
Looking for a way to set the visibility to ...
0
votes
0
answers
273
views
How is Data masking price calculated in BigQuery
We run 2 queries from a table in 2 scenarios:
We run a query to create a table from selecting data of another table (ex: ABC), this table ABC has no any policy tags (data masking rules) on any ...
0
votes
1
answer
530
views
Attaching tags to columns in data catalog using python
I have a table in BQ and I am trying to attach tags to columns depending on the prefix. For ex., all the columns which start with ABC_ have to be tagged with the Private Info tag.
I have written below ...
0
votes
2
answers
187
views
How to get the list of columns that the user has access to
Is there any way(with query/python method) to get all the columns a user has in bigquery table?
my requirement is
userA: TableA.column1, TableA.column2,TableA.column4
userB: TableA.column1, TableA....
0
votes
0
answers
297
views
GCP data catalog/dataflex - can I register Kafka topics and on-prem databases?
Is it possible to also register/add metadata of Kafka topics/producers/consumers and on-premise databases etc in GCP data catalog/Dataplex?
Could someone please point me to some examples how to do the ...
0
votes
1
answer
100
views
Adding multiple fields for the same template in Data Catalog using python
I have a tag template in data catalog with name 'data check' and i need to add multiple fields to this template using python code by reading from a text file
the text file consists the following
name,...
0
votes
1
answer
237
views
GCP CLI Command to remove Data Catalog Tag
what is the GCP CLI Command to remove (detach) a Data Catalog Tag from a BigQuery Dataset, and also CLI Command to Update Tag.
I am able to do it manually how to do it using Cloud Shell CLI gcloud ...
0
votes
1
answer
445
views
How to detach or remove a data catalog tag from a BigQuery table using gcloud command
Could anyone please share ETL tag template in GCP data catalog?
I'd like to refresh a tag value with its ETL status every time a BigQuery table is updated. I'm trying to use gcloud commands to create ...
0
votes
2
answers
1k
views
Dataplex API Tag Policies
I'm on exploration Dataplex API with Python in Google Documentation, there is documentation to Get Lake, Zone, Assets, etc. I've explored that documentation, but I didn't find any documentation ...
0
votes
1
answer
196
views
How to update the Data Catalog tag values through airflow CloudDataCatalog operators
I would like to update Data Catalog tags through airflow CloudDataCatalog operators and schedule the DAG to update the tag values daily. This is to capture the ETL status on a BigQuery table and feed ...
3
votes
1
answer
3k
views
Error parsing metadata using python datacatalog
I'm trying to update attached tag to tables in data catalog using python : enter image description here
And i recieve this error :
enter image description here
E0102 16:29:36.671000000 13508 src/core/...
3
votes
1
answer
615
views
GCP Data Catalog Schema History or Versioning
I've been wondering if it is possible to have versions of schema in GCP Data Catalog Service? Or maybe advice on how you deal with Data Catalog entries when schema is changed (e.g. in CloudSQL, GCS ...
1
vote
1
answer
887
views
How Can I Attach Policy Tags to columns using Python API
As a part of data governance, we have created Taxonomies, Policy Tags Using "Python API". And I am trying to Assign Policy Tags to Columns [Name, Age] for a table Project.Dataset.TMP_TBL.
...
0
votes
1
answer
843
views
Attach a tag from a existing template along with values to a BigQuery table using the Data Catalog
I need help using the template template created in Google Data Catalog through Python programming to tag several BigQuery tables using the same template template created.
I don't know how to use the ...
0
votes
1
answer
2k
views
Error while creating Taxonomy in Google Data Catalog using Apache Airflow
Experts,
I'm trying to create a Taxonomy in Google Data Catalog using Apache Airflow.
When I trigger the job in Airflow, it fails with below error.
Error
> google.api_core.exceptions....
0
votes
1
answer
585
views
Is there way to copy data from the GCP data catalog from a project to another?
I'm doing a test to use GCP Data Catalog (DataPlex) to document tables from Big Query.
I'm doing in a dev project, in the future, I'd like transfer data that I documented (description of columns, ...
1
vote
1
answer
575
views
Unsupported schema specified for Pubsub source in CREATE TABLE
Following the link I found in Google, I'm trying to do a sample setup to publish message in pubsub and load the same into bigquery table using dataflow sql.
But when I create dataflow job am getting ...
1
vote
1
answer
1k
views
Error adding Policy Tags on Big Query External Tables:Policy tags are not supported on plain external tables. Use an authorized external table instead
I have been trying to implement column level access policies on a Big Query external table. I have tried to create a policy tag via GCP console as well as REST API but getting an error "Policy ...
2
votes
1
answer
755
views
Programmatically adding tags to Data Catalog Custom entries
I am trying to attach tags to data catalog custom entries. I am trying to create a python function to perform data catalog operations i.e. create/delete custom entries, create/delete tag templates, ...
1
vote
1
answer
312
views
How is includeProjectIds used in Google data catalog search?
When searching using Google's data catalog, a scope parameter is required and one of its fields is includeProjectIds. According to the documentation, this parameter represents:
The list of project ...
0
votes
1
answer
175
views
I can't filter the column description in the datacatalog(GCP)
I'm trying to filter the gcp data catalog, in which I try to filter column descriptions, using the description qualifiers: student name attribute.
Unfortunately, no return.
other qualifiers that I ...
0
votes
0
answers
1k
views
Google Cloud Data Catalog - Node JS Library giving Error: 5 NOT_FOUND: Project does not exist
I am trying to use Nodejs Library for Google Cloud Data Catalog. For one project, it is working perfectly fine but for others it is giving me Project does not exist error, while I can see the project ...
0
votes
1
answer
1k
views
How can I list tags en Google Data Catalog?
I'm creating tag templates programmatically and it doesn't appear in the section "Tag Templates" (the creation of each tag template was successful because I can attach it in each table), but ...
0
votes
2
answers
2k
views
Attach a tag along with the values to a BigQuery table using the Data Catalog
I have multiple tables in BigQuery. I also have a tag template.
Is there a way to attach this tag template and fill the details programmatically with python to any table using the google.cloud....
1
vote
2
answers
541
views
How to filter GCP Data Catalog entries with a specific tag value via console?
I ran a DLP job saving results to Data Catalog and would like to filter the entries in Data Catalog where the standard tag template (Data Loss Prevention Tags) has a value Contains DLP findings: true. ...
0
votes
1
answer
690
views
Column Tagging in GCP Data Catalog using DLP results and Python
I've been dealing with this issue for weeks. Does anyone know how to create column tags in Data Catalog from DLP results? I wanted to run inspection job on DLP and then tag the inspected tables with ...
0
votes
1
answer
460
views
Bigquery Updating Metadata but Disabled Create/Delete/Alter Tables
What roles should be properly assigned to a group/service account if we would like to be able to update table descriptions in a centralized dataset and other labels for Data Catalog?
We currently have ...
1
vote
1
answer
3k
views
Why do BigQuery policy tagged columns prevent access to other columns when used in a GROUP BY statement?
We are exploring the use of BigQuery policy tags for implementing column level security. We have discovered that when a column that a user does not have access to is included in a GROUP BY clause then ...
1
vote
1
answer
682
views
Create Data Catalog column tags by inspecting BigQuery data with Cloud Data Loss Prevention
I want to use DLP to inspect my tables in BigQuery, and then write the findings to policy tags on the columns of the table. For example, I have a (test) table that contains data including an email ...
0
votes
0
answers
120
views
google-cloud-datacatalog 0.8.0 is missing TagTemplate option
I am working on developing a datacatalog integration pipeline in google cloud composer. Since I am using composer version composer-1.12.4-airflow-1.10.10, its not allowing me to install google-cloud-...
0
votes
1
answer
184
views
Can Google Cloud Data Catalog link to Azure Data Lake and Azure SQL Database?
Is it possible to use the Google Cloud Data Catalog to catalog an Azure Data Lake, Blob storage or Azure SQL Database?
1
vote
2
answers
663
views
Data Catalog will not read technical metadata automatically from files in a bucket (GCS)
In Google Data Catalog I have tried to create a new entry group, then created a fileset entry where bucket and file pattern is configured. I have not defined a schema on purpose in the Fileset, as I ...
0
votes
1
answer
917
views
Error Python API GCP Data Catalog - Google Cloud Platform
I'm getting an error trying of use the code of this link: Data Catalog Example. At the step 4, just copy pasting all the code provided, authenticating into my GCP Project and testing it.
Everything is ...
2
votes
0
answers
1k
views
Data Catalog: error updating Tag - Error 400: Unsupported field mask path: "column", supported field masks are: fields
I am attaching a tag-template to a column of a BigQuery table. For this, I am using Terraform and I have just recreated the code in the terraform documentation.
resource "...
0
votes
1
answer
6k
views
Policy Tags in BigQuery
Is there a way to assign policy tags globally in BigQuery? Rather than entering every single table and putting it there?
0
votes
1
answer
244
views
Google Cloud Platform: Any way to rearrange fields in an existing Data Catalog template?
I've added a new field to an existing GCP Data Catalog template which by default adds it to the bottom. I'd like for this new field to appear higher up. Is there some way I'm not seeing to rearrange ...
0
votes
0
answers
80
views
Google Data Catalog systems list not refreshing
I am using the connector suggested in some GCP documentation to sync metadata from Tableau to Google Data Catalog (https://github.com/GoogleCloudPlatform/datacatalog-connectors-bi). The sync process ...
0
votes
1
answer
719
views
GCP Data Catalog - ONE for all projects (of one and more orgs)
What is the best practice
to get a company wide (one or more organisations each with multiple folders and projects)
INTO one central and all metadata contained data catalog ?
(if "multiple orgs&...
0
votes
1
answer
327
views
GCP Data Catalog - search columns containing a dot in the column name
Take the public github dataset as an example
SELECT
*
FROM
`bigquery-public-data.github_repos.commits`
LIMIT
2
There are column names like
difference.old_mode
via search:
column:difference....
1
vote
1
answer
815
views
How to do - wildcard search in Data Catalog (Google Cloud Platform)
How to execute a wildcard/RegEx search in Data Catalog (Google Cloud Platform) ?
It would make sense to search metadata across column names and tag attributes (and there values).
The current ...
1
vote
1
answer
903
views
How to retrieve tagged column using Google Data Catalog Java API
I am able to search for tag templates using the Java Google Data Catalog library, and do sub search on the tag name to find the big query tables the tags are used, but I can't figure out how to find ...