Add sample for fetching total_rows from query results. #7217

tswast · 2019-01-29T19:15:33Z

In response to feedback internally Bug 123578325, add a sample (acts as a system test, too) which shows how to populate the total_rows value.

shollyman · 2019-01-29T21:33:46Z

bigquery/docs/snippets.py

+    results = query_job.result()  # Waits for query to complete.
+    next(iter(results))  # Fetch the first page of results, which contains total_rows.
+    print("Got {} rows.".format(results.total_rows))
+    # [START bigquery_query_total_rows]


shollyman · 2019-01-29T21:33:56Z

bigquery/docs/snippets.py



+def test_client_query_total_rows(client, capsys):
+    """Run a query an just check for how many rows."""


yan-hic · 2019-01-30T16:20:17Z

Would fetching the first page of result() not make an API call ? If so, why not reading as per #6117. _query_results of the query job does get updated by result()

tswast · 2019-01-30T20:27:07Z

@yiga2 Calling result() makes an API request, but not to tabledata.list, where we get the total_rows from. As you point out in #6117, the request to wait for the job to complete actually contains a total_rows value as well, but we don't pass that through to the RowIterator.

I agree that the way I show in this sample is a bit awkward, and I was thinking about fetching the first page automatically in #4152, but I changed my mind about prefetching, since it could mean an extra unnecessary API request if someone just cares that a query completes and not the actual result rows.

Perhaps we could find a way to pass the total_rows through to the RowIterator after it is constructed in result() to avoid the extra API request.

yan-hic · 2019-03-15T02:04:44Z

@tswast any further consideration on getting total_rows without an add'l API call ?

tswast · 2019-03-15T16:43:22Z

@yiga2 I still think it's a good idea. I just haven't gotten around to. We're open to PRs. The change would likely be to the result method of QueryJob in job.py.

yan-hic · 2019-03-18T18:25:49Z

@tswast Feel free to bundle with other enhancements as PR would be very light otherwise.

Suggested code change, after

google-cloud-python/bigquery/google/cloud/bigquery/job.py

Line 2776 in a6d8499

schema = self._query_results.schema

total_rows = self._query_results.total_rows

yan-hic · 2019-04-03T09:36:33Z

@tswast any feedback on this ?

tswast · 2019-04-03T13:18:28Z

@yiga2 I've prepared #7622 which addresses this issue, but in a slightly more complicated way than we propose here because I wanted to also handle more cases where Client.list_rows is called directly. My PR has had one pass at review, but I'm waiting on a follow-up now that I've addressed the requested changes.

yan-hic · 2019-04-03T13:23:34Z

Cool - thanks Tim !

Add sample for fetching total_rows from query results.

c7e3c18

tswast requested a review from shollyman January 29, 2019 19:15

tswast requested a review from crwilcox as a code owner January 29, 2019 19:15

googlebot added the cla: yes This human has signed the Contributor License Agreement. label Jan 29, 2019

tseaver approved these changes Jan 29, 2019

View reviewed changes

shollyman approved these changes Jan 29, 2019

View reviewed changes

tseaver added api: bigquery Issues related to the BigQuery API. type: docs Improvement to the documentation for an API. labels Jan 29, 2019

Copypasta

0f2e10a

tswast merged commit 51c97cd into googleapis:master Jan 30, 2019

tswast deleted the b123578325-total-rows branch January 30, 2019 00:42

tswast mentioned this pull request Jan 30, 2019

BigQuery: populate RowIterator.total_rows after running a query job #6117

Closed

tswast mentioned this pull request Mar 23, 2019

BigQuery: Add tqdm progress bar for downloads #7552

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add sample for fetching total_rows from query results. #7217

Add sample for fetching total_rows from query results. #7217

Uh oh!

tswast commented Jan 29, 2019

Uh oh!

shollyman Jan 29, 2019

Uh oh!

shollyman Jan 29, 2019

Uh oh!

yan-hic commented Jan 30, 2019

Uh oh!

tswast commented Jan 30, 2019

Uh oh!

yan-hic commented Mar 15, 2019 •

edited

Loading

Uh oh!

tswast commented Mar 15, 2019

Uh oh!

yan-hic commented Mar 18, 2019

Uh oh!

yan-hic commented Apr 3, 2019

Uh oh!

tswast commented Apr 3, 2019

Uh oh!

yan-hic commented Apr 3, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants



		def test_client_query_total_rows(client, capsys):
		"""Run a query an just check for how many rows."""

Add sample for fetching total_rows from query results. #7217

Add sample for fetching total_rows from query results. #7217

Uh oh!

Conversation

tswast commented Jan 29, 2019

Uh oh!

shollyman Jan 29, 2019

Choose a reason for hiding this comment

Uh oh!

shollyman Jan 29, 2019

Choose a reason for hiding this comment

Uh oh!

yan-hic commented Jan 30, 2019

Uh oh!

tswast commented Jan 30, 2019

Uh oh!

yan-hic commented Mar 15, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

tswast commented Mar 15, 2019

Uh oh!

yan-hic commented Mar 18, 2019

Uh oh!

yan-hic commented Apr 3, 2019

Uh oh!

tswast commented Apr 3, 2019

Uh oh!

yan-hic commented Apr 3, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

yan-hic commented Mar 15, 2019 •

edited

Loading