tag:google.com,2016:dataflow-release-notes Dataflow - Release notes Google Cloud Platform 2026-04-07T00:00:00-07:00

April 07, 2026 tag:google.com,2016:dataflow-release-notes#April_07_2026 2026-04-07T00:00:00-07:00 Feature

When you use min_ram or cpu_count resource hints for pipeline steps that don't require accelerators, Auto VM Selection (Instance Flexibility) is enabled automatically. With Auto VM Selection, workers are provisioned from a curated list of machine types that meet your RAM and CPU requirements. For more information, see Auto VM Selection for worker machine types.
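
As a rough illustration, pipeline-wide resource hints are usually passed to an Apache Beam pipeline as `--resource_hints` options. The sketch below only assembles such an argument list; the helper name `resource_hint_args` and the example values are hypothetical, and the exact flag spelling is an assumption that should be checked against the resource hints documentation for your SDK version.

```python
from typing import List, Optional


def resource_hint_args(min_ram_gb: Optional[int] = None,
                       cpu_count: Optional[int] = None) -> List[str]:
    """Build --resource_hints flags (hypothetical helper, not a Dataflow API)."""
    args: List[str] = []
    if min_ram_gb is not None:
        if min_ram_gb <= 0:
            raise ValueError("min_ram_gb must be positive")
        # Assumed flag format: --resource_hints=min_ram=<N>GB
        args.append(f"--resource_hints=min_ram={min_ram_gb}GB")
    if cpu_count is not None:
        if cpu_count <= 0:
            raise ValueError("cpu_count must be positive")
        # Assumed flag format: --resource_hints=cpu_count=<N>
        args.append(f"--resource_hints=cpu_count={cpu_count}")
    return args


if __name__ == "__main__":
    # Example values only; choose hints that match your own steps.
    print(resource_hint_args(min_ram_gb=16, cpu_count=4))
```

The resulting list could be appended to the arguments used to launch a pipeline. Hints for steps that require accelerators are out of scope here, because this note applies only to steps without them.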

February 27, 2026 tag:google.com,2016:dataflow-release-notes#February_27_2026 2026-02-27T00:00:00-08:00 Feature

Dataflow support for the C4A machine series of Arm processors is now generally available. Arm-based VMs are optimized for power efficiency and can provide improved price-performance for many workloads. For more information, see Use Arm VMs on Dataflow.

February 02, 2026 tag:google.com,2016:dataflow-release-notes#February_02_2026 2026-02-02T00:00:00-08:00 Feature

Dataflow Managed I/O now supports rolling upgrades for streaming jobs. With this feature, Dataflow upgrades your Managed I/O connectors in running pipelines as new connector versions become available. For more information, see Automatic upgrades.

January 20, 2026 tag:google.com,2016:dataflow-release-notes#January_20_2026 2026-01-20T00:00:00-08:00 Feature

Dataflow is available in the Bangkok (asia-southeast3) region. Learn more about Google Cloud locations.

January 13, 2026 tag:google.com,2016:dataflow-release-notes#January_13_2026 2026-01-13T00:00:00-08:00 Feature

Dataflow now shows a notice that states when the Dataflow Runner v2 container image of a streaming pipeline is scheduled to be upgraded. To use a new image and avoid the scheduled maintenance, launch a replacement job before the upgrade. For more information, see Runner v2 harness update.

November 21, 2025 tag:google.com,2016:dataflow-release-notes#November_21_2025 2025-11-21T00:00:00-08:00 Feature

Dataflow now supports speculative execution for batch pipelines. This feature mitigates the impact of slow-running tasks (stragglers) by launching a redundant execution of these tasks. The first task to finish is used, and the other is canceled, which can improve the overall completion time of your pipeline. This feature is generally available. For more information, see Use speculative execution to avoid stragglers.
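
The general idea of speculative execution can be sketched in a few lines. This is a conceptual illustration only, not how Dataflow implements the feature: it starts both copies up front instead of detecting a straggler first, and the names `run_speculatively` and `example_task` are hypothetical.

```python
import concurrent.futures
import threading
import time
from typing import Callable


def run_speculatively(task: Callable[[threading.Event, int], str],
                      copies: int = 2) -> str:
    """Run redundant copies of `task`, return the first result, stop the rest."""
    cancel = threading.Event()
    with concurrent.futures.ThreadPoolExecutor(max_workers=copies) as pool:
        futures = [pool.submit(task, cancel, attempt)
                   for attempt in range(copies)]
        done, _ = concurrent.futures.wait(
            futures, return_when=concurrent.futures.FIRST_COMPLETED)
        # Threads cannot be killed, so signal the slower copies to stop.
        cancel.set()
        return next(iter(done)).result()


def example_task(cancel: threading.Event, attempt: int) -> str:
    # Attempt 0 simulates a straggler; attempt 1 is the redundant copy.
    duration = 2.0 if attempt == 0 else 0.05
    # Event.wait returns True if cancellation was signaled before the timeout.
    if cancel.wait(timeout=duration):
        return f"attempt {attempt} canceled"
    return f"attempt {attempt} finished"


if __name__ == "__main__":
    start = time.monotonic()
    print(run_speculatively(example_task))
    print(f"elapsed: {time.monotonic() - start:.2f}s")
```

Cancellation here is cooperative: the slower copy checks a shared event and exits early, so the overall run takes roughly as long as the fastest copy.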

September 24, 2025 tag:google.com,2016:dataflow-release-notes#September_24_2025 2025-09-24T00:00:00-07:00 Feature

For jobs that use GPUs, Dataflow now supports the flex-start provisioning model. Flex-start can improve your access to constrained GPU resources for short-duration workloads. This feature is available in Preview and is for batch pipelines only. For more information, see Configure a provisioning model.

September 08, 2025 tag:google.com,2016:dataflow-release-notes#September_08_2025 2025-09-08T00:00:00-07:00 Feature

Dataflow now supports using secure tags to set firewall rules on worker VMs. For more information, see Use secure tags with Dataflow.

August 27, 2025 tag:google.com,2016:dataflow-release-notes#August_27_2025 2025-08-27T00:00:00-07:00 Feature

Dataflow supports TPUs, Google's custom-designed AI accelerators that are optimized for large-scale AI/ML workloads. This feature lets you accelerate inference workloads on frameworks like PyTorch, JAX, and TensorFlow. This feature is generally available with an allowlist. For more information, see Dataflow support for TPUs.

Feature

Dataflow supports specifically targeted reservations for pipelines using accelerators (GPUs or TPUs). This functionality is generally available with an allowlist. For more information, see Use Compute Engine reservations with Dataflow.

Change

Dataflow supports NVIDIA® H100 and NVIDIA® H100 Mega GPU types. For more information, see Dataflow support for GPUs.

August 26, 2025 tag:google.com,2016:dataflow-release-notes#August_26_2025 2025-08-26T00:00:00-07:00 Fixed

Dataflow Runner v2 fixes an issue that could cause data discrepancies when using splittable DoFns, particularly when processing large datasets as side inputs. This fix ensures that all data is accurately processed and transmitted within the pipeline. This improvement is available in recent Dataflow service releases, and is automatically enabled when using Dataflow Runner v2.

Note: After this fix, pipelines that previously experienced data loss due to this issue might consume more resources (such as CPU, memory, and processing time) because more data is being processed. This increase in resource usage is expected and reflects the correct behavior of the pipeline.

August 11, 2025 tag:google.com,2016:dataflow-release-notes#August_11_2025 2025-08-11T00:00:00-07:00 Feature

Dataflow now automatically detects performance bottlenecks in streaming jobs. You can see the cause of the bottleneck in the Step Info panel to help with troubleshooting.

For more information, see Troubleshoot bottlenecks.

June 26, 2025 tag:google.com,2016:dataflow-release-notes#June_26_2025 2025-06-26T00:00:00-07:00 Feature

Dataflow now supports an automated parallel update workflow for streaming jobs. This feature helps minimize disruption by launching a replacement job that runs in parallel with the existing job. After a duration that you specify, the old job is automatically drained.

For more information, see Run parallel pipelines.

June 09, 2025 tag:google.com,2016:dataflow-release-notes#June_09_2025 2025-06-09T00:00:00-07:00 Feature

Dataflow now supports right fitting for streaming jobs. Right fitting lets you specify resource requirements for an entire pipeline or for specific pipeline steps. Previously, right fitting was only supported for batch pipelines. For more information, see Streaming right fitting.

May 12, 2025 tag:google.com,2016:dataflow-release-notes#May_12_2025 2025-05-12T00:00:00-07:00 Libraries

Go

0.11.0 (2025-05-06)

Features
  • dataflow: A new enum StreamingMode is added (2f22244)
  • dataflow: A new field bugs is added to message .google.dataflow.v1beta3.SdkVersion (2f22244)
  • dataflow: A new field data_sampling is added to message .google.dataflow.v1beta3.DebugOptions (2f22244)
  • dataflow: A new field default_streaming_mode is added to message .google.dataflow.v1beta3.TemplateMetadata (2f22244)
  • dataflow: A new field default_value is added to message .google.dataflow.v1beta3.ParameterMetadata (2f22244)
  • dataflow: A new field disk_size_gb is added to message .google.dataflow.v1beta3.RuntimeEnvironment (2f22244)
  • dataflow: A new field dynamic_destinations is added to message .google.dataflow.v1beta3.PubsubLocation (2f22244)
  • dataflow: A new field enable_launcher_vm_serial_port_logging is added to message .google.dataflow.v1beta3.FlexTemplateRuntimeEnvironment (2f22244)
  • dataflow: A new field enum_options is added to message .google.dataflow.v1beta3.ParameterMetadata (2f22244)
  • dataflow: A new field group_name is added to message .google.dataflow.v1beta3.ParameterMetadata (2f22244)
  • dataflow: A new field hidden_ui is added to message .google.dataflow.v1beta3.ParameterMetadata (2f22244)
  • dataflow: A new field image_repository_cert_path is added to message .google.dataflow.v1beta3.ContainerSpec (2f22244)
  • dataflow: A new field image_repository_password_secret_id is added to message .google.dataflow.v1beta3.ContainerSpec (2f22244)
  • dataflow: A new field image_repository_username_secret_id is added to message .google.dataflow.v1beta3.ContainerSpec (2f22244)
  • dataflow: A new field name is added to message .google.dataflow.v1beta3.ListJobsRequest (2f22244)
  • dataflow: A new field parent_name is added to message .google.dataflow.v1beta3.ParameterMetadata (2f22244)
  • dataflow: A new field parent_trigger_values is added to message .google.dataflow.v1beta3.ParameterMetadata (2f22244)
  • dataflow: A new field runtime_updatable_params is added to message .google.dataflow.v1beta3.Job (2f22244)
  • dataflow: A new field satisfies_pzi is added to message .google.dataflow.v1beta3.Job (2f22244)
  • dataflow: A new field service_resources is added to message .google.dataflow.v1beta3.Job (2f22244)
  • dataflow: A new field step_names_hash is added to message .google.dataflow.v1beta3.PipelineDescription (2f22244)
  • dataflow: A new field straggler_info is added to message .google.dataflow.v1beta3.WorkItemDetails (2f22244)
  • dataflow: A new field straggler_summary is added to message .google.dataflow.v1beta3.StageSummary (2f22244)
  • dataflow: A new field streaming_mode is added to message .google.dataflow.v1beta3.Environment (2f22244)
  • dataflow: A new field streaming_mode is added to message .google.dataflow.v1beta3.FlexTemplateRuntimeEnvironment (2f22244)
  • dataflow: A new field streaming_mode is added to message .google.dataflow.v1beta3.RuntimeEnvironment (2f22244)
  • dataflow: A new field streaming is added to message .google.dataflow.v1beta3.TemplateMetadata (2f22244)
  • dataflow: A new field supports_at_least_once is added to message .google.dataflow.v1beta3.TemplateMetadata (2f22244)
  • dataflow: A new field supports_exactly_once is added to message .google.dataflow.v1beta3.TemplateMetadata (2f22244)
  • dataflow: A new field trie is added to message .google.dataflow.v1beta3.MetricUpdate (2f22244)
  • dataflow: A new field update_mask is added to message .google.dataflow.v1beta3.UpdateJobRequest (2f22244)
  • dataflow: A new field use_streaming_engine_resource_based_billing is added to message .google.dataflow.v1beta3.Environment (2f22244)
  • dataflow: A new field user_display_properties is added to message .google.dataflow.v1beta3.JobMetadata (2f22244)
  • dataflow: A new message DataSamplingConfig is added (2f22244)
  • dataflow: A new message HotKeyDebuggingInfo is added (2f22244)
  • dataflow: A new message ParameterMetadataEnumOption is added (2f22244)
  • dataflow: A new message RuntimeUpdatableParams is added (2f22244)
  • dataflow: A new message SdkBug is added (2f22244)
  • dataflow: A new message ServiceResources is added (2f22244)
  • dataflow: A new message Straggler is added (2f22244)
  • dataflow: A new message StragglerInfo is added (2f22244)
  • dataflow: A new message StragglerSummary is added (2f22244)
  • dataflow: A new message StreamingStragglerInfo is added (2f22244)
  • dataflow: A new method_signature job,update_mask is added to method UpdateJob in service JobsV1Beta3 (2f22244)
  • dataflow: A new value BIGQUERY_TABLE is added to enum ParameterType (2f22244)
  • dataflow: A new value BOOLEAN is added to enum ParameterType (2f22244)
  • dataflow: A new value ENUM is added to enum ParameterType (2f22244)
  • dataflow: A new value GO is added to enum Language (2f22244)
  • dataflow: A new value JAVASCRIPT_UDF_FILE is added to enum ParameterType (2f22244)
  • dataflow: A new value KAFKA_READ_TOPIC is added to enum ParameterType (2f22244)
  • dataflow: A new value KAFKA_TOPIC is added to enum ParameterType (2f22244)
  • dataflow: A new value KAFKA_WRITE_TOPIC is added to enum ParameterType (2f22244)
  • dataflow: A new value KMS_KEY_NAME is added to enum ParameterType (2f22244)
  • dataflow: A new value MACHINE_TYPE is added to enum ParameterType (2f22244)
  • dataflow: A new value NUMBER is added to enum ParameterType (2f22244)
  • dataflow: A new value SERVICE_ACCOUNT is added to enum ParameterType (2f22244)
  • dataflow: A new value WORKER_REGION is added to enum ParameterType (2f22244)
  • dataflow: A new value WORKER_ZONE is added to enum ParameterType (2f22244)
Bug Fixes
  • dataflow: An existing oauth_scope `https (2f22244)
Documentation
  • dataflow: A comment for enum JobState is changed (2f22244)
  • dataflow: A comment for enum WorkerIPAddressConfiguration is changed (2f22244)
  • dataflow: A comment for enum value JOB_VIEW_ALL in enum JobView is changed (2f22244)
  • dataflow: A comment for field additional_experiments in message .google.dataflow.v1beta3.RuntimeEnvironment is changed (2f22244)
  • dataflow: A comment for field additional_user_labels in message .google.dataflow.v1beta3.RuntimeEnvironment is changed (2f22244)
  • dataflow: A comment for field bypass_temp_dir_validation in message .google.dataflow.v1beta3.RuntimeEnvironment is changed (2f22244)
  • dataflow: A comment for field capabilities in message .google.dataflow.v1beta3.SdkHarnessContainerImage is changed (2f22244)
  • dataflow: A comment for field current_state in message .google.dataflow.v1beta3.Job is changed (2f22244)
  • dataflow: A comment for field dataset in message .google.dataflow.v1beta3.Environment is changed (2f22244)
  • dataflow: A comment for field debug_options in message .google.dataflow.v1beta3.Environment is changed (2f22244)
  • dataflow: A comment for field dump_heap_on_oom in message .google.dataflow.v1beta3.FlexTemplateRuntimeEnvironment is changed (2f22244)
  • dataflow: A comment for field dynamic_template in message .google.dataflow.v1beta3.LaunchTemplateRequest is changed (2f22244)
  • dataflow: A comment for field enable_hot_key_logging in message .google.dataflow.v1beta3.DebugOptions is changed (2f22244)
  • dataflow: A comment for field enable_streaming_engine in message .google.dataflow.v1beta3.RuntimeEnvironment is changed (2f22244)
  • dataflow: A comment for field environment in message .google.dataflow.v1beta3.Job is changed (2f22244)
  • dataflow: A comment for field flex_resource_scheduling_goal in message .google.dataflow.v1beta3.Environment is changed (2f22244)
  • dataflow: A comment for field gcs_path in message .google.dataflow.v1beta3.DynamicTemplateLaunchParams is changed (2f22244)
  • dataflow: A comment for field gcs_path in message .google.dataflow.v1beta3.LaunchTemplateRequest is changed (2f22244)
  • dataflow: A comment for field id in message .google.dataflow.v1beta3.Job is changed (2f22244)
  • dataflow: A comment for field ip_configuration in message .google.dataflow.v1beta3.RuntimeEnvironment is changed (2f22244)
  • dataflow: A comment for field job_name in message .google.dataflow.v1beta3.LaunchTemplateParameters is changed (2f22244)
  • dataflow: A comment for field kms_key_name in message .google.dataflow.v1beta3.RuntimeEnvironment is changed (2f22244)
  • dataflow: A comment for field launch_parameters in message .google.dataflow.v1beta3.LaunchTemplateRequest is changed (2f22244)
  • dataflow: A comment for field location in message .google.dataflow.v1beta3.Job is changed (2f22244)
  • dataflow: A comment for field machine_type in message .google.dataflow.v1beta3.RuntimeEnvironment is changed (2f22244)
  • dataflow: A comment for field max_workers in message .google.dataflow.v1beta3.RuntimeEnvironment is changed (2f22244)
  • dataflow: A comment for field name in message .google.dataflow.v1beta3.Job is changed (2f22244)
  • dataflow: A comment for field network in message .google.dataflow.v1beta3.RuntimeEnvironment is changed (2f22244)
  • dataflow: A comment for field num_workers in message .google.dataflow.v1beta3.RuntimeEnvironment is changed (2f22244)
  • dataflow: A comment for field project_id in message .google.dataflow.v1beta3.Job is changed (2f22244)
  • dataflow: A comment for field requested_state in message .google.dataflow.v1beta3.Job is changed (2f22244)
  • dataflow: A comment for field save_heap_dumps_to_gcs_path in message .google.dataflow.v1beta3.FlexTemplateRuntimeEnvironment is changed (2f22244)
  • dataflow: A comment for field service_account_email in message .google.dataflow.v1beta3.Environment is changed (2f22244)
  • dataflow: A comment for field service_account_email in message .google.dataflow.v1beta3.RuntimeEnvironment is changed (2f22244)
  • dataflow: A comment for field service_kms_key_name in message .google.dataflow.v1beta3.Environment is changed (2f22244)
  • dataflow: A comment for field service_options in message .google.dataflow.v1beta3.Environment is changed (2f22244)
  • dataflow: A comment for field set in message .google.dataflow.v1beta3.MetricUpdate is changed (2f22244)
  • dataflow: A comment for field subnetwork in message .google.dataflow.v1beta3.RuntimeEnvironment is changed (2f22244)
  • dataflow: A comment for field temp_location in message .google.dataflow.v1beta3.RuntimeEnvironment is changed (2f22244)
  • dataflow: A comment for field transform_name_mapping in message .google.dataflow.v1beta3.Job is changed (2f22244)
  • dataflow: A comment for field type in message .google.dataflow.v1beta3.Job is changed (2f22244)
  • dataflow: A comment for field worker_region in message .google.dataflow.v1beta3.Environment is changed (2f22244)
  • dataflow: A comment for field worker_region in message .google.dataflow.v1beta3.RuntimeEnvironment is changed (2f22244)
  • dataflow: A comment for field worker_zone in message .google.dataflow.v1beta3.Environment is changed (2f22244)
  • dataflow: A comment for field worker_zone in message .google.dataflow.v1beta3.RuntimeEnvironment is changed (2f22244)
  • dataflow: A comment for field zone in message .google.dataflow.v1beta3.RuntimeEnvironment is changed (2f22244)
  • dataflow: A comment for message DynamicTemplateLaunchParams is changed (2f22244)
  • dataflow: A comment for message Job is changed (2f22244)
  • dataflow: A comment for message JobExecutionStageInfo is changed (2f22244)
  • dataflow: A comment for message JobMetrics is changed (2f22244)
  • dataflow: A comment for message LaunchTemplateParameters is changed (2f22244)
  • dataflow: A comment for message MetricUpdate is changed (2f22244)
  • dataflow: A comment for message SdkHarnessContainerImage is changed (2f22244)
  • dataflow: A comment for message Step is changed (2f22244)
  • dataflow: A comment for method AggregatedListJobs in service JobsV1Beta3 is changed (2f22244)
  • dataflow: A comment for method CreateJob in service JobsV1Beta3 is changed (2f22244)
  • dataflow: A comment for method CreateJobFromTemplate in service TemplatesService is changed (2f22244)
  • dataflow: A comment for method GetTemplate in service TemplatesService is changed (2f22244)
  • dataflow: A comment for method LaunchTemplate in service TemplatesService is changed (2f22244)
  • dataflow: A comment for method ListJobs in service JobsV1Beta3 is changed (2f22244)
  • dataflow: A comment for service FlexTemplatesService is changed (2f22244)
April 21, 2025 tag:google.com,2016:dataflow-release-notes#April_21_2025 2025-04-21T00:00:00-07:00 Libraries

Go

0.10.6 (2025-04-15)

Bug Fixes
  • dataflow: Update google.golang.org/api to 0.229.0 (3319672)
March 17, 2025 tag:google.com,2016:dataflow-release-notes#March_17_2025 2025-03-17T00:00:00-07:00 Libraries

Go

0.10.5 (2025-03-13)

Bug Fixes
  • dataflow: Update golang.org/x/net to 0.37.0 (1144978)
March 13, 2025 tag:google.com,2016:dataflow-release-notes#March_13_2025 2025-03-13T00:00:00-07:00 Feature

Dataflow now supports data lineage. Data lineage lets you track how data moves through your systems. This feature is generally available (GA). For more information, see Use data lineage in Dataflow.

March 10, 2025 tag:google.com,2016:dataflow-release-notes#March_10_2025 2025-03-10T00:00:00-07:00 Libraries

Go

0.10.4 (2025-03-06)

Bug Fixes
  • dataflow: Fix out-of-sync version.go (28f0030)
March 04, 2025 tag:google.com,2016:dataflow-release-notes#March_04_2025 2025-03-04T00:00:00-08:00 Feature

Dataflow is now available in Stockholm (europe-north2).

January 21, 2025 tag:google.com,2016:dataflow-release-notes#January_21_2025 2025-01-21T00:00:00-08:00 Feature

Managed I/O now supports automatic upgrades for supported I/O connectors. Using this feature, Dataflow pipelines automatically use the latest reliable version of the connector. This feature is generally available (GA). For more information, see Dataflow managed I/O.

January 06, 2025 tag:google.com,2016:dataflow-release-notes#January_06_2025 2025-01-06T00:00:00-08:00 Libraries

Go

0.10.3 (2025-01-02)

Bug Fixes
  • dataflow: Update golang.org/x/net to v0.33.0 (e9b0b69)
November 18, 2024 tag:google.com,2016:dataflow-release-notes#November_18_2024 2024-11-18T00:00:00-08:00 Feature

Dataflow is available in Queretaro, Mexico (northamerica-south1). Learn more about Google Cloud locations.

July 11, 2024 tag:google.com,2016:dataflow-release-notes#July_11_2024 2024-07-11T00:00:00-07:00 Feature

You can now use the Dataflow job builder UI to create and run Dataflow pipelines in the Google Cloud console, without writing any code. This feature is generally available (GA).

July 05, 2024 tag:google.com,2016:dataflow-release-notes#July_05_2024 2024-07-05T00:00:00-07:00 Security

The remote code execution vulnerability, CVE-2024-6387, in OpenSSH has been mitigated. A patched Dataflow VM image that includes an updated OpenSSH is available. For more information about how to apply mitigations, see the GCP-2024-040 security bulletin.

July 03, 2024 tag:google.com,2016:dataflow-release-notes#July_03_2024 2024-07-03T00:00:00-07:00 Security

A remote code execution vulnerability, CVE-2024-6387, was recently discovered in OpenSSH. Dataflow jobs might create VMs that use an OS image with versions of OpenSSH that are vulnerable to CVE-2024-6387. For more information, see the GCP-2024-040 security bulletin.

July 01, 2024 tag:google.com,2016:dataflow-release-notes#July_01_2024 2024-07-01T00:00:00-07:00 Change

Dataflow batch jobs are now canceled after 10 days. Previously, they were canceled after 30 days. For more information, see Quotas and limits.

June 21, 2024 tag:google.com,2016:dataflow-release-notes#June_21_2024 2024-06-21T00:00:00-07:00 Deprecated

Dataflow SQL is deprecated. As of July 31, 2024, you can't access Dataflow SQL in the Google Cloud console. As of January 31, 2025, you can't use Dataflow SQL in the Google Cloud CLI. As a replacement, use Beam SQL.

June 04, 2024 tag:google.com,2016:dataflow-release-notes#June_04_2024 2024-06-04T00:00:00-07:00 Feature

Iceberg read/write support is available through the new Managed I/O Java API. For more information, see Dataflow managed I/O.

May 31, 2024 tag:google.com,2016:dataflow-release-notes#May_31_2024 2024-05-31T00:00:00-07:00 Feature

You can now use Metrics Explorer to find individual DoFns that cause latencies in streaming jobs. These metrics are available in streaming pipelines that use Apache Beam 2.53.0 and later versions. The following new metrics are available:

  • Average message processing time per DoFn (job/dofn_latency_average)
  • Maximum message processing time per DoFn (job/dofn_latency_max)
  • Minimum message processing time per DoFn (job/dofn_latency_min)
  • Number of messages processed per DoFn (job/dofn_latency_num_messages)
  • Oldest active message processing time per DoFn (job/oldest_active_message_age)
  • Total message processing time per DoFn (job/dofn_latency_total)

For more information about Dataflow metrics, see Google Cloud metrics.
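
To query one of the metrics listed above outside Metrics Explorer, you typically need a Cloud Monitoring filter string. The following sketch builds one under two assumptions that are not stated in this note: that the full metric type uses the `dataflow.googleapis.com/` prefix, and that jobs are reported under the `dataflow_job` monitored resource with a `job_name` label. The helper `metric_filter` is hypothetical.

```python
# Assumed prefix for Dataflow metric types in Cloud Monitoring.
DATAFLOW_METRIC_PREFIX = "dataflow.googleapis.com/"

# Metric names as listed in this release note.
DOFN_LATENCY_METRICS = [
    "job/dofn_latency_average",
    "job/dofn_latency_max",
    "job/dofn_latency_min",
    "job/dofn_latency_num_messages",
    "job/oldest_active_message_age",
    "job/dofn_latency_total",
]


def metric_filter(metric: str, job_name: str) -> str:
    """Build a Monitoring filter for one per-DoFn metric (hypothetical helper)."""
    if metric not in DOFN_LATENCY_METRICS:
        raise ValueError(f"unknown metric: {metric}")
    return (
        f'metric.type = "{DATAFLOW_METRIC_PREFIX}{metric}" '
        f'AND resource.type = "dataflow_job" '
        f'AND resource.labels.job_name = "{job_name}"'
    )


if __name__ == "__main__":
    # "my-streaming-job" is a placeholder job name.
    print(metric_filter("job/dofn_latency_max", "my-streaming-job"))
```

The filter string can then be supplied wherever a Monitoring time-series filter is accepted.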

May 17, 2024 tag:google.com,2016:dataflow-release-notes#May_17_2024 2024-05-17T00:00:00-07:00 Change

Dataflow no longer supports the NVIDIA Tesla K80 GPU type. For a list of supported GPU types, see Dataflow support for GPUs.
