# CREATE SOURCE: PostgreSQL (New Syntax) Creates a new source from PostgreSQL 11+. > **Disambiguation:** This page reflects the new syntax which allows Materialize to handle upstream DDL changes, specifically adding or dropping columns, without downtime. For the deprecated syntax, see the [old reference page](/sql/create-source/postgres/). Creates a new source from PostgreSQL. Materialize supports creating sources from PostgreSQL version 11+. Once a new source is created, you can CREATE TABLE FROM SOURCE from the source to create the corresponding tables in Materialize and start the data ingestion process. ## Prerequisites

To create a source from PostgreSQL 11+, you must first:

For details, see the PostgreSQL integration guides.

## Syntax To create a source from an external PostgreSQL: ```mzsql CREATE SOURCE [IF NOT EXISTS] [IN CLUSTER ] FROM POSTGRES CONNECTION (PUBLICATION '') ; ``` | Syntax element | Description | | --- | --- | | **IF NOT EXISTS** | *Optional.* If specified, do not throw an error if a source with the same name already exists. Instead, issue a notice and skip the source creation. | | `` | The name of the source to create. Names for sources must follow the [naming guidelines](/sql/identifiers/#naming-restrictions). | | **IN CLUSTER** `` | *Optional.* The [cluster](/sql/create-cluster) to maintain this source. Otherwise, the source will be created in the active cluster. {{< tip >}} If possible, use a cluster dedicated just for sources. See also [Operational guidelines](/manage/operational-guidelines/#sources). {{< /tip >}} | | `` | The name of the PostgreSQL connection to use for the source. For details on creating connections, check the [`CREATE CONNECTION`](/sql/create-connection/#postgresql) documentation page. A connection is **reusable** across multiple `CREATE SOURCE` statements. | | `` | The name of the PostgreSQL publication to associate with the source. For details on creating a publication in your PostgreSQL database, see the [integration guides for your PostgreSQL](/ingest-data/postgres/#integration-guides). | ## Details ### Ingesting data After a source is created, you can create tables from the source, referencing the tables in the publication, to start ingesting data. You can create multiple tables that reference the same table in the publication. See [`CREATE TABLE FROM SOURCE`](/sql/create-table/) for details. #### Handling table schema changes The use of the `CREATE SOURCE` with the new [`CREATE TABLE FROM SOURCE`](/sql/create-table/) allows for the handling of certain upstream DDL changes without downtime. See [`CREATE TABLE FROM SOURCE`](/sql/create-table/#handling-table-schema-changes) for details. #### Supported types With the new syntax, after a PostgreSQL source is created, you [`CREATE TABLE FROM SOURCE`](/sql/create-table/) to create a corresponding table in Matererialize and start ingesting data.

Materialize natively supports the following PostgreSQL types (including the array type for each of the types):

  • bool
  • bpchar
  • bytea
  • char
  • date
  • daterange
  • float4
  • float8
  • int2
  • int2vector
  • int4
  • int4range
  • int8
  • int8range
  • interval
  • json
  • jsonb
  • numeric
  • numrange
  • oid
  • text
  • time
  • timestamp
  • timestamptz
  • tsrange
  • tstzrange
  • uuid
  • varchar
For more information, including strategies for handling unsupported types, see [`CREATE TABLE FROM SOURCE`](/sql/create-table/). #### Upstream table truncation restrictions

Avoid truncating upstream tables that are being replicated into Materialize. If a replicated upstream table is truncated, the corresponding subsource(s)/table(s) in Materialize becomes inaccessible and will not produce any data until it is recreated.

Instead of truncating, use an unqualified DELETE to remove all rows from the upstream table:

DELETE FROM t;
For additional considerations, see also [`CREATE TABLE`](/sql/create-table/). ### Publication membership

PostgreSQL’s logical replication API does not provide a signal when users remove tables from publications. Because of this, Materialize relies on periodic checks to determine if a table has been removed from a publication, at which time it generates an irrevocable error, preventing any values from being read from the table.

However, it is possible to remove a table from a publication and then re-add it before Materialize notices that the table was removed. In this case, Materialize can no longer provide any consistency guarantees about the data we present from the table and, unfortunately, is wholly unaware that this occurred.

To mitigate this issue, if you need to drop and re-add a table to a publication, ensure that you remove the table/subsource from the source before re-adding it using the DROP SOURCE command. ### PostgreSQL replication slots When you define a source, Materialize will automatically create a **replication slot** in the upstream PostgreSQL database (see [PostgreSQL replication slots](#postgresql-replication-slots)). Each source ingests the raw replication stream data for all tables in the specified publication using **a single** replication slot. This allows you to minimize the performance impact on the upstream database as well as reuse the same source across multiple materializations. The name of the replication slot created by Materialize is prefixed with `materialize_`. In Materialize, you can query the `mz_internal.mz_postgres_sources` to find the replication slots created: ```mzsql SELECT id, replication_slot FROM mz_internal.mz_postgres_sources; ``` ``` id | replication_slot ---------+---------------------------------------------- u8 | materialize_7f8a72d0bf2a4b6e9ebc4e61ba769b71 ``` > **Tip:**
    >
  • >

    For PostgreSQL 13+, set a reasonable value > for max_slot_wal_keep_size > to limit the amount of storage used by replication slots.

    >
  • >
  • >

    If you stop using Materialize, or if either the Materialize instance or > the PostgreSQL instance crash, delete any replication slots. You can query > the mz_internal.mz_postgres_sources table to look up the name of the > replication slot created for each source.

    >
  • >
  • >

    If you delete all objects that depend on a source without also dropping > the source, the upstream replication slot remains and will continue to > accumulate data so that the source can resume in the future. To avoid > unbounded disk space usage, make sure to use DROP SOURCE or manually delete the replication slot.

    >
  • >
## Examples ### Prerequisites

To create a source from PostgreSQL 11+, you must first:

  • Configure upstream PostgreSQL instance
    • Set up logical replication.
    • Create a publication.
    • Create a replication user and password for Materialize to use to connect.
  • Configure network security
    • Ensure Materialize can connect to your PostgreSQL instance.
  • Create a connection to PostgreSQL in Materialize
    • The connection setup depends on the network security configuration.

For details, see the PostgreSQL integration guides.

### Create a source {#create-source-example} Once you have configured the upstream PostgreSQL, network security, and created the connection, you can create the source. In this example, the PostgreSQL publication is `mz_source` and the connection to PostgreSQL is `pg_connection`. ```mzsql /* This example assumes: - In the upstream PostgreSQL, you have defined: - replication user and password with the appropriate access. - a publication named `mz_source` for the `public.items` and `public.orders` tables. - In Materialize: - You have created a secret for the PostgreSQL password. - You have defined the connection to the upstream PostgreSQL. - You have used the connection to create a source. For example (substitute with your configuration): CREATE SECRET pgpass AS ''; -- substitute CREATE CONNECTION pg_connection TO POSTGRES ( HOST '', -- substitute DATABASE , -- substitute USER , -- substitute PASSWORD SECRET pgpass -- [, ] ); */ CREATE SOURCE pg_source FROM POSTGRES CONNECTION pg_connection ( PUBLICATION 'mz_source' ); ``` After a source is created, you can create tables from the source, referencing specific upstream table(s). Use a [DDL transaction block](/sql/begin/#ddl-only-transactions) to create multiple tables from the same source. ```mzsql BEGIN; CREATE TABLE items FROM SOURCE pg_source(REFERENCE public.items) ; CREATE TABLE orders FROM SOURCE pg_source(REFERENCE public.orders) ; COMMIT; ``` {{< note >}} - Although the example creates the tables with the same name as the upstream tables, the tables in Materialize can have different names. - You can create multiple tables that reference the same upstream table. {{< /note >}} For more information, see [`CREATE TABLE`](/sql/create-table/). ## Related pages - [`CREATE TABLE`](/sql/create-table/) - [`CREATE SECRET`](/sql/create-secret) - [`CREATE CONNECTION`](/sql/create-connection) - [PostgreSQL integration guides](/ingest-data/postgres/#integration-guides) [`enum`]: https://www.postgresql.org/docs/current/datatype-enum.html [`money`]: https://www.postgresql.org/docs/current/datatype-money.html