PostgreSQL metadata collector
Summarize
Summary of PostgreSQL metadata collector
The PostgreSQL metadata collector provides read-only access to metadata from PostgreSQL databases, enabling ServiceNow customers to harvest detailed metadata for tables, columns, views, materialized views, functions, and stored procedures. It supports both self-hosted PostgreSQL instances and managed services like AWS RDS. This collector makes metadata searchable and discoverable within the ServiceNow data catalog, facilitating improved data governance and management.
Show less
Key Features
- Authentication: Supports username/password and AWS IAM authentication for secure access.
- Comprehensive Metadata Cataloging: Captures detailed attributes such as column names, descriptions, data types, keys, table schemas, indexes, views, materialized views, functions, and stored procedures. Each version of overloaded functions and stored procedures is uniquely identified.
- Profiling and Sampling: When enabled and with appropriate read permissions, collects column statistics including average length, value distributions, distinct counts, min/max values, non-null counts, sample string values, and table row counts to provide deeper insight into data quality and characteristics.
- Relationships and Lineage: Displays relationships between tables, columns, indexes, schemas, views, and materialized views. Provides column-level lineage tracing for views and materialized views, linking derived columns back to source table columns through SQL operations like filtering, sorting, and aggregation.
Practical Guidance for ServiceNow Customers
- Before running the collector, create a PostgreSQL user with proper read permissions to enable metadata harvesting and profiling.
- Create and configure a PostgreSQL metadata collector within ServiceNow to import metadata from your PostgreSQL environment.
- Use the harvested metadata and lineage information to enhance data cataloging, improve data discoverability, and support governance initiatives.
Provides read-only access to metadata from a PostgreSQL database.
The collector harvests metadata for PostgreSQL tables and columns, making them searchable and discoverable in the data catalog. Supports both self-hosted PostgreSQL instances and managed PostgreSQL instances, such as those hosted on AWS RDS.
Authentication supported
Username and password authentication and AWS IAM authentication.
Metadata cataloged
The PostgreSQL collector catalogs the following information.
| Object | Information cataloged |
|---|---|
| Columns | Name, Description, JDBC type, Column Type, Is Nullable, Default Value, Key type (Primary, foreign), column size, column index |
| Table | Name, description, primary key, schema |
| Table Index | Index Cardinality, Column name, Index Type, Index Name, is non Unique, Ordinal Position, Pages, Sort Sequence |
| Views | Name, description, SQL definition |
| Materialized View | Name, description, SQL definition |
| Schema | Identifier, Name |
| Database | Type, name, identifier, server, port, environment, JDBC URL |
| Function | Name, Description, Function Type |
| Stored Procedure | Name, Description, Stored Procedure Type |
| Object | Information cataloged |
|---|---|
| Column |
|
| Table |
|
Relationships between objects
Catalog pages show relationships between the following data asset types:
| Data asset page | Relationship |
|---|---|
| Table | Columns, Table Indexes |
| Columns | Table |
| Schema |
|
| View |
|
| Materialized View |
|
Lineage for PostgreSQL
The collector identifies column-level lineage for Views and Materialized Views, tracing data sources from view columns to source table columns across SQL expressions and subqueries.
Establishes relationships between Views and source Table columns that:
- Sort rows (SQL ORDER BY)
- Filter rows (SQL WHERE and HAVING clauses)
- Aggregate rows (SQL GROUP BY)