Crawler - upbound/provider-aws-glue@v1.15.0

string

databaseNameRef

object

Reference to a CatalogDatabase in glue to populate databaseName.

requiredstring

object

Policies for referencing.

string

string

databaseNameSelector

object

Selector for a CatalogDatabase in glue to populate databaseName.

boolean

object

object

Policies for selection.

string

string

deltaTarget

array

List of nested Delta Lake target arguments. See Delta Target below.

string

createNativeDeltaTable

boolean

deltaTables

array

A list of the Amazon S3 paths to the Delta tables.

writeManifest

boolean

description

string

dynamodbTarget

array

List of nested DynamoDB target arguments. See Dynamodb Target below.

string

boolean

number

array

List of nested Hudi target arguments. See Iceberg Target below.

string

array

A list of glob patterns used to exclude from the crawl.

number

array

One or more Amazon S3 paths that contains Hudi metadata folders as s3://bucket/prefix.

icebergTarget

array

List of nested Iceberg target arguments. See Iceberg Target below.

string

array

A list of glob patterns used to exclude from the crawl.

number

array

One or more Amazon S3 paths that contains Hudi metadata folders as s3://bucket/prefix.

jdbcTarget

array

List of nested JDBC target arguments. See JDBC Target below.

string

object

Reference to a Connection in glue to populate connectionName.

requiredstring

object

Policies for referencing.

string

string

object

Selector for a Connection in glue to populate connectionName.

boolean

object

object

Policies for selection.

string

string

enableAdditionalMetadata

array

Specify a value of RAWTYPES or COMMENTS to enable additional metadata intable responses. RAWTYPES provides the native-level datatype. COMMENTS provides comments associated with a column or table in the database.

lakeFormationConfiguration

array

A list of glob patterns used to exclude from the crawl.

path

string

array

Specifies Lake Formation configuration settings for the crawler. See Lake Formation Configuration below.

accountId

string

useLakeFormationCredentials

boolean

lineageConfiguration

array

Specifies data lineage configuration settings for the crawler. See Lineage Configuration below.

crawlerLineageSettings

string

mongodbTarget

array

List of nested MongoDB target arguments. See MongoDB Target below.

string

object

Reference to a Connection in glue to populate connectionName.

requiredstring

object

Policies for referencing.

string

string

object

Selector for a Connection in glue to populate connectionName.

boolean

object

object

Policies for selection.

string

string

string

boolean

array

A policy that specifies whether to crawl the entire dataset again, or to crawl only folders that were added since the last crawler run.. See Recrawl Policy below.

string

requiredstring

string

object

Reference to a Role in iam to populate role.

requiredstring

object

Policies for referencing.

string

string

roleSelector

object

Selector for a Role in iam to populate role.

boolean

object

object

Policies for selection.

string

string

s3Target

array

List of nested Amazon S3 target arguments. See S3 Target below.

string

string

string

array

A list of glob patterns used to exclude from the crawl.

string

number

string

array

Policy for the crawler's update and deletion behavior. See Schema Change Policy below.

deleteBehavior

string

updateBehavior

string

securityConfiguration

string

tablePrefix

string

tags

object

initProvider

object

THIS IS A BETA FIELD. It will be honored unless the Management Policies feature flag is disabled. InitProvider holds the same fields as ForProvider, with the exception of Identifier and other resource reference fields. The fields that are in InitProvider are merged into ForProvider when the resource is created. The same fields are also added to the terraform ignore_changes hook, to avoid updating them after creation. This is useful for fields that are required on creation, but we do not desire to update them after creation, for example because of an external controller is managing them, like an autoscaler.

catalogTarget

array

List of nested AWS Glue Data Catalog target arguments. See Catalog Target below.

string

string

databaseNameRef

object

Reference to a CatalogDatabase in glue to populate databaseName.

requiredstring

object

Policies for referencing.

string

string

databaseNameSelector

object

Selector for a CatalogDatabase in glue to populate databaseName.

boolean

object

object

Policies for selection.

string

string

string

string

array

A list of catalog tables to be synchronized.

classifiers

array

List of custom classifiers. By default, all AWS classifiers are included in a crawl, but these custom classifiers always override the default classifiers for a given classification.

configuration

string

string

databaseNameRef

object

Reference to a CatalogDatabase in glue to populate databaseName.

requiredstring

object

Policies for referencing.

string

string

databaseNameSelector

object

Selector for a CatalogDatabase in glue to populate databaseName.

boolean

object

object

Policies for selection.

string

string

deltaTarget

array

List of nested Delta Lake target arguments. See Delta Target below.

string

createNativeDeltaTable

boolean

deltaTables

array

A list of the Amazon S3 paths to the Delta tables.

writeManifest

boolean

description

string

dynamodbTarget

array

List of nested DynamoDB target arguments. See Dynamodb Target below.

string

boolean

number

array

List of nested Hudi target arguments. See Iceberg Target below.

string

array

A list of glob patterns used to exclude from the crawl.

number

array

One or more Amazon S3 paths that contains Hudi metadata folders as s3://bucket/prefix.

icebergTarget

array

List of nested Iceberg target arguments. See Iceberg Target below.

string

array

A list of glob patterns used to exclude from the crawl.

number

array

One or more Amazon S3 paths that contains Hudi metadata folders as s3://bucket/prefix.

jdbcTarget

array

List of nested JDBC target arguments. See JDBC Target below.

string

object

Reference to a Connection in glue to populate connectionName.

requiredstring

object

Policies for referencing.

string

string

object

Selector for a Connection in glue to populate connectionName.

boolean

object

object

Policies for selection.

string

string

enableAdditionalMetadata

array

lakeFormationConfiguration

array

A list of glob patterns used to exclude from the crawl.

path

string

array

Specifies Lake Formation configuration settings for the crawler. See Lake Formation Configuration below.

accountId

string

useLakeFormationCredentials

boolean

lineageConfiguration

array

Specifies data lineage configuration settings for the crawler. See Lineage Configuration below.

crawlerLineageSettings

string

mongodbTarget

array

List of nested MongoDB target arguments. See MongoDB Target below.

string

object

Reference to a Connection in glue to populate connectionName.

requiredstring

object

Policies for referencing.

string

string

object

Selector for a Connection in glue to populate connectionName.

boolean

object

object

Policies for selection.

string

string

string

boolean

array

A policy that specifies whether to crawl the entire dataset again, or to crawl only folders that were added since the last crawler run.. See Recrawl Policy below.

recrawlBehavior

string

role

string

roleRef

object

Reference to a Role in iam to populate role.

requiredstring

object

Policies for referencing.

string

string

roleSelector

object

Selector for a Role in iam to populate role.

boolean

object

object

Policies for selection.

string

string

s3Target

array

List of nested Amazon S3 target arguments. See S3 Target below.

string

string

string

array

A list of glob patterns used to exclude from the crawl.

string

number

string

array

Policy for the crawler's update and deletion behavior. See Schema Change Policy below.

deleteBehavior

string

updateBehavior

string

securityConfiguration

string

tablePrefix

string

tags

object

managementPolicies

array

THIS IS A BETA FIELD. It is on by default but can be opted out through a Crossplane feature flag. ManagementPolicies specify the array of actions Crossplane is allowed to take on the managed and external resources. This field is planned to replace the DeletionPolicy field in a future release. Currently, both could be set independently and non-default values would be honored if the feature flag is enabled. If both are custom, the DeletionPolicy field will be ignored. See the design doc for more information: https://github.com/crossplane/crossplane/blob/499895a25d1a1a0ba1604944ef98ac7a1a71f197/design/design-doc-observe-only-resources.md?plain=1#L223 and this one: https://github.com/crossplane/crossplane/blob/444267e84783136daa93568b364a5f01228cacbe/design/one-pager-ignore-changes.md

providerConfigRef

object

ProviderConfigReference specifies how the provider that will be used to create, observe, update, and delete this managed resource should be configured.

requiredstring

object

Policies for referencing.

string

publishConnectionDetailsTo

string

object

PublishConnectionDetailsTo specifies the connection secret config which contains a name, metadata and a reference to secret store config to which any connection details for this managed resource should be written. Connection details frequently include the endpoint, username, and password required to connect to the managed resource.

configRef

object

SecretStoreConfigRef specifies which secret store config should be used for this ConnectionSecret.

requiredstring

object

Policies for referencing.

string

writeConnectionSecretToRef

string

metadata

object

Metadata is the metadata for connection secret.

object

object

string

requiredstring

object

WriteConnectionSecretToReference specifies the namespace and name of a Secret to which any connection details for this managed resource should be written. Connection details frequently include the endpoint, username, and password required to connect to the managed resource. This field is planned to be replaced in a future release in favor of PublishConnectionDetailsTo. Currently, both could be set independently and connection details would be published to both without affecting each other.

requiredstring

namespace

requiredstring

status

object

CrawlerStatus defines the observed state of Crawler.

atProvider

object

No description provided.

arn

string

catalogTarget

array

List of nested AWS Glue Data Catalog target arguments. See Catalog Target below.

string

string

string

string

array

A list of catalog tables to be synchronized.

classifiers

array

List of custom classifiers. By default, all AWS classifiers are included in a crawl, but these custom classifiers always override the default classifiers for a given classification.

configuration

string

string

deltaTarget

array

List of nested Delta Lake target arguments. See Delta Target below.

string

createNativeDeltaTable

boolean

deltaTables

array

A list of the Amazon S3 paths to the Delta tables.

writeManifest

boolean

description

string

dynamodbTarget

array

List of nested DynamoDB target arguments. See Dynamodb Target below.

string

boolean

number

array

List of nested Hudi target arguments. See Iceberg Target below.

string

array

A list of glob patterns used to exclude from the crawl.

number

array

One or more Amazon S3 paths that contains Hudi metadata folders as s3://bucket/prefix.

icebergTarget

array

List of nested Iceberg target arguments. See Iceberg Target below.

string

array

A list of glob patterns used to exclude from the crawl.

number

array

One or more Amazon S3 paths that contains Hudi metadata folders as s3://bucket/prefix.

string

jdbcTarget

array

List of nested JDBC target arguments. See JDBC Target below.

string

enableAdditionalMetadata

array

lakeFormationConfiguration

array

A list of glob patterns used to exclude from the crawl.

path

string

array

Specifies Lake Formation configuration settings for the crawler. See Lake Formation Configuration below.

accountId

string

useLakeFormationCredentials

boolean

lineageConfiguration

array

Specifies data lineage configuration settings for the crawler. See Lineage Configuration below.

crawlerLineageSettings

string

mongodbTarget

array

List of nested MongoDB target arguments. See MongoDB Target below.

string

string

boolean

array

A policy that specifies whether to crawl the entire dataset again, or to crawl only folders that were added since the last crawler run.. See Recrawl Policy below.

recrawlBehavior

string

role

string

s3Target

array

List of nested Amazon S3 target arguments. See S3 Target below.