Crawler is the Schema for the Crawlers API. Manages a Glue Crawler.
Type: CRD
Group: glue.aws.upbound.io
Version: v1beta1
apiVersion: glue.aws.upbound.io/v1beta1
kind: Crawler
CrawlerSpec defines the desired state of Crawler
No description provided.
List of nested AWS Glue Data Catalog target arguments. See Catalog Target below.
Reference to a CatalogDatabase in glue to populate databaseName.
Policies for referencing.
Selector for a CatalogDatabase in glue to populate databaseName.
Policies for selection.
A list of catalog tables to be synchronized.
List of custom classifiers. By default, all AWS classifiers are included in a crawl, but these custom classifiers always override the default classifiers for a given classification.
Reference to a CatalogDatabase in glue to populate databaseName.
Policies for referencing.
Selector for a CatalogDatabase in glue to populate databaseName.
Policies for selection.
List of nested Delta Lake target arguments. See Delta Target below.
A list of the Amazon S3 paths to the Delta tables.
List of nested Hudi target arguments. See Hudi Target below.
A list of glob patterns used to exclude from the crawl.
One or more Amazon S3 paths that contain Hudi metadata folders, in the form s3://bucket/prefix.
List of nested Iceberg target arguments. See Iceberg Target below.
A list of glob patterns used to exclude from the crawl.
One or more Amazon S3 paths that contain Iceberg metadata folders, in the form s3://bucket/prefix.
List of nested JDBC target arguments. See JDBC Target below.
Reference to a Connection in glue to populate connectionName.
Policies for referencing.
Selector for a Connection in glue to populate connectionName.
Policies for selection.
Specify a value of RAWTYPES or COMMENTS to enable additional metadata in table responses. RAWTYPES provides the native-level datatype. COMMENTS provides comments associated with a column or table in the database.
A list of glob patterns used to exclude from the crawl.
Specifies Lake Formation configuration settings for the crawler. See Lake Formation Configuration below.
Specifies data lineage configuration settings for the crawler. See Lineage Configuration below.
List of nested MongoDB target arguments. See MongoDB Target below.
Reference to a Connection in glue to populate connectionName.
Policies for referencing.
Selector for a Connection in glue to populate connectionName.
Policies for selection.
A policy that specifies whether to crawl the entire dataset again, or to crawl only folders that were added since the last crawler run. See Recrawl Policy below.
Reference to a Role in iam to populate role.
Policies for referencing.
Selector for a Role in iam to populate role.
Policies for selection.
List of nested Amazon S3 target arguments. See S3 Target below.
A list of glob patterns used to exclude from the crawl.
Policy for the crawler's update and deletion behavior. See Schema Change Policy below.
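As a rough illustration of how several of the forProvider fields described above might fit together, the fragment below sketches a crawler with one S3 target, a recrawl policy, and a schema change policy. The field names (s3Target, exclusions, recrawlPolicy, recrawlBehavior, schemaChangePolicy, deleteBehavior, updateBehavior) are assumptions based on the provider's usual camelCase rendering of the underlying Terraform arguments, and the bucket path and glob pattern are placeholders. Verify them against the installed CRD before relying on this.

```yaml
# Sketch only: field names are assumed, values are placeholders.
spec:
  forProvider:
    region: us-west-1
    databaseNameSelector:
      matchLabels:
        testing.upbound.io/example-name: example
    roleSelector:
      matchLabels:
        testing.upbound.io/example-name: example
    s3Target:
      - path: s3://example-bucket/raw/   # hypothetical bucket and prefix
        exclusions:
          - "**.tmp"                     # glob pattern excluded from the crawl
    recrawlPolicy:
      - recrawlBehavior: CRAWL_EVERYTHING   # re-crawl the entire dataset
    schemaChangePolicy:
      - deleteBehavior: LOG
        updateBehavior: UPDATE_IN_DATABASE
```

Nested blocks are written as single-element lists here because that is how this API version appears to model them; a different version of the CRD may expect plain objects instead.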
THIS IS A BETA FIELD. It will be honored unless the Management Policies feature flag is disabled. InitProvider holds the same fields as ForProvider, with the exception of Identifier and other resource reference fields. The fields that are in InitProvider are merged into ForProvider when the resource is created. The same fields are also added to the terraform ignore_changes hook, to avoid updating them after creation. This is useful for fields that are required on creation but should not be updated afterwards, for example because an external controller, such as an autoscaler, is managing them.
List of nested AWS Glue Data Catalog target arguments. See Catalog Target below.
Reference to a CatalogDatabase in glue to populate databaseName.
Policies for referencing.
Selector for a CatalogDatabase in glue to populate databaseName.
Policies for selection.
A list of catalog tables to be synchronized.
List of custom classifiers. By default, all AWS classifiers are included in a crawl, but these custom classifiers always override the default classifiers for a given classification.
Reference to a CatalogDatabase in glue to populate databaseName.
Policies for referencing.
Selector for a CatalogDatabase in glue to populate databaseName.
Policies for selection.
List of nested Delta Lake target arguments. See Delta Target below.
A list of the Amazon S3 paths to the Delta tables.
List of nested Hudi target arguments. See Hudi Target below.
A list of glob patterns used to exclude from the crawl.
One or more Amazon S3 paths that contain Hudi metadata folders, in the form s3://bucket/prefix.
List of nested Iceberg target arguments. See Iceberg Target below.
A list of glob patterns used to exclude from the crawl.
One or more Amazon S3 paths that contain Iceberg metadata folders, in the form s3://bucket/prefix.
List of nested JDBC target arguments. See JDBC Target below.
Reference to a Connection in glue to populate connectionName.
Policies for referencing.
Selector for a Connection in glue to populate connectionName.
Policies for selection.
Specify a value of RAWTYPES or COMMENTS to enable additional metadata in table responses. RAWTYPES provides the native-level datatype. COMMENTS provides comments associated with a column or table in the database.
A list of glob patterns used to exclude from the crawl.
Specifies Lake Formation configuration settings for the crawler. See Lake Formation Configuration below.
Specifies data lineage configuration settings for the crawler. See Lineage Configuration below.
List of nested MongoDB target arguments. See MongoDB Target below.
Reference to a Connection in glue to populate connectionName.
Policies for referencing.
Selector for a Connection in glue to populate connectionName.
Policies for selection.
A policy that specifies whether to crawl the entire dataset again, or to crawl only folders that were added since the last crawler run. See Recrawl Policy below.
Reference to a Role in iam to populate role.
Policies for referencing.
Selector for a Role in iam to populate role.
Policies for selection.
List of nested Amazon S3 target arguments. See S3 Target below.
A list of glob patterns used to exclude from the crawl.
Policy for the crawler's update and deletion behavior. See Schema Change Policy below.
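To make the InitProvider behaviour more concrete, the fragment below sketches a value that is supplied only at creation time. The field name classifiers and the classifier name are assumptions for illustration; the intent is that the list is applied when the crawler is created and is then left alone on later reconciles, so that something else can manage it.

```yaml
# Sketch only: "classifiers" and its value are assumed for illustration.
spec:
  forProvider:
    region: us-west-1
  initProvider:
    # Merged into forProvider at creation, then ignored on later updates.
    classifiers:
      - example-classifier   # hypothetical custom classifier name
```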
THIS IS A BETA FIELD. It is on by default but can be opted out through a Crossplane feature flag. ManagementPolicies specify the array of actions Crossplane is allowed to take on the managed and external resources. This field is planned to replace the DeletionPolicy field in a future release. Currently, both could be set independently and non-default values would be honored if the feature flag is enabled. If both are custom, the DeletionPolicy field will be ignored. See the design doc for more information: https://github.com/crossplane/crossplane/blob/499895a25d1a1a0ba1604944ef98ac7a1a71f197/design/design-doc-observe-only-resources.md?plain=1#L223 and this one: https://github.com/crossplane/crossplane/blob/444267e84783136daa93568b364a5f01228cacbe/design/one-pager-ignore-changes.md
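As a minimal sketch of how ManagementPolicies might be used, the manifest below asks Crossplane only to observe a crawler rather than create, update, or delete it. The resource name is hypothetical, and the sketch assumes that a crawler with a matching external name already exists in the given region.

```yaml
# Sketch only: observe an existing crawler without managing it.
apiVersion: glue.aws.upbound.io/v1beta1
kind: Crawler
metadata:
  name: example-observed   # hypothetical; assumed to match an existing crawler
spec:
  managementPolicies:
    - Observe              # no Create, Update, or Delete actions are allowed
  forProvider:
    region: us-west-1
```

Leaving managementPolicies unset keeps the default behaviour, in which Crossplane is allowed to take every action on the resource.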
ProviderConfigReference specifies how the provider that will be used to create, observe, update, and delete this managed resource should be configured.
Policies for referencing.
PublishConnectionDetailsTo specifies the connection secret config which contains a name, metadata and a reference to secret store config to which any connection details for this managed resource should be written. Connection details frequently include the endpoint, username, and password required to connect to the managed resource.
WriteConnectionSecretToReference specifies the namespace and name of a Secret to which any connection details for this managed resource should be written. Connection details frequently include the endpoint, username, and password required to connect to the managed resource. This field is planned to be replaced in a future release in favor of PublishConnectionDetailsTo. Currently, both could be set independently and connection details would be published to both without affecting each other.
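The provider configuration and connection secret fields described above could be set roughly as follows. The names default, example-crawler-conn, and crossplane-system are placeholders, and whether a crawler actually publishes any connection details is not stated in this reference, so the secret may end up empty.

```yaml
# Sketch only: names and namespace are placeholders.
spec:
  providerConfigRef:
    name: default                   # ProviderConfig used to reach AWS
  writeConnectionSecretToRef:
    name: example-crawler-conn      # hypothetical Secret name
    namespace: crossplane-system    # hypothetical namespace
  forProvider:
    region: us-west-1
```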
CrawlerStatus defines the observed state of Crawler.
No description provided.
List of nested AWS Glue Data Catalog target arguments. See Catalog Target below.
A list of catalog tables to be synchronized.
List of custom classifiers. By default, all AWS classifiers are included in a crawl, but these custom classifiers always override the default classifiers for a given classification.
List of nested Delta Lake target arguments. See Delta Target below.
A list of the Amazon S3 paths to the Delta tables.
List of nested Hudi target arguments. See Hudi Target below.
A list of glob patterns used to exclude from the crawl.
One or more Amazon S3 paths that contain Hudi metadata folders, in the form s3://bucket/prefix.
List of nested Iceberg target arguments. See Iceberg Target below.
A list of glob patterns used to exclude from the crawl.
One or more Amazon S3 paths that contain Iceberg metadata folders, in the form s3://bucket/prefix.
List of nested JDBC target arguments. See JDBC Target below.
Specify a value of RAWTYPES or COMMENTS to enable additional metadata in table responses. RAWTYPES provides the native-level datatype. COMMENTS provides comments associated with a column or table in the database.
A list of glob patterns used to exclude from the crawl.
Specifies Lake Formation configuration settings for the crawler. See Lake Formation Configuration below.
Specifies data lineage configuration settings for the crawler. See Lineage Configuration below.
List of nested MongoDB target arguments. See MongoDB Target below.
A policy that specifies whether to crawl the entire dataset again, or to crawl only folders that were added since the last crawler run. See Recrawl Policy below.
List of nested Amazon S3 target arguments. See S3 Target below.
A list of glob patterns used to exclude from the crawl.
Policy for the crawler's update and deletion behavior. See Schema Change Policy below.
Conditions of the resource.
example
apiVersion: glue.aws.upbound.io/v1beta1
kind: Crawler
metadata:
  annotations:
    meta.upbound.io/example-id: glue/v1beta1/crawler
  labels:
    testing.upbound.io/example-name: example
  name: example
spec:
  forProvider:
    databaseNameSelector:
      matchLabels:
        testing.upbound.io/example-name: example
    dynamodbTarget:
    - path: table-name
    region: us-west-1
    roleSelector:
      matchLabels:
        testing.upbound.io/example-name: example
© 2022 Upbound, Inc.