Crawler is the Schema for the Crawlers API. Manages a Glue Crawler.
Type: CRD
Group: glue.aws.upbound.io
Version: v1beta1
apiVersion: glue.aws.upbound.io/v1beta1
kind: Crawler
CrawlerSpec defines the desired state of Crawler.
No description provided.
No description provided.
Reference to a CatalogDatabase in glue to populate databaseName.
Policies for referencing.
Selector for a CatalogDatabase in glue to populate databaseName.
Policies for selection.
A list of catalog tables to be synchronized.
List of custom classifiers. By default, all AWS classifiers are included in a crawl, but these custom classifiers always override the default classifiers for a given classification.
Reference to a CatalogDatabase in glue to populate databaseName.
Policies for referencing.
Selector for a CatalogDatabase in glue to populate databaseName.
Policies for selection.
No description provided.
A list of the Amazon S3 paths to the Delta tables.
List of nested JDBC target arguments. See JDBC Target below.
Reference to a Connection in glue to populate connectionName.
Policies for referencing.
Selector for a Connection in glue to populate connectionName.
Policies for selection.
A list of glob patterns used to exclude from the crawl.
Specifies data lineage configuration settings for the crawler. See Lineage Configuration below.
List of nested MongoDB target arguments. See MongoDB Target below.
Reference to a Connection in glue to populate connectionName.
Policies for referencing.
Selector for a Connection in glue to populate connectionName.
Policies for selection.
A policy that specifies whether to crawl the entire dataset again, or to crawl only folders that were added since the last crawler run. See Recrawl Policy below.
Reference to a Role in iam to populate role.
Policies for referencing.
Selector for a Role in iam to populate role.
Policies for selection.
List of nested Amazon S3 target arguments. See S3 Target below.
A list of glob patterns used to exclude from the crawl.
Policy for the crawler's update and deletion behavior. See Schema Change Policy below.
ProviderConfigReference specifies how the provider that will be used to create, observe, update, and delete this managed resource should be configured.
Policies for referencing.
ProviderReference specifies the provider that will be used to create, observe, update, and delete this managed resource. Deprecated: use ProviderConfigReference (providerConfigRef) instead.
Policies for referencing.
PublishConnectionDetailsTo specifies the connection secret config which contains a name, metadata and a reference to secret store config to which any connection details for this managed resource should be written. Connection details frequently include the endpoint, username, and password required to connect to the managed resource.
WriteConnectionSecretToReference specifies the namespace and name of a Secret to which any connection details for this managed resource should be written. Connection details frequently include the endpoint, username, and password required to connect to the managed resource. This field is planned to be replaced in a future release in favor of PublishConnectionDetailsTo. Currently, both could be set independently and connection details would be published to both without affecting each other.
CrawlerStatus defines the observed state of Crawler.
Conditions of the resource.
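The reference and selector fields described above follow the usual Crossplane pattern: a Ref field names one object directly, while a Selector field matches objects by label. The fragment below is a minimal sketch of the reference form, not an excerpt from the provider documentation. The names example-db, glue-role, and default are hypothetical, and the field names should be checked against the installed CRD schema.

```yaml
spec:
  forProvider:
    region: us-west-1
    # Hypothetical name: resolves databaseName from a CatalogDatabase object.
    databaseNameRef:
      name: example-db
      policy:
        resolution: Required   # treat an unresolvable reference as an error
        resolve: IfNotPresent  # resolve only while databaseName is unset
    # Hypothetical name: resolves role from an IAM Role object.
    roleRef:
      name: glue-role
    dynamodbTarget:
    - path: table-name
  # Hypothetical ProviderConfig name.
  providerConfigRef:
    name: default
```

A selector is the looser choice when the referenced object's name is not known in advance; a reference is easier to audit because the dependency is explicit.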
example
apiVersion: glue.aws.upbound.io/v1beta1
kind: Crawler
metadata:
  annotations:
    meta.upbound.io/example-id: glue/v1beta1/crawler
  labels:
    testing.upbound.io/example-name: example
  name: example
spec:
  forProvider:
    databaseNameSelector:
      matchLabels:
        testing.upbound.io/example-name: example
    dynamodbTarget:
    - path: table-name
    region: us-west-1
    roleSelector:
      matchLabels:
        testing.upbound.io/example-name: glue-role
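For comparison, a crawler that targets Amazon S3 and sets the recrawl and schema change policies might look like the sketch below. This is an illustration under stated assumptions rather than an official example: the bucket path and the name example-s3 are hypothetical, and the nested field names (s3Target, exclusions, recrawlPolicy, schemaChangePolicy) are assumed to mirror the corresponding Glue crawler arguments in camelCase, so verify them against the CRD schema before use.

```yaml
apiVersion: glue.aws.upbound.io/v1beta1
kind: Crawler
metadata:
  name: example-s3               # hypothetical name
spec:
  forProvider:
    region: us-west-1
    databaseNameSelector:
      matchLabels:
        testing.upbound.io/example-name: example
    roleSelector:
      matchLabels:
        testing.upbound.io/example-name: glue-role
    s3Target:
    - path: s3://example-bucket/data   # hypothetical bucket
      exclusions:
      - "**.tmp"                       # glob pattern excluded from the crawl
    recrawlPolicy:
    - recrawlBehavior: CRAWL_EVERYTHING
    schemaChangePolicy:
    - deleteBehavior: LOG
      updateBehavior: UPDATE_IN_DATABASE
```

CRAWL_EVERYTHING is used here because it places no constraints on the schema change policy; crawling only new folders restricts which update and delete behaviors are accepted.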
© 2022 Upbound, Inc.