Marketplace
BrowsePublish
Marketplace
You are viewing an outdated version of provider-aws.Go to Latest
upbound/provider-aws@v1.3.1
Crawler
glue.aws.upbound.io
Crawler
upbound/provider-aws@v1.3.1glue.aws.upbound.io

Crawler is the Schema for the Crawlers API. Manages a Glue Crawler

Type

CRD

Group

glue.aws.upbound.io

Version

v1beta1

apiVersion: glue.aws.upbound.io/v1beta1

kind: Crawler

API Documentation
apiVersion
string
kind
string
metadata
object
spec
object
object

CrawlerSpec defines the desired state of Crawler

forProvider
requiredobject
requiredobject

No description provided.

array

List of nested AWS Glue Data Catalog target arguments. See Catalog Target below.

object

Reference to a CatalogDatabase in glue to populate databaseName.

name
requiredstring
policy
object
object

Policies for referencing.

resolve
string
object

Selector for a CatalogDatabase in glue to populate databaseName.

policy
object
object

Policies for selection.

resolve
string
tables
array
array

A list of catalog tables to be synchronized.

array

List of custom classifiers. By default, all AWS classifiers are included in a crawl, but these custom classifiers always override the default classifiers for a given classification.

object

Reference to a CatalogDatabase in glue to populate databaseName.

name
requiredstring
policy
object
object

Policies for referencing.

resolve
string
object

Selector for a CatalogDatabase in glue to populate databaseName.

policy
object
object

Policies for selection.

resolve
string
array

List of nested Delta Lake target arguments. See Delta Target below.

array

A list of the Amazon S3 paths to the Delta tables.

array

List of nested DynamoDB target arguments. See Dynamodb Target below.

path
string
scanAll
boolean
scanRate
number
array

List of nested Hudi target arguments. See Iceberg Target below.

array

A list of glob patterns used to exclude from the crawl.

paths
array
array

One or more Amazon S3 paths that contains Hudi metadata folders as s3://bucket/prefix.

array

List of nested Iceberg target arguments. See Iceberg Target below.

array

A list of glob patterns used to exclude from the crawl.

paths
array
array

One or more Amazon S3 paths that contains Hudi metadata folders as s3://bucket/prefix.

array

List of nested JDBC target arguments. See JDBC Target below.

object

Reference to a Connection in glue to populate connectionName.

name
requiredstring
policy
object
object

Policies for referencing.

resolve
string
object

Selector for a Connection in glue to populate connectionName.

policy
object
object

Policies for selection.

resolve
string
array

Specify a value of RAWTYPES or COMMENTS to enable additional metadata intable responses. RAWTYPES provides the native-level datatype. COMMENTS provides comments associated with a column or table in the database.

array

A list of glob patterns used to exclude from the crawl.

path
string
array

Specifies Lake Formation configuration settings for the crawler. See Lake Formation Configuration below.

array

Specifies data lineage configuration settings for the crawler. See Lineage Configuration below.

array

List of nested MongoDB target arguments. See MongoDB Target below.

object

Reference to a Connection in glue to populate connectionName.

name
requiredstring
policy
object
object

Policies for referencing.

resolve
string
object

Selector for a Connection in glue to populate connectionName.

policy
object
object

Policies for selection.

resolve
string
path
string
scanAll
boolean
array

A policy that specifies whether to crawl the entire dataset again, or to crawl only folders that were added since the last crawler run.. See Recrawl Policy below.

region
requiredstring
role
string
roleRef
object
object

Reference to a Role in iam to populate role.

name
requiredstring
policy
object
object

Policies for referencing.

resolve
string
object

Selector for a Role in iam to populate role.

policy
object
object

Policies for selection.

resolve
string
array

List of nested Amazon S3 target arguments. See S3 Target below.

array

A list of glob patterns used to exclude from the crawl.

path
string
schedule
string
array

Policy for the crawler's update and deletion behavior. See Schema Change Policy below.

tags
object
object

THIS IS A BETA FIELD. It will be honored unless the Management Policies feature flag is disabled. InitProvider holds the same fields as ForProvider, with the exception of Identifier and other resource reference fields. The fields that are in InitProvider are merged into ForProvider when the resource is created. The same fields are also added to the terraform ignore_changes hook, to avoid updating them after creation. This is useful for fields that are required on creation, but we do not desire to update them after creation, for example because of an external controller is managing them, like an autoscaler.

array

List of nested AWS Glue Data Catalog target arguments. See Catalog Target below.

object

Reference to a CatalogDatabase in glue to populate databaseName.

name
requiredstring
policy
object
object

Policies for referencing.

resolve
string
object

Selector for a CatalogDatabase in glue to populate databaseName.

policy
object
object

Policies for selection.

resolve
string
tables
array
array

A list of catalog tables to be synchronized.

array

List of custom classifiers. By default, all AWS classifiers are included in a crawl, but these custom classifiers always override the default classifiers for a given classification.

object

Reference to a CatalogDatabase in glue to populate databaseName.

name
requiredstring
policy
object
object

Policies for referencing.

resolve
string
object

Selector for a CatalogDatabase in glue to populate databaseName.

policy
object
object

Policies for selection.

resolve
string
array

List of nested Delta Lake target arguments. See Delta Target below.

array

A list of the Amazon S3 paths to the Delta tables.

array

List of nested DynamoDB target arguments. See Dynamodb Target below.

path
string
scanAll
boolean
scanRate
number
array

List of nested Hudi target arguments. See Iceberg Target below.

array

A list of glob patterns used to exclude from the crawl.

paths
array
array

One or more Amazon S3 paths that contains Hudi metadata folders as s3://bucket/prefix.

array

List of nested Iceberg target arguments. See Iceberg Target below.

array

A list of glob patterns used to exclude from the crawl.

paths
array
array

One or more Amazon S3 paths that contains Hudi metadata folders as s3://bucket/prefix.

array

List of nested JDBC target arguments. See JDBC Target below.

object

Reference to a Connection in glue to populate connectionName.

name
requiredstring
policy
object
object

Policies for referencing.

resolve
string
object

Selector for a Connection in glue to populate connectionName.

policy
object
object

Policies for selection.

resolve
string
array

Specify a value of RAWTYPES or COMMENTS to enable additional metadata intable responses. RAWTYPES provides the native-level datatype. COMMENTS provides comments associated with a column or table in the database.

array

A list of glob patterns used to exclude from the crawl.

path
string
array

Specifies Lake Formation configuration settings for the crawler. See Lake Formation Configuration below.

array

Specifies data lineage configuration settings for the crawler. See Lineage Configuration below.

array

List of nested MongoDB target arguments. See MongoDB Target below.

object

Reference to a Connection in glue to populate connectionName.

name
requiredstring
policy
object
object

Policies for referencing.

resolve
string
object

Selector for a Connection in glue to populate connectionName.

policy
object
object

Policies for selection.

resolve
string
path
string
scanAll
boolean
array

A policy that specifies whether to crawl the entire dataset again, or to crawl only folders that were added since the last crawler run.. See Recrawl Policy below.

role
string
roleRef
object
object

Reference to a Role in iam to populate role.

name
requiredstring
policy
object
object

Policies for referencing.

resolve
string
object

Selector for a Role in iam to populate role.

policy
object
object

Policies for selection.

resolve
string
array

List of nested Amazon S3 target arguments. See S3 Target below.

array

A list of glob patterns used to exclude from the crawl.

path
string
schedule
string
array

Policy for the crawler's update and deletion behavior. See Schema Change Policy below.

tags
object
array

THIS IS A BETA FIELD. It is on by default but can be opted out through a Crossplane feature flag. ManagementPolicies specify the array of actions Crossplane is allowed to take on the managed and external resources. This field is planned to replace the DeletionPolicy field in a future release. Currently, both could be set independently and non-default values would be honored if the feature flag is enabled. If both are custom, the DeletionPolicy field will be ignored. See the design doc for more information: https://github.com/crossplane/crossplane/blob/499895a25d1a1a0ba1604944ef98ac7a1a71f197/design/design-doc-observe-only-resources.md?plain=1#L223 and this one: https://github.com/crossplane/crossplane/blob/444267e84783136daa93568b364a5f01228cacbe/design/one-pager-ignore-changes.md

object

ProviderConfigReference specifies how the provider that will be used to create, observe, update, and delete this managed resource should be configured.

name
requiredstring
policy
object
object

Policies for referencing.

resolve
string
object

PublishConnectionDetailsTo specifies the connection secret config which contains a name, metadata and a reference to secret store config to which any connection details for this managed resource should be written. Connection details frequently include the endpoint, username, and password required to connect to the managed resource.

configRef
object
object

SecretStoreConfigRef specifies which secret store config should be used for this ConnectionSecret.

name
requiredstring
policy
object
object

Policies for referencing.

resolve
string
metadata
object
object

Metadata is the metadata for connection secret.

labels
object
type
string
name
requiredstring
object

WriteConnectionSecretToReference specifies the namespace and name of a Secret to which any connection details for this managed resource should be written. Connection details frequently include the endpoint, username, and password required to connect to the managed resource. This field is planned to be replaced in a future release in favor of PublishConnectionDetailsTo. Currently, both could be set independently and connection details would be published to both without affecting each other.

name
requiredstring
namespace
requiredstring
status
object
object

CrawlerStatus defines the observed state of Crawler.

object

No description provided.

arn
string
array

List of nested AWS Glue Data Catalog target arguments. See Catalog Target below.

tables
array
array

A list of catalog tables to be synchronized.

array

List of custom classifiers. By default, all AWS classifiers are included in a crawl, but these custom classifiers always override the default classifiers for a given classification.

array

List of nested Delta Lake target arguments. See Delta Target below.

array

A list of the Amazon S3 paths to the Delta tables.

array

List of nested DynamoDB target arguments. See Dynamodb Target below.

path
string
scanAll
boolean
scanRate
number
array

List of nested Hudi target arguments. See Iceberg Target below.

array

A list of glob patterns used to exclude from the crawl.

paths
array
array

One or more Amazon S3 paths that contains Hudi metadata folders as s3://bucket/prefix.

array

List of nested Iceberg target arguments. See Iceberg Target below.

array

A list of glob patterns used to exclude from the crawl.

paths
array
array

One or more Amazon S3 paths that contains Hudi metadata folders as s3://bucket/prefix.

id
string
array

List of nested JDBC target arguments. See JDBC Target below.

array

Specify a value of RAWTYPES or COMMENTS to enable additional metadata intable responses. RAWTYPES provides the native-level datatype. COMMENTS provides comments associated with a column or table in the database.

array

A list of glob patterns used to exclude from the crawl.

path
string
array

Specifies Lake Formation configuration settings for the crawler. See Lake Formation Configuration below.

array

Specifies data lineage configuration settings for the crawler. See Lineage Configuration below.

array

List of nested MongoDB target arguments. See MongoDB Target below.

path
string
scanAll
boolean
array

A policy that specifies whether to crawl the entire dataset again, or to crawl only folders that were added since the last crawler run.. See Recrawl Policy below.

role
string
array

List of nested Amazon S3 target arguments. See S3 Target below.

array

A list of glob patterns used to exclude from the crawl.

path
string
schedule
string
array

Policy for the crawler's update and deletion behavior. See Schema Change Policy below.

tags
object
tagsAll
object
array

Conditions of the resource.

lastTransitionTime
requiredstring
message
string
reason
requiredstring
status
requiredstring
type
requiredstring
Marketplace

Discover the building blocks for your internal cloud platform.

© 2022 Upbound, Inc.

SolutionsProvidersConfigurations
LearnDocumentationTry for Free
MorePrivacy PolicyTerms & Conditions
Marketplace

© 2022 Upbound, Inc.

Marketplace

Discover the building blocksfor your internal cloud platform.