Marketplace
BrowsePublish
Marketplace
You are viewing an outdated version of provider-aws.Go to Latest
crossplane-contrib/provider-aws@v0.38.0
Crawler
glue.aws.crossplane.io
Crawler
crossplane-contrib/provider-aws@v0.38.0glue.aws.crossplane.io

Crawler is the Schema for the Crawlers API

Type

CRD

Group

glue.aws.crossplane.io

Version

v1alpha1

apiVersion: glue.aws.crossplane.io/v1alpha1

kind: Crawler

API Documentation
apiVersion
string
kind
string
metadata
object
spec
object
object

CrawlerSpec defines the desired state of Crawler

forProvider
requiredobject
requiredobject

CrawlerParameters defines the desired state of Crawler

array

ClassifierRefs is a list of references to Classifiers used to set the Classifiers.

name
requiredstring
policy
object
object

Policies for referencing.

resolve
string
object

ClassifiersSelector selects references to Classifiers used to set the Classifiers.

policy
object
object

Policies for selection.

resolve
string
array

A list of custom classifiers that the user has registered. By default, all built-in classifiers are included in a crawl, but these custom classifiers always override the default classifiers for a given classification.

object

CrawlerSecurityConfigurationRef is a reference to an SecurityConfiguration used to set the CrawlerSecurityConfiguration.

name
requiredstring
policy
object
object

Policies for referencing.

resolve
string
object

CrawlerSecurityConfigurationSelector selects references to SecurityConfiguration used to set the CrawlerSecurityConfiguration.

policy
object
object

Policies for selection.

resolve
string
object

DatabaseNameRef is a reference to an Database used to set the DatabaseName.

name
requiredstring
policy
object
object

Policies for referencing.

resolve
string
object

DatabaseNamesSelector selects references to Database used to set the DatabaseName.

policy
object
object

Policies for selection.

resolve
string
object

Specifies Lake Formation configuration settings for the crawler.

object

Specifies data lineage configuration settings for the crawler.

object

A policy that specifies whether to crawl the entire dataset again, or to crawl only folders that were added since the last crawler run.

region
requiredstring
role
string
roleRef
object
object

RoleRef is a reference to an IAMRole used to set the Role.

name
requiredstring
policy
object
object

Policies for referencing.

resolve
string
object

RoleSelector selects references to IAMRole used to set the Role.

policy
object
object

Policies for selection.

resolve
string
schedule
string
object

The policy for the crawler's update and deletion behavior.

tags
object
targets
requiredobject
requiredobject

A list of collection of targets to crawl. Targets is a required field

array

Specifies Glue Data Catalog targets.

object

DatabaseNameRef is a reference to an Database used to set the DatabaseName.

name
requiredstring
policy
object
object

Policies for referencing.

resolve
string
object

DatabaseNamesSelector selects references to Database used to set the DatabaseName.

policy
object
object

Policies for selection.

resolve
string
tables
requiredarray
requiredarray

A list of the tables to be synchronized. Tables is a required field

array

Specifies Amazon DynamoDB targets.

path
string
scanAll
boolean
scanRate
number
array

Specifies JDBC targets.

object

ConnectionNameRef is a reference to an Connection used to set the ConnectionName.

name
requiredstring
policy
object
object

Policies for referencing.

resolve
string
object

ConnectionNamesSelector selects references to Connection used to set the ConnectionName.

policy
object
object

Policies for selection.

resolve
string
array

A list of glob patterns used to exclude from the crawl. For more information, see Catalog Tables with a Crawler (https://docs.aws.amazon.com/glue/latest/dg/add-crawler.html).

path
string
array

Specifies Amazon DocumentDB or MongoDB targets.

object

ConnectionNameRef is a reference to an Connection used to set the ConnectionName.

name
requiredstring
policy
object
object

Policies for referencing.

resolve
string
object

ConnectionNamesSelector selects references to Connection used to set the ConnectionName.

policy
object
object

Policies for selection.

resolve
string
path
string
scanAll
boolean
array

Specifies Amazon Simple Storage Service (Amazon S3) targets.

object

ConnectionNameRef is a reference to an Connection used to set the ConnectionName.

name
requiredstring
policy
object
object

Policies for referencing.

resolve
string
object

ConnectionNamesSelector selects references to Connection used to set the ConnectionName.

policy
object
object

Policies for selection.

resolve
string
object

DlqEventQueueARNRef is a reference to an SQSEventQueue used to set the DlqEventQueueARN.

name
requiredstring
policy
object
object

Policies for referencing.

resolve
string
object

DlqEventQueueARNSelector selects references to SQSEventQueue used to set the DlqEventQueueARN.

policy
object
object

Policies for selection.

resolve
string
object

EventQueueARNRef is a reference to an SQSEventQueue used to set the EventQueueARN.

name
requiredstring
policy
object
object

Policies for referencing.

resolve
string
object

EventQueueARNSelector selects references to SQSEventQueue used to set the EventQueueARN.

policy
object
object

Policies for selection.

resolve
string
array

A list of glob patterns used to exclude from the crawl. For more information, see Catalog Tables with a Crawler (https://docs.aws.amazon.com/glue/latest/dg/add-crawler.html).

path
string
sampleSize
integer
object

ProviderConfigReference specifies how the provider that will be used to create, observe, update, and delete this managed resource should be configured.

name
requiredstring
policy
object
object

Policies for referencing.

resolve
string
object

ProviderReference specifies the provider that will be used to create, observe, update, and delete this managed resource. Deprecated: Please use ProviderConfigReference, i.e. providerConfigRef

name
requiredstring
policy
object
object

Policies for referencing.

resolve
string
object

PublishConnectionDetailsTo specifies the connection secret config which contains a name, metadata and a reference to secret store config to which any connection details for this managed resource should be written. Connection details frequently include the endpoint, username, and password required to connect to the managed resource.

configRef
object
object

SecretStoreConfigRef specifies which secret store config should be used for this ConnectionSecret.

name
requiredstring
policy
object
object

Policies for referencing.

resolve
string
metadata
object
object

Metadata is the metadata for connection secret.

labels
object
type
string
name
requiredstring
object

WriteConnectionSecretToReference specifies the namespace and name of a Secret to which any connection details for this managed resource should be written. Connection details frequently include the endpoint, username, and password required to connect to the managed resource. This field is planned to be replaced in a future release in favor of PublishConnectionDetailsTo. Currently, both could be set independently and connection details would be published to both without affecting each other.

name
requiredstring
namespace
requiredstring
status
object
object

CrawlerStatus defines the observed state of Crawler.

object

CrawlerObservation defines the observed state of Crawler

lastCrawl
object
object

The status of the last crawl, and potentially error information if an error occurred.

state
string
version
integer
array

Conditions of the resource.

lastTransitionTime
requiredstring
message
string
reason
requiredstring
status
requiredstring
type
requiredstring
Marketplace

Discover the building blocks for your internal cloud platform.

© 2022 Upbound, Inc.

SolutionsProvidersConfigurations
LearnDocumentationTry for Free
MorePrivacy PolicyTerms & Conditions
Marketplace

© 2022 Upbound, Inc.

Marketplace

Discover the building blocksfor your internal cloud platform.