FinSpace Public API

2021/11/22 - FinSpace Public API - 11 new, 1 updated API methods

Changes: Adds new APIs for managing Datasets, Changesets, and Dataviews.

CreateDataset (new)

Creates a new FinSpace Dataset.

See also: AWS API Documentation

Request Syntax

client.create_dataset(
    clientToken='string',
    datasetTitle='string',
    kind='TABULAR'|'NON_TABULAR',
    datasetDescription='string',
    ownerInfo={
        'name': 'string',
        'phoneNumber': 'string',
        'email': 'string'
    },
    permissionGroupParams={
        'permissionGroupId': 'string',
        'datasetPermissions': [
            {
                'permission': 'string'
            },
        ]
    },
    alias='string',
    schemaDefinition={
        'tabularSchemaConfig': {
            'columns': [
                {
                    'dataType': 'STRING'|'CHAR'|'INTEGER'|'TINYINT'|'SMALLINT'|'BIGINT'|'FLOAT'|'DOUBLE'|'DATE'|'DATETIME'|'BOOLEAN'|'BINARY',
                    'columnName': 'string',
                    'columnDescription': 'string'
                },
            ],
            'primaryKeyColumns': [
                'string',
            ]
        }
    }
)
type clientToken

string

param clientToken

A token used to ensure idempotency.

This field is autopopulated if not provided.

type datasetTitle

string

param datasetTitle

[REQUIRED]

Display title for a FinSpace Dataset.

type kind

string

param kind

[REQUIRED]

The format in which Dataset data is structured.

  • TABULAR - Data is structured in a tabular format.

  • NON_TABULAR - Data is structured in a non-tabular format.

type datasetDescription

string

param datasetDescription

[REQUIRED]

Description of a Dataset.

type ownerInfo

dict

param ownerInfo

Contact information for a Dataset owner.

  • name (string) --

    Name of the Dataset owner.

  • phoneNumber (string) --

    Phone number for the Dataset owner.

  • email (string) --

    Email address for the Dataset owner.

type permissionGroupParams

dict

param permissionGroupParams

[REQUIRED]

Permission group parameters for Dataset permissions.

  • permissionGroupId (string) --

    The unique identifier of the PermissionGroup.

  • datasetPermissions (list) --

    List of resource permissions.

    • (dict) --

      Resource permission for a Dataset.

      • permission (string) --

        Permission for a resource.

type alias

string

param alias

[REQUIRED]

The unique resource identifier for a Dataset.

type schemaDefinition

dict

param schemaDefinition

Definition for a schema on a tabular Dataset.

  • tabularSchemaConfig (dict) --

    The configuration for a schema on a tabular Dataset.

    • columns (list) --

      List of column definitions.

      • (dict) --

        The definition of a column in a tabular Dataset.

        • dataType (string) --

          Data type of a column.

          • STRING - A String data type.

          • CHAR - A char data type.

          • INTEGER - An integer data type.

          • TINYINT - A tinyint data type.

          • SMALLINT - A smallint data type.

          • BIGINT - A bigint data type.

          • FLOAT - A float data type.

          • DOUBLE - A double data type.

          • DATE - A date data type.

          • DATETIME - A datetime data type.

          • BOOLEAN - A boolean data type.

          • BINARY - A binary data type.

        • columnName (string) --

          Name for a column.

        • columnDescription (string) --

          Description for a column.

    • primaryKeyColumns (list) --

      List of column names used for primary key.

      • (string) -- Column Name

rtype

dict

returns

Response Syntax

{
    'datasetId': 'string'
}

Response Structure

  • (dict) -- The response from a CreateDataset operation

    • datasetId (string) --

      The unique identifier for the created Dataset.
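
As a rough illustration of the request shape above, the following sketch assembles keyword arguments for create_dataset for a small tabular Dataset. The helper name build_create_dataset_request, the sample columns, the placeholder permission group ID, and the 'ViewDatasetDetails' permission string are assumptions made for the example; this page does not list valid permission values. The client call is left in a comment so the snippet runs without AWS credentials.

```python
import uuid


def build_create_dataset_request(title, alias, description, permission_group_id,
                                 permissions, columns, primary_keys):
    """Assemble kwargs for client.create_dataset (hypothetical helper)."""
    return {
        # Optional: supplying our own token lets a retry reuse the same value.
        "clientToken": str(uuid.uuid4()),
        "datasetTitle": title,
        "kind": "TABULAR",
        "datasetDescription": description,
        "alias": alias,
        "permissionGroupParams": {
            "permissionGroupId": permission_group_id,
            "datasetPermissions": [{"permission": p} for p in permissions],
        },
        "schemaDefinition": {
            "tabularSchemaConfig": {
                "columns": [
                    {"dataType": dtype, "columnName": name, "columnDescription": desc}
                    for name, dtype, desc in columns
                ],
                "primaryKeyColumns": list(primary_keys),
            }
        },
    }


request = build_create_dataset_request(
    title="Daily equity prices",
    alias="daily-equity-prices",
    description="End-of-day prices (sample data).",
    permission_group_id="example-permission-group-id",  # placeholder value
    permissions=["ViewDatasetDetails"],  # assumed value, not listed on this page
    columns=[
        ("ticker", "STRING", "Exchange ticker symbol"),
        ("trade_date", "DATE", "Trading day"),
        ("close", "DOUBLE", "Closing price"),
    ],
    primary_keys=["ticker", "trade_date"],
)

# With credentials configured, the call would look roughly like:
#   import boto3
#   client = boto3.client("finspace-data")
#   dataset_id = client.create_dataset(**request)["datasetId"]
```

Keeping the request as a plain dict makes it easy to inspect or log before it is sent.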

GetDataset (new)

Returns information about a Dataset.

See also: AWS API Documentation

Request Syntax

client.get_dataset(
    datasetId='string'
)
type datasetId

string

param datasetId

[REQUIRED]

The unique identifier for a Dataset.

rtype

dict

returns

Response Syntax

{
    'datasetId': 'string',
    'datasetArn': 'string',
    'datasetTitle': 'string',
    'kind': 'TABULAR'|'NON_TABULAR',
    'datasetDescription': 'string',
    'createTime': 123,
    'lastModifiedTime': 123,
    'schemaDefinition': {
        'tabularSchemaConfig': {
            'columns': [
                {
                    'dataType': 'STRING'|'CHAR'|'INTEGER'|'TINYINT'|'SMALLINT'|'BIGINT'|'FLOAT'|'DOUBLE'|'DATE'|'DATETIME'|'BOOLEAN'|'BINARY',
                    'columnName': 'string',
                    'columnDescription': 'string'
                },
            ],
            'primaryKeyColumns': [
                'string',
            ]
        }
    },
    'alias': 'string',
    'status': 'PENDING'|'FAILED'|'SUCCESS'|'RUNNING'
}

Response Structure

  • (dict) -- Response for the GetDataset operation

    • datasetId (string) --

      The unique identifier for a Dataset.

    • datasetArn (string) --

      The ARN identifier of the Dataset.

    • datasetTitle (string) --

      Display title for a Dataset.

    • kind (string) --

      The format in which Dataset data is structured.

      • TABULAR - Data is structured in a tabular format.

      • NON_TABULAR - Data is structured in a non-tabular format.

    • datasetDescription (string) --

      A description of the Dataset.

    • createTime (integer) --

      The timestamp at which the Dataset was created in FinSpace. The value is determined as Epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

    • lastModifiedTime (integer) --

      The last time that the Dataset was modified. The value is determined as Epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

    • schemaDefinition (dict) --

      Definition for a schema on a tabular Dataset.

      • tabularSchemaConfig (dict) --

        The configuration for a schema on a tabular Dataset.

        • columns (list) --

          List of column definitions.

          • (dict) --

            The definition of a column in a tabular Dataset.

            • dataType (string) --

              Data type of a column.

              • STRING - A String data type.

              • CHAR - A char data type.

              • INTEGER - An integer data type.

              • TINYINT - A tinyint data type.

              • SMALLINT - A smallint data type.

              • BIGINT - A bigint data type.

              • FLOAT - A float data type.

              • DOUBLE - A double data type.

              • DATE - A date data type.

              • DATETIME - A datetime data type.

              • BOOLEAN - A boolean data type.

              • BINARY - A binary data type.

            • columnName (string) --

              Name for a column.

            • columnDescription (string) --

              Description for a column.

        • primaryKeyColumns (list) --

          List of column names used for primary key.

          • (string) -- Column Name

    • alias (string) --

      The unique resource identifier for a Dataset.

    • status (string) --

      Status of the Dataset creation.

      • PENDING - Dataset is pending creation.

      • FAILED - Dataset creation has failed.

      • SUCCESS - Dataset creation has succeeded.

      • RUNNING - Dataset creation is running.
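
Because the status field reports PENDING or RUNNING while creation is in progress, a caller will often poll get_dataset until a terminal status appears. The sketch below shows one way to do that; wait_for_dataset, its polling defaults, and the FakeClient stand-in are assumptions for illustration and are not part of the API.

```python
import time


def wait_for_dataset(client, dataset_id, poll_seconds=5.0, max_polls=60, sleep=time.sleep):
    """Poll get_dataset until status is SUCCESS or FAILED (hypothetical helper)."""
    for _ in range(max_polls):
        response = client.get_dataset(datasetId=dataset_id)
        status = response["status"]
        if status == "SUCCESS":
            return response
        if status == "FAILED":
            raise RuntimeError(f"Dataset {dataset_id} creation failed")
        # PENDING and RUNNING both mean creation is still in progress.
        sleep(poll_seconds)
    raise TimeoutError(f"Dataset {dataset_id} not ready after {max_polls} polls")


class FakeClient:
    """Stand-in for a FinSpace client; replays a fixed sequence of statuses."""

    def __init__(self, statuses):
        self._statuses = iter(statuses)

    def get_dataset(self, datasetId):
        return {"datasetId": datasetId, "status": next(self._statuses)}


result = wait_for_dataset(
    FakeClient(["PENDING", "RUNNING", "SUCCESS"]),
    "ds-123",
    sleep=lambda seconds: None,  # skip real waiting in this demonstration
)
```

Passing the sleep function as a parameter keeps the loop testable without real delays.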

ListChangesets (new)

Lists the FinSpace Changesets for a Dataset.

See also: AWS API Documentation

Request Syntax

client.list_changesets(
    datasetId='string',
    maxResults=123,
    nextToken='string'
)
type datasetId

string

param datasetId

[REQUIRED]

The unique identifier for the FinSpace Dataset to which the Changeset belongs.

type maxResults

integer

param maxResults

The maximum number of results per page.

type nextToken

string

param nextToken

A token indicating where a results page should begin.

rtype

dict

returns

Response Syntax

{
    'changesets': [
        {
            'changesetId': 'string',
            'changesetArn': 'string',
            'datasetId': 'string',
            'changeType': 'REPLACE'|'APPEND'|'MODIFY',
            'sourceParams': {
                'string': 'string'
            },
            'formatParams': {
                'string': 'string'
            },
            'createTime': 123,
            'status': 'PENDING'|'FAILED'|'SUCCESS'|'RUNNING'|'STOP_REQUESTED',
            'errorInfo': {
                'errorMessage': 'string',
                'errorCategory': 'VALIDATION'|'SERVICE_QUOTA_EXCEEDED'|'ACCESS_DENIED'|'RESOURCE_NOT_FOUND'|'THROTTLING'|'INTERNAL_SERVICE_EXCEPTION'|'CANCELLED'|'USER_RECOVERABLE'
            },
            'activeUntilTimestamp': 123,
            'updatesChangesetId': 'string',
            'updatedByChangesetId': 'string'
        },
    ],
    'nextToken': 'string'
}

Response Structure

  • (dict) -- Response for the ListChangesets operation. This returns a list of Dataset Changesets that match the query criteria.

    • changesets (list) --

      List of Changesets found.

      • (dict) --

        A Changeset is a unit of data in a Dataset.

        • changesetId (string) --

          The unique identifier for a Changeset.

        • changesetArn (string) --

          The ARN identifier of the Changeset.

        • datasetId (string) --

          The unique identifier for the FinSpace Dataset in which the Changeset is created.

        • changeType (string) --

          Type that indicates how a Changeset is applied to a Dataset.

          • REPLACE - Changeset is considered as a replacement to all prior loaded Changesets.

          • APPEND - Changeset is considered as an addition to the end of all prior loaded Changesets.

          • MODIFY - Changeset is considered as a replacement to a specific prior ingested Changeset.

        • sourceParams (dict) --

          Options that define the location of the data being ingested.

          • (string) --

            • (string) --

        • formatParams (dict) --

          Options that define the structure of the source file(s).

          • (string) --

            • (string) --

        • createTime (integer) --

          The timestamp at which the Changeset was created in FinSpace. The value is determined as Epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

        • status (string) --

          Status of the Changeset ingestion.

          • PENDING - Changeset is pending creation.

          • FAILED - Changeset creation has failed.

          • SUCCESS - Changeset creation has succeeded.

          • RUNNING - Changeset creation is running.

          • STOP_REQUESTED - User requested Changeset creation to stop.

        • errorInfo (dict) --

          The structure with error messages.

          • errorMessage (string) --

            The text of the error message.

          • errorCategory (string) --

            The category of the error.

            • VALIDATION - The inputs to this request are invalid.

            • SERVICE_QUOTA_EXCEEDED - Service quotas have been exceeded. Please contact AWS support to increase quotas.

            • ACCESS_DENIED - Missing required permission to perform this request.

            • RESOURCE_NOT_FOUND - One or more inputs to this request were not found.

            • THROTTLING - The system temporarily lacks sufficient resources to process the request.

            • INTERNAL_SERVICE_EXCEPTION - An internal service error has occurred.

            • CANCELLED - Cancelled.

            • USER_RECOVERABLE - A user recoverable error has occurred.

        • activeUntilTimestamp (integer) --

          Time until which the Changeset is active. The value is determined as Epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

        • updatesChangesetId (string) --

          The unique identifier of the Changeset that is updated.

        • updatedByChangesetId (string) --

          The unique identifier of the updated Changeset.

    • nextToken (string) --

      A token indicating where a results page should begin.
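
The nextToken field in the response can be fed back into the next request to walk through all pages. The following sketch shows that loop as a generator; iter_changesets and the FakeClient stand-in are hypothetical names used only for this example.

```python
def iter_changesets(client, dataset_id, page_size=50):
    """Yield every Changeset of a Dataset, following nextToken (hypothetical helper)."""
    kwargs = {"datasetId": dataset_id, "maxResults": page_size}
    while True:
        page = client.list_changesets(**kwargs)
        yield from page.get("changesets", [])
        token = page.get("nextToken")
        if not token:
            return  # no further pages
        kwargs["nextToken"] = token


class FakeClient:
    """Stand-in client that serves two pages keyed by nextToken."""

    pages = {
        None: {
            "changesets": [{"changesetId": "c1"}, {"changesetId": "c2"}],
            "nextToken": "t1",
        },
        "t1": {"changesets": [{"changesetId": "c3"}]},
    }

    def list_changesets(self, datasetId, maxResults, nextToken=None):
        return self.pages[nextToken]


changeset_ids = [c["changesetId"] for c in iter_changesets(FakeClient(), "ds-123")]
```

A generator keeps memory use flat when a Dataset has many Changesets, since only one page is held at a time.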

GetChangeset (new)

Gets information about a Changeset.

See also: AWS API Documentation

Request Syntax

client.get_changeset(
    datasetId='string',
    changesetId='string'
)
type datasetId

string

param datasetId

[REQUIRED]

The unique identifier for the FinSpace Dataset where the Changeset is created.

type changesetId

string

param changesetId

[REQUIRED]

The unique identifier of the Changeset for which to get data.

rtype

dict

returns

Response Syntax

{
    'changesetId': 'string',
    'changesetArn': 'string',
    'datasetId': 'string',
    'changeType': 'REPLACE'|'APPEND'|'MODIFY',
    'sourceParams': {
        'string': 'string'
    },
    'formatParams': {
        'string': 'string'
    },
    'createTime': 123,
    'status': 'PENDING'|'FAILED'|'SUCCESS'|'RUNNING'|'STOP_REQUESTED',
    'errorInfo': {
        'errorMessage': 'string',
        'errorCategory': 'VALIDATION'|'SERVICE_QUOTA_EXCEEDED'|'ACCESS_DENIED'|'RESOURCE_NOT_FOUND'|'THROTTLING'|'INTERNAL_SERVICE_EXCEPTION'|'CANCELLED'|'USER_RECOVERABLE'
    },
    'activeUntilTimestamp': 123,
    'updatesChangesetId': 'string',
    'updatedByChangesetId': 'string'
}

Response Structure

  • (dict) -- The response from a GetChangeset operation

    • changesetId (string) --

      The unique identifier for a Changeset.

    • changesetArn (string) --

      The ARN identifier of the Changeset.

    • datasetId (string) --

      The unique identifier for the FinSpace Dataset where the Changeset is created.

    • changeType (string) --

      Type that indicates how a Changeset is applied to a Dataset.

      • REPLACE - Changeset is considered as a replacement to all prior loaded Changesets.

      • APPEND - Changeset is considered as an addition to the end of all prior loaded Changesets.

      • MODIFY - Changeset is considered as a replacement to a specific prior ingested Changeset.

    • sourceParams (dict) --

      Options that define the location of the data being ingested.

      • (string) --

        • (string) --

    • formatParams (dict) --

      Options that define the structure of the source file(s).

      • (string) --

        • (string) --

    • createTime (integer) --

      The timestamp at which the Changeset was created in FinSpace. The value is determined as Epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

    • status (string) --

      The status of the Changeset creation operation.

    • errorInfo (dict) --

      The structure with error messages.

      • errorMessage (string) --

        The text of the error message.

      • errorCategory (string) --

        The category of the error.

        • VALIDATION - The inputs to this request are invalid.

        • SERVICE_QUOTA_EXCEEDED - Service quotas have been exceeded. Please contact AWS support to increase quotas.

        • ACCESS_DENIED - Missing required permission to perform this request.

        • RESOURCE_NOT_FOUND - One or more inputs to this request were not found.

        • THROTTLING - The system temporarily lacks sufficient resources to process the request.

        • INTERNAL_SERVICE_EXCEPTION - An internal service error has occurred.

        • CANCELLED - Cancelled.

        • USER_RECOVERABLE - A user recoverable error has occurred.

    • activeUntilTimestamp (integer) --

      Time until which the Changeset is active. The value is determined as Epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

    • updatesChangesetId (string) --

      The unique identifier of the Changeset that is being updated.

    • updatedByChangesetId (string) --

      The unique identifier of the updated Changeset.
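
The createTime and activeUntilTimestamp fields are Epoch times in milliseconds. A minimal sketch of converting between those integers and timezone-aware datetime objects follows; the helper names are assumptions made for this example, and the check uses the sample value quoted in the field descriptions.

```python
from datetime import datetime, timedelta, timezone

EPOCH = datetime(1970, 1, 1, tzinfo=timezone.utc)


def to_epoch_millis(moment):
    """Convert a timezone-aware datetime to Epoch milliseconds."""
    if moment.tzinfo is None:
        raise ValueError("use a timezone-aware datetime to avoid local-time surprises")
    # Integer division of timedeltas avoids floating-point rounding.
    return (moment - EPOCH) // timedelta(milliseconds=1)


def from_epoch_millis(millis):
    """Convert Epoch milliseconds to a UTC datetime."""
    return EPOCH + timedelta(milliseconds=millis)


noon = datetime(2021, 11, 1, 12, 0, 0, tzinfo=timezone.utc)
millis = to_epoch_millis(noon)
round_trip = from_epoch_millis(millis)
```

Here millis matches the documented sample value 1635768000000, and round_trip is equal to noon.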

UpdateChangeset (new)

Updates a FinSpace Changeset.

See also: AWS API Documentation

Request Syntax

client.update_changeset(
    clientToken='string',
    datasetId='string',
    changesetId='string',
    sourceParams={
        'string': 'string'
    },
    formatParams={
        'string': 'string'
    }
)
type clientToken

string

param clientToken

A token used to ensure idempotency.

This field is autopopulated if not provided.

type datasetId

string

param datasetId

[REQUIRED]

The unique identifier for the FinSpace Dataset in which the Changeset is created.

type changesetId

string

param changesetId

[REQUIRED]

The unique identifier for the Changeset to update.

type sourceParams

dict

param sourceParams

[REQUIRED]

Options that define the location of the data being ingested.

  • (string) --

    • (string) --

type formatParams

dict

param formatParams

[REQUIRED]

Options that define the structure of the source file(s).

  • (string) --

    • (string) --

rtype

dict

returns

Response Syntax

{
    'changesetId': 'string',
    'datasetId': 'string'
}

Response Structure

  • (dict) -- The response from an UpdateChangeset operation.

    • changesetId (string) --

      The unique identifier for the Changeset to update.

    • datasetId (string) --

      The unique identifier for the FinSpace Dataset in which the Changeset is created.
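
Both sourceParams and formatParams are maps from string to string, so values such as booleans need to be passed as text. The sketch below builds an update_changeset request and rejects non-string entries early. The helper name and the example keys ('s3SourcePath', 'formatType', 'withHeader') are illustrative assumptions; this page does not enumerate the accepted keys.

```python
def build_update_changeset_request(dataset_id, changeset_id, source_params, format_params):
    """Assemble kwargs for client.update_changeset (hypothetical helper)."""
    for label, params in (("sourceParams", source_params), ("formatParams", format_params)):
        for key, value in params.items():
            if not isinstance(key, str) or not isinstance(value, str):
                raise TypeError(f"{label} must map strings to strings: {key!r}={value!r}")
    return {
        "datasetId": dataset_id,
        "changesetId": changeset_id,
        "sourceParams": dict(source_params),
        "formatParams": dict(format_params),
    }


request = build_update_changeset_request(
    dataset_id="ds-123",
    changeset_id="cs-456",
    # Keys below are assumed for illustration only.
    source_params={"s3SourcePath": "s3://example-bucket/prices.csv"},
    format_params={"formatType": "CSV", "withHeader": "true"},
)

# With a real client: response = client.update_changeset(**request)
```

Validating locally gives a clearer error than waiting for the service to reject the request.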

DeleteDataset (new)

Deletes a FinSpace Dataset.

See also: AWS API Documentation

Request Syntax

client.delete_dataset(
    clientToken='string',
    datasetId='string'
)
type clientToken

string

param clientToken

A token used to ensure idempotency.

This field is autopopulated if not provided.

type datasetId

string

param datasetId

[REQUIRED]

The unique identifier of the Dataset to be deleted.

rtype

dict

returns

Response Syntax

{
    'datasetId': 'string'
}

Response Structure

  • (dict) -- The response from a DeleteDataset operation

    • datasetId (string) --

      The unique identifier for the deleted Dataset.
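
Since clientToken exists to ensure idempotency, one plausible pattern is to generate a single token and reuse it on every retry of the same logical delete. The sketch below assumes transient failures surface as ConnectionError; the helper, the retry policy, and the FlakyClient stand-in are all hypothetical, and how long the service remembers a token is not stated on this page.

```python
import uuid


def delete_dataset_with_retry(client, dataset_id, attempts=3, retry_on=(ConnectionError,)):
    """Call delete_dataset, reusing one clientToken across retries (hypothetical helper)."""
    token = str(uuid.uuid4())  # one token for every attempt of this delete
    for attempt in range(1, attempts + 1):
        try:
            return client.delete_dataset(clientToken=token, datasetId=dataset_id)
        except retry_on:
            if attempt == attempts:
                raise  # out of attempts, surface the last error


class FlakyClient:
    """Stand-in client whose first call fails, recording every token it sees."""

    def __init__(self):
        self.tokens = []

    def delete_dataset(self, clientToken, datasetId):
        self.tokens.append(clientToken)
        if len(self.tokens) == 1:
            raise ConnectionError("simulated network failure")
        return {"datasetId": datasetId}


flaky = FlakyClient()
response = delete_dataset_with_retry(flaky, "ds-123")
```

If clientToken is omitted instead, it is autopopulated, so the caller has no explicit token to repeat across its own retries.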

ListDataViews (new)

Lists all available Dataviews for a Dataset.

See also: AWS API Documentation

Request Syntax

client.list_data_views(
    datasetId='string',
    nextToken='string',
    maxResults=123
)
type datasetId

string

param datasetId

[REQUIRED]

The unique identifier of the Dataset for which to retrieve Dataviews.

type nextToken

string

param nextToken

A token indicating where a results page should begin.

type maxResults

integer

param maxResults

The maximum number of results per page.

rtype

dict

returns

Response Syntax

{
    'nextToken': 'string',
    'dataViews': [
        {
            'dataViewId': 'string',
            'dataViewArn': 'string',
            'datasetId': 'string',
            'asOfTimestamp': 123,
            'partitionColumns': [
                'string',
            ],
            'sortColumns': [
                'string',
            ],
            'status': 'RUNNING'|'STARTING'|'FAILED'|'CANCELLED'|'TIMEOUT'|'SUCCESS'|'PENDING'|'FAILED_CLEANUP_FAILED',
            'errorInfo': {
                'errorMessage': 'string',
                'errorCategory': 'VALIDATION'|'SERVICE_QUOTA_EXCEEDED'|'ACCESS_DENIED'|'RESOURCE_NOT_FOUND'|'THROTTLING'|'INTERNAL_SERVICE_EXCEPTION'|'CANCELLED'|'USER_RECOVERABLE'
            },
            'destinationTypeProperties': {
                'destinationType': 'string'
            },
            'autoUpdate': True|False,
            'createTime': 123,
            'lastModifiedTime': 123
        },
    ]
}

Response Structure

  • (dict) --

    • nextToken (string) --

      A token indicating where a results page should begin.

    • dataViews (list) --

      A list of Dataviews.

      • (dict) --

        Structure for the summary of a Dataview.

        • dataViewId (string) --

          The unique identifier for the Dataview.

        • dataViewArn (string) --

          The ARN identifier of the Dataview.

        • datasetId (string) --

          The unique identifier for the Dataview Dataset.

        • asOfTimestamp (integer) --

          Time range to use for the Dataview. The value is determined as Epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

        • partitionColumns (list) --

          Ordered set of column names used to partition data.

          • (string) -- 1 - 255 character String

        • sortColumns (list) --

          Columns to be used for sorting the data.

          • (string) -- 1 - 255 character String

        • status (string) --

          The status of a Dataview creation.

          • RUNNING - Dataview creation is running.

          • STARTING - Dataview creation is starting.

          • FAILED - Dataview creation has failed.

          • CANCELLED - Dataview creation has been cancelled.

          • TIMEOUT - Dataview creation has timed out.

          • SUCCESS - Dataview creation has succeeded.

          • PENDING - Dataview creation is pending.

          • FAILED_CLEANUP_FAILED - Dataview creation failed and resource cleanup failed.

        • errorInfo (dict) --

          The structure with error messages.

          • errorMessage (string) --

            The text of the error message.

          • errorCategory (string) --

            The category of the error.

            • VALIDATION - The inputs to this request are invalid.

            • SERVICE_QUOTA_EXCEEDED - Service quotas have been exceeded. Please contact AWS support to increase quotas.

            • ACCESS_DENIED - Missing required permission to perform this request.

            • RESOURCE_NOT_FOUND - One or more inputs to this request were not found.

            • THROTTLING - The system temporarily lacks sufficient resources to process the request.

            • INTERNAL_SERVICE_EXCEPTION - An internal service error has occurred.

            • CANCELLED - Cancelled.

            • USER_RECOVERABLE - A user recoverable error has occurred.

        • destinationTypeProperties (dict) --

          Information about the Dataview destination.

          • destinationType (string) --

            Destination type for a Dataview.

            • GLUE_TABLE - Glue table destination type.

        • autoUpdate (boolean) --

          Flag indicating whether the Dataview should be updated automatically.

        • createTime (integer) --

          The timestamp at which the Dataview was created in FinSpace. The value is determined as Epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

        • lastModifiedTime (integer) --

          The last time that a Dataview was modified. The value is determined as Epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.
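
A common use of this listing is to pick the most recently created Dataview whose status is SUCCESS. The sketch below does that while following nextToken; the function name and the FakeClient stand-in are assumptions for illustration.

```python
def latest_successful_data_view(client, dataset_id):
    """Return the newest SUCCESS Dataview summary, or None (hypothetical helper)."""
    best = None
    kwargs = {"datasetId": dataset_id}
    while True:
        page = client.list_data_views(**kwargs)
        for view in page.get("dataViews", []):
            if view.get("status") != "SUCCESS":
                continue  # skip running, failed, cancelled, and other states
            if best is None or view["createTime"] > best["createTime"]:
                best = view
        token = page.get("nextToken")
        if not token:
            return best
        kwargs["nextToken"] = token


class FakeClient:
    """Stand-in client that serves two pages keyed by nextToken."""

    pages = {
        None: {
            "dataViews": [
                {"dataViewId": "v1", "status": "SUCCESS", "createTime": 1635768000000},
                {"dataViewId": "v2", "status": "FAILED", "createTime": 1635854400000},
            ],
            "nextToken": "t1",
        },
        "t1": {
            "dataViews": [
                {"dataViewId": "v3", "status": "SUCCESS", "createTime": 1635811200000},
            ]
        },
    }

    def list_data_views(self, datasetId, nextToken=None):
        return self.pages[nextToken]


latest = latest_successful_data_view(FakeClient(), "ds-123")
```

Comparing createTime values works directly because they are integer Epoch milliseconds.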

ListDatasets (new)

Lists all of the active Datasets that a user has access to.

See also: AWS API Documentation

Request Syntax

client.list_datasets(
    nextToken='string',
    maxResults=123
)
type nextToken

string

param nextToken

A token indicating where a results page should begin.

type maxResults

integer

param maxResults

The maximum number of results per page.

rtype

dict

returns

Response Syntax

{
    'datasets': [
        {
            'datasetId': 'string',
            'datasetArn': 'string',
            'datasetTitle': 'string',
            'kind': 'TABULAR'|'NON_TABULAR',
            'datasetDescription': 'string',
            'ownerInfo': {
                'name': 'string',
                'phoneNumber': 'string',
                'email': 'string'
            },
            'createTime': 123,
            'lastModifiedTime': 123,
            'schemaDefinition': {
                'tabularSchemaConfig': {
                    'columns': [
                        {
                            'dataType': 'STRING'|'CHAR'|'INTEGER'|'TINYINT'|'SMALLINT'|'BIGINT'|'FLOAT'|'DOUBLE'|'DATE'|'DATETIME'|'BOOLEAN'|'BINARY',
                            'columnName': 'string',
                            'columnDescription': 'string'
                        },
                    ],
                    'primaryKeyColumns': [
                        'string',
                    ]
                }
            },
            'alias': 'string'
        },
    ],
    'nextToken': 'string'
}

Response Structure

  • (dict) -- Response for the ListDatasets operation

    • datasets (list) --

      List of Datasets.

      • (dict) --

        The structure for a Dataset.

        • datasetId (string) --

          An identifier for a Dataset.

        • datasetArn (string) --

          The ARN identifier of the Dataset.

        • datasetTitle (string) --

          Display title for a Dataset.

        • kind (string) --

          The format in which Dataset data is structured.

          • TABULAR - Data is structured in a tabular format.

          • NON_TABULAR - Data is structured in a non-tabular format.

        • datasetDescription (string) --

          Description for a Dataset.

        • ownerInfo (dict) --

          Contact information for a Dataset owner.

          • name (string) --

            Name of the Dataset owner.

          • phoneNumber (string) --

            Phone number for the Dataset owner.

          • email (string) --

            Email address for the Dataset owner.

        • createTime (integer) --

          The timestamp at which the Dataset was created in FinSpace. The value is determined as Epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

        • lastModifiedTime (integer) --

          The last time that the Dataset was modified. The value is determined as Epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

        • schemaDefinition (dict) --

          Definition for a schema on a tabular Dataset.

          • tabularSchemaConfig (dict) --

            The configuration for a schema on a tabular Dataset.

            • columns (list) --

              List of column definitions.

              • (dict) --

                The definition of a column in a tabular Dataset.

                • dataType (string) --

                  Data type of a column.

                  • STRING - A String data type.

                  • CHAR - A char data type.

                  • INTEGER - An integer data type.

                  • TINYINT - A tinyint data type.

                  • SMALLINT - A smallint data type.

                  • BIGINT - A bigint data type.

                  • FLOAT - A float data type.

                  • DOUBLE - A double data type.

                  • DATE - A date data type.

                  • DATETIME - A datetime data type.

                  • BOOLEAN - A boolean data type.

                  • BINARY - A binary data type.

                • columnName (string) --

                  Name for a column.

                • columnDescription (string) --

                  Description for a column.

            • primaryKeyColumns (list) --

              List of column names used for primary key.

              • (string) -- Column Name

        • alias (string) --

          The unique resource identifier for a Dataset.

    • nextToken (string) --

      A token indicating where a results page should begin.
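
The nested schemaDefinition in each entry of the datasets list can be flattened into a readable summary. The sketch below assumes that schemaDefinition may be absent (for example on a NON_TABULAR Dataset) and treats that case as "no columns"; describe_columns and the sample entry are hypothetical.

```python
def describe_columns(dataset):
    """Return 'name: TYPE' strings for one entry of the 'datasets' list."""
    config = dataset.get("schemaDefinition", {}).get("tabularSchemaConfig", {})
    primary_keys = set(config.get("primaryKeyColumns", []))
    lines = []
    for column in config.get("columns", []):
        name = column["columnName"]
        suffix = " (primary key)" if name in primary_keys else ""
        lines.append(f"{name}: {column['dataType']}{suffix}")
    return lines


sample = {
    "datasetId": "ds-123",
    "kind": "TABULAR",
    "schemaDefinition": {
        "tabularSchemaConfig": {
            "columns": [
                {"dataType": "STRING", "columnName": "ticker", "columnDescription": "Symbol"},
                {"dataType": "DOUBLE", "columnName": "close", "columnDescription": "Close"},
            ],
            "primaryKeyColumns": ["ticker"],
        }
    },
}

summary = describe_columns(sample)
```

Using dict.get with defaults keeps the helper tolerant of entries that omit optional fields.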

CreateDataView (new)

Creates a Dataview for a Dataset.

See also: AWS API Documentation

Request Syntax

client.create_data_view(
    clientToken='string',
    datasetId='string',
    autoUpdate=True|False,
    sortColumns=[
        'string',
    ],
    partitionColumns=[
        'string',
    ],
    asOfTimestamp=123,
    destinationTypeParams={
        'destinationType': 'string'
    }
)
type clientToken

string

param clientToken

A token used to ensure idempotency.

This field is autopopulated if not provided.

type datasetId

string

param datasetId

[REQUIRED]

The unique Dataset identifier that is used to create a Dataview.

type autoUpdate

boolean

param autoUpdate

Flag indicating whether the Dataview should be updated automatically.

type sortColumns

list

param sortColumns

Columns to be used for sorting the data.

  • (string) -- 1 - 255 character String

type partitionColumns

list

param partitionColumns

Ordered set of column names used to partition data.

  • (string) -- 1 - 255 character String

type asOfTimestamp

integer

param asOfTimestamp

Beginning time to use for the Dataview. The value is determined as Epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

type destinationTypeParams

dict

param destinationTypeParams

[REQUIRED]

Options that define the destination type for the Dataview.

  • destinationType (string) -- [REQUIRED]

    Destination type for a Dataview.

    • GLUE_TABLE - Glue table destination type.

rtype

dict

returns

Response Syntax

{
    'datasetId': 'string',
    'dataViewId': 'string'
}

Response Structure

  • (dict) -- Response for creating a data view.

    • datasetId (string) --

      The unique identifier of the Dataset used for the Dataview.

    • dataViewId (string) --

      The unique identifier for the created Dataview.
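
To tie the parameters above together, the following sketch builds a create_data_view request with a GLUE_TABLE destination, converts an optional timezone-aware datetime to asOfTimestamp in Epoch milliseconds, and enforces the documented 1 to 255 character limit on column names. The helper names and sample column names are assumptions for this example.

```python
from datetime import datetime, timezone


def _check_columns(label, columns):
    """Validate the documented 1 to 255 character length of column names."""
    for name in columns:
        if not 1 <= len(name) <= 255:
            raise ValueError(f"{label} entries must be 1 to 255 characters: {name!r}")
    return list(columns)


def build_create_data_view_request(dataset_id, as_of=None, partition_columns=(),
                                   sort_columns=(), auto_update=False):
    """Assemble kwargs for client.create_data_view (hypothetical helper)."""
    request = {
        "datasetId": dataset_id,
        "autoUpdate": auto_update,
        "partitionColumns": _check_columns("partitionColumns", partition_columns),
        "sortColumns": _check_columns("sortColumns", sort_columns),
        "destinationTypeParams": {"destinationType": "GLUE_TABLE"},
    }
    if as_of is not None:
        if as_of.tzinfo is None:
            raise ValueError("as_of must be timezone-aware")
        request["asOfTimestamp"] = int(as_of.timestamp() * 1000)
    return request


request = build_create_data_view_request(
    "ds-123",
    as_of=datetime(2021, 11, 1, 12, 0, 0, tzinfo=timezone.utc),
    partition_columns=["trade_date"],
    sort_columns=["ticker"],
)

# With a real client: data_view_id = client.create_data_view(**request)["dataViewId"]
```

Leaving asOfTimestamp out of the dict when no time is given avoids sending a None value for an integer parameter.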

UpdateDataset (new)

Updates a FinSpace Dataset.

See also: AWS API Documentation

Request Syntax

client.update_dataset(
    clientToken='string',
    datasetId='string',
    datasetTitle='string',
    kind='TABULAR'|'NON_TABULAR',
    datasetDescription='string',
    alias='string',
    schemaDefinition={
        'tabularSchemaConfig': {
            'columns': [
                {
                    'dataType': 'STRING'|'CHAR'|'INTEGER'|'TINYINT'|'SMALLINT'|'BIGINT'|'FLOAT'|'DOUBLE'|'DATE'|'DATETIME'|'BOOLEAN'|'BINARY',
                    'columnName': 'string',
                    'columnDescription': 'string'
                },
            ],
            'primaryKeyColumns': [
                'string',
            ]
        }
    }
)
type clientToken

string

param clientToken

A token used to ensure idempotency.

This field is autopopulated if not provided.

type datasetId

string

param datasetId

[REQUIRED]

The unique identifier for the Dataset to update.

type datasetTitle

string

param datasetTitle

[REQUIRED]

A display title for the Dataset.

type kind

string

param kind

[REQUIRED]

The format in which the Dataset data is structured.

  • TABULAR - Data is structured in a tabular format.

  • NON_TABULAR - Data is structured in a non-tabular format.

type datasetDescription

string

param datasetDescription

A description for the Dataset.

type alias

string

param alias

[REQUIRED]

The unique resource identifier for a Dataset.

type schemaDefinition

dict

param schemaDefinition

Definition for a schema on a tabular Dataset.

  • tabularSchemaConfig (dict) --

    The configuration for a schema on a tabular Dataset.

    • columns (list) --

      List of column definitions.

      • (dict) --

        The definition of a column in a tabular Dataset.

        • dataType (string) --

          Data type of a column.

          • STRING - A String data type.

          • CHAR - A char data type.

          • INTEGER - An integer data type.

          • TINYINT - A tinyint data type.

          • SMALLINT - A smallint data type.

          • BIGINT - A bigint data type.

          • FLOAT - A float data type.

          • DOUBLE - A double data type.

          • DATE - A date data type.

          • DATETIME - A datetime data type.

          • BOOLEAN - A boolean data type.

          • BINARY - A binary data type.

        • columnName (string) --

          Name for a column.

        • columnDescription (string) --

          Description for a column.

    • primaryKeyColumns (list) --

      List of column names used for primary key.

      • (string) -- Column Name

rtype

dict

returns

Response Syntax

{
    'datasetId': 'string'
}

Response Structure

  • (dict) -- The response from an UpdateDataset operation

    • datasetId (string) --

      The unique identifier for the updated Dataset.
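
Because update_dataset requires datasetTitle, kind, and alias even when only one of them changes, a read-modify-write pattern is one way to change a single field. The sketch below fetches the current values with get_dataset and copies the optional fields forward as well, since this page does not say whether omitted optional fields are preserved. rename_dataset and the FakeClient stand-in are hypothetical.

```python
def rename_dataset(client, dataset_id, new_title):
    """Change only the title, carrying other fields over (hypothetical helper)."""
    current = client.get_dataset(datasetId=dataset_id)
    request = {
        "datasetId": dataset_id,
        "datasetTitle": new_title,
        "kind": current["kind"],
        "alias": current["alias"],
    }
    # Copy optional fields only when the service returned them.
    for optional in ("datasetDescription", "schemaDefinition"):
        if optional in current:
            request[optional] = current[optional]
    return client.update_dataset(**request)["datasetId"]


class FakeClient:
    """Stand-in client that records the update request it receives."""

    def __init__(self):
        self.sent = None

    def get_dataset(self, datasetId):
        return {
            "datasetId": datasetId,
            "datasetTitle": "Old title",
            "kind": "TABULAR",
            "alias": "prices",
            "datasetDescription": "Sample",
            "status": "SUCCESS",
        }

    def update_dataset(self, **kwargs):
        self.sent = kwargs
        return {"datasetId": kwargs["datasetId"]}


fake = FakeClient()
updated_id = rename_dataset(fake, "ds-123", "New title")
```

Response-only fields such as status and createTime are deliberately not copied, because update_dataset does not accept them.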

GetDataView (new)

Gets information about a Dataview.

See also: AWS API Documentation

Request Syntax

client.get_data_view(
    dataViewId='string',
    datasetId='string'
)
type dataViewId

string

param dataViewId

[REQUIRED]

The unique identifier for the Dataview.

type datasetId

string

param datasetId

[REQUIRED]

The unique identifier for the Dataset used in the Dataview.

rtype

dict

returns

Response Syntax

{
    'autoUpdate': True|False,
    'partitionColumns': [
        'string',
    ],
    'datasetId': 'string',
    'asOfTimestamp': 123,
    'errorInfo': {
        'errorMessage': 'string',
        'errorCategory': 'VALIDATION'|'SERVICE_QUOTA_EXCEEDED'|'ACCESS_DENIED'|'RESOURCE_NOT_FOUND'|'THROTTLING'|'INTERNAL_SERVICE_EXCEPTION'|'CANCELLED'|'USER_RECOVERABLE'
    },
    'lastModifiedTime': 123,
    'createTime': 123,
    'sortColumns': [
        'string',
    ],
    'dataViewId': 'string',
    'dataViewArn': 'string',
    'destinationTypeParams': {
        'destinationType': 'string'
    },
    'status': 'RUNNING'|'STARTING'|'FAILED'|'CANCELLED'|'TIMEOUT'|'SUCCESS'|'PENDING'|'FAILED_CLEANUP_FAILED'
}

Response Structure

  • (dict) -- Response from retrieving a Dataview, which includes details on the target database and table name.

    • autoUpdate (boolean) --

      Flag that indicates whether the Dataview should be updated automatically.

    • partitionColumns (list) --

      Ordered set of column names used to partition data.

      • (string) -- 1 - 255 character String

    • datasetId (string) --

      The unique identifier for the Dataset used in the Dataview.

    • asOfTimestamp (integer) --

      Time range to use for the Dataview. The value is determined as Epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

    • errorInfo (dict) --

      Information about an error that occurred for the Dataview.

      • errorMessage (string) --

        The text of the error message.

      • errorCategory (string) --

        The category of the error.

        • VALIDATION - The inputs to this request are invalid.

        • SERVICE_QUOTA_EXCEEDED - Service quotas have been exceeded. Please contact AWS support to increase quotas.

        • ACCESS_DENIED - Missing required permission to perform this request.

        • RESOURCE_NOT_FOUND - One or more inputs to this request were not found.

        • THROTTLING - The system temporarily lacks sufficient resources to process the request.

        • INTERNAL_SERVICE_EXCEPTION - An internal service error has occurred.

        • CANCELLED - Cancelled.

        • USER_RECOVERABLE - A user recoverable error has occurred.

    • lastModifiedTime (integer) --

      The last time that a Dataview was modified. The value is determined as Epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

    • createTime (integer) --

      The timestamp at which the Dataview was created in FinSpace. The value is determined as Epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

    • sortColumns (list) --

      Columns to be used for sorting the data.

      • (string) -- 1 - 255 character String

    • dataViewId (string) --

      The unique identifier for the Dataview.

    • dataViewArn (string) --

      The ARN identifier of the Dataview.

    • destinationTypeParams (dict) --

      Options that define the destination type for the Dataview.

      • destinationType (string) --

        Destination type for a Dataview.

        • GLUE_TABLE - Glue table destination type.

    • status (string) --

      The status of a Dataview creation.

      • RUNNING - Dataview creation is running.

      • STARTING - Dataview creation is starting.

      • FAILED - Dataview creation has failed.

      • CANCELLED - Dataview creation has been cancelled.

      • TIMEOUT - Dataview creation has timed out.

      • SUCCESS - Dataview creation has succeeded.

      • PENDING - Dataview creation is pending.

      • FAILED_CLEANUP_FAILED - Dataview creation failed and resource cleanup failed.
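
The timestamps in the response are epoch milliseconds, and status takes one of eight values. The following is a minimal sketch of how a caller might interpret a GetDataView response. The function names and the sample response are made up for illustration, and treating PENDING, STARTING, and RUNNING as the only in-progress states is an assumption drawn from the status descriptions above.

```python
from datetime import datetime, timezone

# Assumption: any status outside this set means creation is no longer in progress.
IN_PROGRESS_STATUSES = {"PENDING", "STARTING", "RUNNING"}


def epoch_ms_to_datetime(epoch_ms):
    """Convert epoch milliseconds to a timezone-aware UTC datetime."""
    return datetime.fromtimestamp(epoch_ms / 1000, tz=timezone.utc)


def summarize_data_view(response):
    """Reduce a GetDataView response dict to the fields a poller usually needs."""
    status = response["status"]
    error_info = response.get("errorInfo") or {}
    return {
        "dataViewId": response["dataViewId"],
        "status": status,
        "in_progress": status in IN_PROGRESS_STATUSES,
        "succeeded": status == "SUCCESS",
        "created": epoch_ms_to_datetime(response["createTime"]),
        "last_modified": epoch_ms_to_datetime(response["lastModifiedTime"]),
        "error": error_info.get("errorMessage"),
    }


# Made-up sample shaped like the Response Syntax above; not real service output.
sample_response = {
    "dataViewId": "example-dataview-id",
    "datasetId": "example-dataset-id",
    "status": "RUNNING",
    "createTime": 1635768000000,
    "lastModifiedTime": 1635768000000,
}

summary = summarize_data_view(sample_response)
print(summary["created"].isoformat())  # 2021-11-01T12:00:00+00:00
```

In practice the dict passed to summarize_data_view would be the return value of client.get_data_view(dataViewId=..., datasetId=...).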

CreateChangeset (updated) Link ¶
Changes (request, response)
Request
{'clientToken': 'string'}
Response
{'changesetId': 'string', 'datasetId': 'string'}

Creates a new Changeset in a FinSpace Dataset.

See also: AWS API Documentation

Request Syntax

client.create_changeset(
    clientToken='string',
    datasetId='string',
    changeType='REPLACE'|'APPEND'|'MODIFY',
    sourceParams={
        'string': 'string'
    },
    formatParams={
        'string': 'string'
    }
)
type clientToken

string

param clientToken

A token used to ensure idempotency.

This field is autopopulated if not provided.

type datasetId

string

param datasetId

[REQUIRED]

The unique identifier for the FinSpace Dataset where the Changeset will be created.

type changeType

string

param changeType

[REQUIRED]

Option to indicate how a Changeset will be applied to a Dataset.

  • REPLACE - The Changeset is treated as a replacement for all previously loaded Changesets.

  • APPEND - The Changeset is treated as an addition to the end of all previously loaded Changesets.

  • MODIFY - The Changeset is treated as a replacement for a specific previously ingested Changeset.

type sourceParams

dict

param sourceParams

[REQUIRED]

Options that define the location of the data being ingested.

  • (string) --

    • (string) --

type formatParams

dict

param formatParams

[REQUIRED]

Options that define the structure of the source file(s), including the format type (formatType), header row (withHeader), data separation character (separator), and the type of compression (compression).

formatType is a required attribute and can have the following values:

  • PARQUET - Parquet source file format.

  • CSV - CSV source file format.

  • JSON - JSON source file format.

  • XML - XML source file format.

For example, you could specify the following for formatParams:

  • (string) --

    • (string) --
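
Because formatParams is a string-to-string map, every value must be passed as a string. The following is a minimal sketch of a helper that builds such a map from the four attributes named above. The helper name is hypothetical, and encoding the header flag as "true" or "false" is an assumption, since the accepted values for withHeader, separator, and compression are not listed in this section.

```python
# formatType values listed in the formatParams description above.
VALID_FORMAT_TYPES = {"PARQUET", "CSV", "JSON", "XML"}


def build_format_params(format_type, with_header=None, separator=None,
                        compression=None):
    """Build a formatParams map; hypothetical helper, not part of boto3."""
    if format_type not in VALID_FORMAT_TYPES:
        raise ValueError(f"unsupported formatType: {format_type!r}")

    # formatType is the only attribute described as required.
    params = {"formatType": format_type}

    if with_header is not None:
        # Assumption: the boolean is encoded as the string "true" or "false".
        params["withHeader"] = "true" if with_header else "false"
    if separator is not None:
        params["separator"] = separator
    if compression is not None:
        # Passed through unchanged; valid values are not listed in this section.
        params["compression"] = compression
    return params


csv_params = build_format_params("CSV", with_header=True, separator=",")
parquet_params = build_format_params("PARQUET")
```

Optional attributes are omitted rather than sent as empty strings, so a call that supplies only the format type produces a map containing formatType alone.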

rtype

dict

returns

Response Syntax

{
    'datasetId': 'string',
    'changesetId': 'string'
}

Response Structure

  • (dict) -- The response from a CreateChangeset operation.

    • datasetId (string) --

      The unique identifier for the FinSpace Dataset where the Changeset is created.

    • changesetId (string) --

      The unique identifier of the Changeset that is created.
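
The clientToken parameter exists to make the request idempotent, which matters most when a call is retried. The following is a sketch, under stated assumptions, of building the request arguments once and reusing them unchanged on retry. The helper name, the dataset identifier, and the sourceParams key are placeholders invented for illustration; the real sourceParams keys are not listed in this section.

```python
import uuid

# changeType values listed in the request description above.
VALID_CHANGE_TYPES = {"REPLACE", "APPEND", "MODIFY"}


def build_create_changeset_request(dataset_id, change_type, source_params,
                                   format_params, client_token=None):
    """Assemble keyword arguments for create_changeset (hypothetical helper)."""
    if change_type not in VALID_CHANGE_TYPES:
        raise ValueError(f"unsupported changeType: {change_type!r}")
    return {
        # Generate the token once so that a retry can send the same value.
        "clientToken": client_token or str(uuid.uuid4()),
        "datasetId": dataset_id,
        "changeType": change_type,
        "sourceParams": dict(source_params),
        "formatParams": dict(format_params),
    }


request = build_create_changeset_request(
    dataset_id="example-dataset-id",  # placeholder identifier
    change_type="APPEND",
    source_params={"exampleSourceKey": "example-value"},  # placeholder key
    format_params={"formatType": "CSV"},
)

# A retry reuses the same arguments, including the same clientToken.
retry_request = dict(request)
```

The arguments could then be sent with client.create_changeset(**request). If the token is left out, the description above notes that the field is autopopulated, in which case a retry would carry a different token unless the caller saves and resends it.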