FinSpace Public API

2021/11/22 - FinSpace Public API - 11 new1 updated api methods

Changes  Add new APIs for managing Datasets, Changesets, and Dataviews.

UpdateDataset (new) Link ¶

Updates a FinSpace Dataset.

See also: AWS API Documentation

Request Syntax

client.update_dataset(
    clientToken='string',
    datasetId='string',
    datasetTitle='string',
    kind='TABULAR'|'NON_TABULAR',
    datasetDescription='string',
    alias='string',
    schemaDefinition={
        'tabularSchemaConfig': {
            'columns': [
                {
                    'dataType': 'STRING'|'CHAR'|'INTEGER'|'TINYINT'|'SMALLINT'|'BIGINT'|'FLOAT'|'DOUBLE'|'DATE'|'DATETIME'|'BOOLEAN'|'BINARY',
                    'columnName': 'string',
                    'columnDescription': 'string'
                },
            ],
            'primaryKeyColumns': [
                'string',
            ]
        }
    }
)
type clientToken:

string

param clientToken:

A token used to ensure idempotency.

This field is autopopulated if not provided.

type datasetId:

string

param datasetId:

[REQUIRED]

The unique identifier for the Dataset to update.

type datasetTitle:

string

param datasetTitle:

[REQUIRED]

A display title for the Dataset.

type kind:

string

param kind:

[REQUIRED]

The format in which the Dataset data is structured.

  • TABULAR - Data is structured in a tabular format.

  • NON_TABULAR - Data is structured in a non-tabular format.

type datasetDescription:

string

param datasetDescription:

A description for the Dataset.

type alias:

string

param alias:

[REQUIRED]

The unique resource identifier for a Dataset.

type schemaDefinition:

dict

param schemaDefinition:

Definition for a schema on a tabular Dataset.

  • tabularSchemaConfig (dict) --

    The configuration for a schema on a tabular Dataset.

    • columns (list) --

      List of column definitions.

      • (dict) --

        The definition of a column in a tabular Dataset.

        • dataType (string) --

          Data type of a column.

          • STRING - A String data type. CHAR - A char data type. INTEGER - An integer data type. TINYINT - A tinyint data type. SMALLINT - A smallint data type. BIGINT - A bigint data type. FLOAT - A float data type. DOUBLE - A double data type. DATE - A date data type. DATETIME - A datetime data type. BOOLEAN - A boolean data type. BINARY - A binary data type.

        • columnName (string) --

          Name for a column.

        • columnDescription (string) --

          Description for a column.

    • primaryKeyColumns (list) --

      List of column names used for primary key.

      • (string) -- Column Name

rtype:

dict

returns:

Response Syntax

{
    'datasetId': 'string'
}

Response Structure

  • (dict) -- The response from an UpdateDataset operation

    • datasetId (string) --

      The unique identifier for updated Dataset.

GetChangeset (new) Link ¶

Get information about a Changeset.

See also: AWS API Documentation

Request Syntax

client.get_changeset(
    datasetId='string',
    changesetId='string'
)
type datasetId:

string

param datasetId:

[REQUIRED]

The unique identifier for the FinSpace Dataset where the Changeset is created.

type changesetId:

string

param changesetId:

[REQUIRED]

The unique identifier of the Changeset for which to get data.

rtype:

dict

returns:

Response Syntax

{
    'changesetId': 'string',
    'changesetArn': 'string',
    'datasetId': 'string',
    'changeType': 'REPLACE'|'APPEND'|'MODIFY',
    'sourceParams': {
        'string': 'string'
    },
    'formatParams': {
        'string': 'string'
    },
    'createTime': 123,
    'status': 'PENDING'|'FAILED'|'SUCCESS'|'RUNNING'|'STOP_REQUESTED',
    'errorInfo': {
        'errorMessage': 'string',
        'errorCategory': 'VALIDATION'|'SERVICE_QUOTA_EXCEEDED'|'ACCESS_DENIED'|'RESOURCE_NOT_FOUND'|'THROTTLING'|'INTERNAL_SERVICE_EXCEPTION'|'CANCELLED'|'USER_RECOVERABLE'
    },
    'activeUntilTimestamp': 123,
    'updatesChangesetId': 'string',
    'updatedByChangesetId': 'string'
}

Response Structure

  • (dict) -- The response from a describe changeset operation

    • changesetId (string) --

      The unique identifier for a Changeset.

    • changesetArn (string) --

      The ARN identifier of the Changeset.

    • datasetId (string) --

      The unique identifier for the FinSpace Dataset where the Changeset is created.

    • changeType (string) --

      Type that indicates how a Changeset is applied to a Dataset.

      • REPLACE - Changeset is considered as a replacement to all prior loaded Changesets.

      • APPEND - Changeset is considered as an addition to the end of all prior loaded Changesets.

      • MODIFY - Changeset is considered as a replacement to a specific prior ingested Changeset.

    • sourceParams (dict) --

      Options that define the location of the data being ingested.

      • (string) --

        • (string) --

    • formatParams (dict) --

      Structure of the source file(s).

      • (string) --

        • (string) --

    • createTime (integer) --

      The timestamp at which the Changeset was created in FinSpace. The value is determined as Epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

    • status (string) --

      The status of Changeset creation operation.

    • errorInfo (dict) --

      The structure with error messages.

      • errorMessage (string) --

        The text of the error message.

      • errorCategory (string) --

        The category of the error.

        • VALIDATION -The inputs to this request are invalid.

        • SERVICE_QUOTA_EXCEEDED - Service quotas have been exceeded. Please contact AWS support to increase quotas.

        • ACCESS_DENIED - Missing required permission to perform this request.

        • RESOURCE_NOT_FOUND - One or more inputs to this request were not found.

        • THROTTLING - The system temporarily lacks sufficient resources to process the request.

        • INTERNAL_SERVICE_EXCEPTION - An internal service error has occurred.

        • CANCELLED - Cancelled.

        • USER_RECOVERABLE - A user recoverable error has occurred.

    • activeUntilTimestamp (integer) --

      Time until which the Changeset is active. The value is determined as Epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

    • updatesChangesetId (string) --

      The unique identifier of the Changeset that is being updated.

    • updatedByChangesetId (string) --

      The unique identifier of the updated Changeset.

ListChangesets (new) Link ¶

Lists the FinSpace Changesets for a Dataset.

See also: AWS API Documentation

Request Syntax

client.list_changesets(
    datasetId='string',
    maxResults=123,
    nextToken='string'
)
type datasetId:

string

param datasetId:

[REQUIRED]

The unique identifier for the FinSpace Dataset to which the Changeset belongs.

type maxResults:

integer

param maxResults:

The maximum number of results per page.

type nextToken:

string

param nextToken:

A token indicating where a results page should begin.

rtype:

dict

returns:

Response Syntax

{
    'changesets': [
        {
            'changesetId': 'string',
            'changesetArn': 'string',
            'datasetId': 'string',
            'changeType': 'REPLACE'|'APPEND'|'MODIFY',
            'sourceParams': {
                'string': 'string'
            },
            'formatParams': {
                'string': 'string'
            },
            'createTime': 123,
            'status': 'PENDING'|'FAILED'|'SUCCESS'|'RUNNING'|'STOP_REQUESTED',
            'errorInfo': {
                'errorMessage': 'string',
                'errorCategory': 'VALIDATION'|'SERVICE_QUOTA_EXCEEDED'|'ACCESS_DENIED'|'RESOURCE_NOT_FOUND'|'THROTTLING'|'INTERNAL_SERVICE_EXCEPTION'|'CANCELLED'|'USER_RECOVERABLE'
            },
            'activeUntilTimestamp': 123,
            'updatesChangesetId': 'string',
            'updatedByChangesetId': 'string'
        },
    ],
    'nextToken': 'string'
}

Response Structure

  • (dict) -- Response to ListChangesetsResponse. This returns a list of dataset changesets that match the query criteria.

    • changesets (list) --

      List of Changesets found.

      • (dict) --

        A Changeset is unit of data in a Dataset.

        • changesetId (string) --

          The unique identifier for a Changeset.

        • changesetArn (string) --

          The ARN identifier of the Changeset.

        • datasetId (string) --

          The unique identifier for the FinSpace Dataset in which the Changeset is created.

        • changeType (string) --

          Type that indicates how a Changeset is applied to a Dataset.

          • REPLACE - Changeset is considered as a replacement to all prior loaded Changesets.

          • APPEND - Changeset is considered as an addition to the end of all prior loaded Changesets.

          • MODIFY - Changeset is considered as a replacement to a specific prior ingested Changeset.

        • sourceParams (dict) --

          Options that define the location of the data being ingested.

          • (string) --

            • (string) --

        • formatParams (dict) --

          Options that define the structure of the source file(s).

          • (string) --

            • (string) --

        • createTime (integer) --

          The timestamp at which the Changeset was created in FinSpace. The value is determined as Epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

        • status (string) --

          Status of the Changeset ingestion.

          • PENDING - Changeset is pending creation.

          • FAILED - Changeset creation has failed.

          • SUCCESS - Changeset creation has succeeded.

          • RUNNING - Changeset creation is running.

          • STOP_REQUESTED - User requested Changeset creation to stop.

        • errorInfo (dict) --

          The structure with error messages.

          • errorMessage (string) --

            The text of the error message.

          • errorCategory (string) --

            The category of the error.

            • VALIDATION -The inputs to this request are invalid.

            • SERVICE_QUOTA_EXCEEDED - Service quotas have been exceeded. Please contact AWS support to increase quotas.

            • ACCESS_DENIED - Missing required permission to perform this request.

            • RESOURCE_NOT_FOUND - One or more inputs to this request were not found.

            • THROTTLING - The system temporarily lacks sufficient resources to process the request.

            • INTERNAL_SERVICE_EXCEPTION - An internal service error has occurred.

            • CANCELLED - Cancelled.

            • USER_RECOVERABLE - A user recoverable error has occurred.

        • activeUntilTimestamp (integer) --

          Time until which the Changeset is active. The value is determined as Epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

        • updatesChangesetId (string) --

          The unique identifier of the Changeset that is updated.

        • updatedByChangesetId (string) --

          The unique identifier of the updated Changeset.

    • nextToken (string) --

      A token indicating where a results page should begin.

DeleteDataset (new) Link ¶

Deletes a FinSpace Dataset.

See also: AWS API Documentation

Request Syntax

client.delete_dataset(
    clientToken='string',
    datasetId='string'
)
type clientToken:

string

param clientToken:

A token used to ensure idempotency.

This field is autopopulated if not provided.

type datasetId:

string

param datasetId:

[REQUIRED]

The unique identifier of the Dataset to be deleted.

rtype:

dict

returns:

Response Syntax

{
    'datasetId': 'string'
}

Response Structure

  • (dict) -- The response from an DeleteDataset operation

    • datasetId (string) --

      The unique identifier for the deleted Dataset.

GetDataset (new) Link ¶

Returns information about a Dataset.

See also: AWS API Documentation

Request Syntax

client.get_dataset(
    datasetId='string'
)
type datasetId:

string

param datasetId:

[REQUIRED]

The unique identifier for a Dataset.

rtype:

dict

returns:

Response Syntax

{
    'datasetId': 'string',
    'datasetArn': 'string',
    'datasetTitle': 'string',
    'kind': 'TABULAR'|'NON_TABULAR',
    'datasetDescription': 'string',
    'createTime': 123,
    'lastModifiedTime': 123,
    'schemaDefinition': {
        'tabularSchemaConfig': {
            'columns': [
                {
                    'dataType': 'STRING'|'CHAR'|'INTEGER'|'TINYINT'|'SMALLINT'|'BIGINT'|'FLOAT'|'DOUBLE'|'DATE'|'DATETIME'|'BOOLEAN'|'BINARY',
                    'columnName': 'string',
                    'columnDescription': 'string'
                },
            ],
            'primaryKeyColumns': [
                'string',
            ]
        }
    },
    'alias': 'string',
    'status': 'PENDING'|'FAILED'|'SUCCESS'|'RUNNING'
}

Response Structure

  • (dict) -- Response for the GetDataset operation

    • datasetId (string) --

      The unique identifier for a Dataset.

    • datasetArn (string) --

      The ARN identifier of the Dataset.

    • datasetTitle (string) --

      Display title for a Dataset.

    • kind (string) --

      The format in which Dataset data is structured.

      • TABULAR - Data is structured in a tabular format.

      • NON_TABULAR - Data is structured in a non-tabular format.

    • datasetDescription (string) --

      A description of the Dataset.

    • createTime (integer) --

      The timestamp at which the Dataset was created in FinSpace. The value is determined as Epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

    • lastModifiedTime (integer) --

      The last time that the Dataset was modified. The value is determined as Epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

    • schemaDefinition (dict) --

      Definition for a schema on a tabular Dataset.

      • tabularSchemaConfig (dict) --

        The configuration for a schema on a tabular Dataset.

        • columns (list) --

          List of column definitions.

          • (dict) --

            The definition of a column in a tabular Dataset.

            • dataType (string) --

              Data type of a column.

              • STRING - A String data type. CHAR - A char data type. INTEGER - An integer data type. TINYINT - A tinyint data type. SMALLINT - A smallint data type. BIGINT - A bigint data type. FLOAT - A float data type. DOUBLE - A double data type. DATE - A date data type. DATETIME - A datetime data type. BOOLEAN - A boolean data type. BINARY - A binary data type.

            • columnName (string) --

              Name for a column.

            • columnDescription (string) --

              Description for a column.

        • primaryKeyColumns (list) --

          List of column names used for primary key.

          • (string) -- Column Name

    • alias (string) --

      The unique resource identifier for a Dataset.

    • status (string) --

      Status of the Dataset creation.

      • PENDING - Dataset is pending creation.

      • FAILED - Dataset creation has failed.

      • SUCCESS - Dataset creation has succeeded.

      • RUNNING - Dataset creation is running.

GetDataView (new) Link ¶

Gets information about a Dataview.

See also: AWS API Documentation

Request Syntax

client.get_data_view(
    dataViewId='string',
    datasetId='string'
)
type dataViewId:

string

param dataViewId:

[REQUIRED]

The unique identifier for the Dataview.

type datasetId:

string

param datasetId:

[REQUIRED]

The unique identifier for the Dataset used in the Dataview.

rtype:

dict

returns:

Response Syntax

{
    'autoUpdate': True|False,
    'partitionColumns': [
        'string',
    ],
    'datasetId': 'string',
    'asOfTimestamp': 123,
    'errorInfo': {
        'errorMessage': 'string',
        'errorCategory': 'VALIDATION'|'SERVICE_QUOTA_EXCEEDED'|'ACCESS_DENIED'|'RESOURCE_NOT_FOUND'|'THROTTLING'|'INTERNAL_SERVICE_EXCEPTION'|'CANCELLED'|'USER_RECOVERABLE'
    },
    'lastModifiedTime': 123,
    'createTime': 123,
    'sortColumns': [
        'string',
    ],
    'dataViewId': 'string',
    'dataViewArn': 'string',
    'destinationTypeParams': {
        'destinationType': 'string'
    },
    'status': 'RUNNING'|'STARTING'|'FAILED'|'CANCELLED'|'TIMEOUT'|'SUCCESS'|'PENDING'|'FAILED_CLEANUP_FAILED'
}

Response Structure

  • (dict) -- Response from retrieving a dataview, which includes details on the target database and table name

    • autoUpdate (boolean) --

      Flag to indicate Dataview should be updated automatically.

    • partitionColumns (list) --

      Ordered set of column names used to partition data.

      • (string) -- 1 - 255 character String

    • datasetId (string) --

      The unique identifier for the Dataset used in the Dataview.

    • asOfTimestamp (integer) --

      Time range to use for the Dataview. The value is determined as Epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

    • errorInfo (dict) --

      Information about an error that occurred for the Dataview.

      • errorMessage (string) --

        The text of the error message.

      • errorCategory (string) --

        The category of the error.

        • VALIDATION -The inputs to this request are invalid.

        • SERVICE_QUOTA_EXCEEDED - Service quotas have been exceeded. Please contact AWS support to increase quotas.

        • ACCESS_DENIED - Missing required permission to perform this request.

        • RESOURCE_NOT_FOUND - One or more inputs to this request were not found.

        • THROTTLING - The system temporarily lacks sufficient resources to process the request.

        • INTERNAL_SERVICE_EXCEPTION - An internal service error has occurred.

        • CANCELLED - Cancelled.

        • USER_RECOVERABLE - A user recoverable error has occurred.

    • lastModifiedTime (integer) --

      The last time that a Dataview was modified. The value is determined as Epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

    • createTime (integer) --

      The timestamp at which the Dataview was created in FinSpace. The value is determined as Epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

    • sortColumns (list) --

      Columns to be used for sorting the data.

      • (string) -- 1 - 255 character String

    • dataViewId (string) --

      The unique identifier for the Dataview.

    • dataViewArn (string) --

      The ARN identifier of the Dataview.

    • destinationTypeParams (dict) --

      Options that define the destination type for the Dataview.

      • destinationType (string) --

        Destination type for a Dataview.

        • GLUE_TABLE - Glue table destination type.

    • status (string) --

      The status of a Dataview creation.

      • RUNNING - Dataview creation is running.

      • STARTING - Dataview creation is starting.

      • FAILED - Dataview creation has failed.

      • CANCELLED - Dataview creation has been cancelled.

      • TIMEOUT - Dataview creation has timed out.

      • SUCCESS - Dataview creation has succeeded.

      • PENDING - Dataview creation is pending.

      • FAILED_CLEANUP_FAILED - Dataview creation failed and resource cleanup failed.

CreateDataView (new) Link ¶

Creates a Dataview for a Dataset.

See also: AWS API Documentation

Request Syntax

client.create_data_view(
    clientToken='string',
    datasetId='string',
    autoUpdate=True|False,
    sortColumns=[
        'string',
    ],
    partitionColumns=[
        'string',
    ],
    asOfTimestamp=123,
    destinationTypeParams={
        'destinationType': 'string'
    }
)
type clientToken:

string

param clientToken:

A token used to ensure idempotency.

This field is autopopulated if not provided.

type datasetId:

string

param datasetId:

[REQUIRED]

The unique Dataset identifier that is used to create a Dataview.

type autoUpdate:

boolean

param autoUpdate:

Flag to indicate Dataview should be updated automatically.

type sortColumns:

list

param sortColumns:

Columns to be used for sorting the data.

  • (string) -- 1 - 255 character String

type partitionColumns:

list

param partitionColumns:

Ordered set of column names used to partition data.

  • (string) -- 1 - 255 character String

type asOfTimestamp:

integer

param asOfTimestamp:

Beginning time to use for the Dataview. The value is determined as Epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

type destinationTypeParams:

dict

param destinationTypeParams:

[REQUIRED]

Options that define the destination type for the Dataview.

  • destinationType (string) -- [REQUIRED]

    Destination type for a Dataview.

    • GLUE_TABLE - Glue table destination type.

rtype:

dict

returns:

Response Syntax

{
    'datasetId': 'string',
    'dataViewId': 'string'
}

Response Structure

  • (dict) -- Response for creating a data view.

    • datasetId (string) --

      The unique identifier of the Dataset used for the Dataview.

    • dataViewId (string) --

      The unique identifier for the created Dataview.

ListDataViews (new) Link ¶

Lists all available Dataviews for a Dataset.

See also: AWS API Documentation

Request Syntax

client.list_data_views(
    datasetId='string',
    nextToken='string',
    maxResults=123
)
type datasetId:

string

param datasetId:

[REQUIRED]

The unique identifier of the Dataset for which to retrieve Dataviews.

type nextToken:

string

param nextToken:

A token indicating where a results page should begin.

type maxResults:

integer

param maxResults:

The maximum number of results per page.

rtype:

dict

returns:

Response Syntax

{
    'nextToken': 'string',
    'dataViews': [
        {
            'dataViewId': 'string',
            'dataViewArn': 'string',
            'datasetId': 'string',
            'asOfTimestamp': 123,
            'partitionColumns': [
                'string',
            ],
            'sortColumns': [
                'string',
            ],
            'status': 'RUNNING'|'STARTING'|'FAILED'|'CANCELLED'|'TIMEOUT'|'SUCCESS'|'PENDING'|'FAILED_CLEANUP_FAILED',
            'errorInfo': {
                'errorMessage': 'string',
                'errorCategory': 'VALIDATION'|'SERVICE_QUOTA_EXCEEDED'|'ACCESS_DENIED'|'RESOURCE_NOT_FOUND'|'THROTTLING'|'INTERNAL_SERVICE_EXCEPTION'|'CANCELLED'|'USER_RECOVERABLE'
            },
            'destinationTypeProperties': {
                'destinationType': 'string'
            },
            'autoUpdate': True|False,
            'createTime': 123,
            'lastModifiedTime': 123
        },
    ]
}

Response Structure

  • (dict) --

    • nextToken (string) --

      A token indicating where a results page should begin.

    • dataViews (list) --

      A list of Dataviews.

      • (dict) --

        Structure for the summary of a Dataview.

        • dataViewId (string) --

          The unique identifier for the Dataview.

        • dataViewArn (string) --

          The ARN identifier of the Dataview.

        • datasetId (string) --

          Th unique identifier for the Dataview Dataset.

        • asOfTimestamp (integer) --

          Time range to use for the Dataview. The value is determined as Epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

        • partitionColumns (list) --

          Ordered set of column names used to partition data.

          • (string) -- 1 - 255 character String

        • sortColumns (list) --

          Columns to be used for sorting the data.

          • (string) -- 1 - 255 character String

        • status (string) --

          The status of a Dataview creation.

          • RUNNING - Dataview creation is running.

          • STARTING - Dataview creation is starting.

          • FAILED - Dataview creation has failed.

          • CANCELLED - Dataview creation has been cancelled.

          • TIMEOUT - Dataview creation has timed out.

          • SUCCESS - Dataview creation has succeeded.

          • PENDING - Dataview creation is pending.

          • FAILED_CLEANUP_FAILED - Dataview creation failed and resource cleanup failed.

        • errorInfo (dict) --

          The structure with error messages.

          • errorMessage (string) --

            The text of the error message.

          • errorCategory (string) --

            The category of the error.

            • VALIDATION -The inputs to this request are invalid.

            • SERVICE_QUOTA_EXCEEDED - Service quotas have been exceeded. Please contact AWS support to increase quotas.

            • ACCESS_DENIED - Missing required permission to perform this request.

            • RESOURCE_NOT_FOUND - One or more inputs to this request were not found.

            • THROTTLING - The system temporarily lacks sufficient resources to process the request.

            • INTERNAL_SERVICE_EXCEPTION - An internal service error has occurred.

            • CANCELLED - Cancelled.

            • USER_RECOVERABLE - A user recoverable error has occurred.

        • destinationTypeProperties (dict) --

          Information about the Dataview destination.

          • destinationType (string) --

            Destination type for a Dataview.

            • GLUE_TABLE - Glue table destination type.

        • autoUpdate (boolean) --

          The flag to indicate Dataview should be updated automatically.

        • createTime (integer) --

          The timestamp at which the Dataview was created in FinSpace. The value is determined as Epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

        • lastModifiedTime (integer) --

          The last time that a Dataview was modified. The value is determined as Epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

UpdateChangeset (new) Link ¶

Updates a FinSpace Changeset.

See also: AWS API Documentation

Request Syntax

client.update_changeset(
    clientToken='string',
    datasetId='string',
    changesetId='string',
    sourceParams={
        'string': 'string'
    },
    formatParams={
        'string': 'string'
    }
)
type clientToken:

string

param clientToken:

A token used to ensure idempotency.

This field is autopopulated if not provided.

type datasetId:

string

param datasetId:

[REQUIRED]

The unique identifier for the FinSpace Dataset in which the Changeset is created.

type changesetId:

string

param changesetId:

[REQUIRED]

The unique identifier for the Changeset to update.

type sourceParams:

dict

param sourceParams:

[REQUIRED]

Options that define the location of the data being ingested.

  • (string) --

    • (string) --

type formatParams:

dict

param formatParams:

[REQUIRED]

Options that define the structure of the source file(s).

  • (string) --

    • (string) --

rtype:

dict

returns:

Response Syntax

{
    'changesetId': 'string',
    'datasetId': 'string'
}

Response Structure

  • (dict) -- The response from a update changeset operation.

    • changesetId (string) --

      The unique identifier for the Changeset to update.

    • datasetId (string) --

      The unique identifier for the FinSpace Dataset in which the Changeset is created.

ListDatasets (new) Link ¶

Lists all of the active Datasets that a user has access to.

See also: AWS API Documentation

Request Syntax

client.list_datasets(
    nextToken='string',
    maxResults=123
)
type nextToken:

string

param nextToken:

A token indicating where a results page should begin.

type maxResults:

integer

param maxResults:

The maximum number of results per page.

rtype:

dict

returns:

Response Syntax

{
    'datasets': [
        {
            'datasetId': 'string',
            'datasetArn': 'string',
            'datasetTitle': 'string',
            'kind': 'TABULAR'|'NON_TABULAR',
            'datasetDescription': 'string',
            'ownerInfo': {
                'name': 'string',
                'phoneNumber': 'string',
                'email': 'string'
            },
            'createTime': 123,
            'lastModifiedTime': 123,
            'schemaDefinition': {
                'tabularSchemaConfig': {
                    'columns': [
                        {
                            'dataType': 'STRING'|'CHAR'|'INTEGER'|'TINYINT'|'SMALLINT'|'BIGINT'|'FLOAT'|'DOUBLE'|'DATE'|'DATETIME'|'BOOLEAN'|'BINARY',
                            'columnName': 'string',
                            'columnDescription': 'string'
                        },
                    ],
                    'primaryKeyColumns': [
                        'string',
                    ]
                }
            },
            'alias': 'string'
        },
    ],
    'nextToken': 'string'
}

Response Structure

  • (dict) -- Response for the ListDatasets operation

    • datasets (list) --

      List of Datasets.

      • (dict) --

        The structure for a Dataset.

        • datasetId (string) --

          An identifier for a Dataset.

        • datasetArn (string) --

          The ARN identifier of the Dataset.

        • datasetTitle (string) --

          Display title for a Dataset.

        • kind (string) --

          The format in which Dataset data is structured.

          • TABULAR - Data is structured in a tabular format.

          • NON_TABULAR - Data is structured in a non-tabular format.

        • datasetDescription (string) --

          Description for a Dataset.

        • ownerInfo (dict) --

          Contact information for a Dataset owner.

          • name (string) --

            Name of the Dataset owner.

          • phoneNumber (string) --

            Phone number for the Dataset owner.

          • email (string) --

            Email address for the Dataset owner.

        • createTime (integer) --

          The timestamp at which the Dataset was created in FinSpace. The value is determined as Epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

        • lastModifiedTime (integer) --

          The last time that the Dataset was modified. The value is determined as Epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

        • schemaDefinition (dict) --

          Definition for a schema on a tabular Dataset.

          • tabularSchemaConfig (dict) --

            The configuration for a schema on a tabular Dataset.

            • columns (list) --

              List of column definitions.

              • (dict) --

                The definition of a column in a tabular Dataset.

                • dataType (string) --

                  Data type of a column.

                  • STRING - A String data type. CHAR - A char data type. INTEGER - An integer data type. TINYINT - A tinyint data type. SMALLINT - A smallint data type. BIGINT - A bigint data type. FLOAT - A float data type. DOUBLE - A double data type. DATE - A date data type. DATETIME - A datetime data type. BOOLEAN - A boolean data type. BINARY - A binary data type.

                • columnName (string) --

                  Name for a column.

                • columnDescription (string) --

                  Description for a column.

            • primaryKeyColumns (list) --

              List of column names used for primary key.

              • (string) -- Column Name

        • alias (string) --

          The unique resource identifier for a Dataset.

    • nextToken (string) --

      A token indicating where a results page should begin.

CreateDataset (new) Link ¶

Creates a new FinSpace Dataset.

See also: AWS API Documentation

Request Syntax

client.create_dataset(
    clientToken='string',
    datasetTitle='string',
    kind='TABULAR'|'NON_TABULAR',
    datasetDescription='string',
    ownerInfo={
        'name': 'string',
        'phoneNumber': 'string',
        'email': 'string'
    },
    permissionGroupParams={
        'permissionGroupId': 'string',
        'datasetPermissions': [
            {
                'permission': 'string'
            },
        ]
    },
    alias='string',
    schemaDefinition={
        'tabularSchemaConfig': {
            'columns': [
                {
                    'dataType': 'STRING'|'CHAR'|'INTEGER'|'TINYINT'|'SMALLINT'|'BIGINT'|'FLOAT'|'DOUBLE'|'DATE'|'DATETIME'|'BOOLEAN'|'BINARY',
                    'columnName': 'string',
                    'columnDescription': 'string'
                },
            ],
            'primaryKeyColumns': [
                'string',
            ]
        }
    }
)
type clientToken:

string

param clientToken:

A token used to ensure idempotency.

This field is autopopulated if not provided.

type datasetTitle:

string

param datasetTitle:

[REQUIRED]

Display title for a FinSpace Dataset.

type kind:

string

param kind:

[REQUIRED]

The format in which Dataset data is structured.

  • TABULAR - Data is structured in a tabular format.

  • NON_TABULAR - Data is structured in a non-tabular format.

type datasetDescription:

string

param datasetDescription:

[REQUIRED]

Description of a Dataset.

type ownerInfo:

dict

param ownerInfo:

Contact information for a Dataset owner.

  • name (string) --

    Name of the Dataset owner.

  • phoneNumber (string) --

    Phone number for the Dataset owner.

  • email (string) --

    Email address for the Dataset owner.

type permissionGroupParams:

dict

param permissionGroupParams:

[REQUIRED]

Permission group parameters for Dataset permissions.

  • permissionGroupId (string) --

    The unique identifier of the PermissionGroup.

  • datasetPermissions (list) --

    List of resource permissions.

    • (dict) --

      Resource permission for a Dataset.

      • permission (string) --

        Permission for a resource.

type alias:

string

param alias:

[REQUIRED]

The unique resource identifier for a Dataset.

type schemaDefinition:

dict

param schemaDefinition:

Definition for a schema on a tabular Dataset.

  • tabularSchemaConfig (dict) --

    The configuration for a schema on a tabular Dataset.

    • columns (list) --

      List of column definitions.

      • (dict) --

        The definition of a column in a tabular Dataset.

        • dataType (string) --

          Data type of a column.

          • STRING - A String data type. CHAR - A char data type. INTEGER - An integer data type. TINYINT - A tinyint data type. SMALLINT - A smallint data type. BIGINT - A bigint data type. FLOAT - A float data type. DOUBLE - A double data type. DATE - A date data type. DATETIME - A datetime data type. BOOLEAN - A boolean data type. BINARY - A binary data type.

        • columnName (string) --

          Name for a column.

        • columnDescription (string) --

          Description for a column.

    • primaryKeyColumns (list) --

      List of column names used for primary key.

      • (string) -- Column Name

rtype:

dict

returns:

Response Syntax

{
    'datasetId': 'string'
}

Response Structure

  • (dict) -- The response from a CreateDataset operation

    • datasetId (string) --

      The unique identifier for the created Dataset.

CreateChangeset (updated) Link ¶
Changes (request, response)
Request
{'clientToken': 'string'}
Response
{'changesetId': 'string', 'datasetId': 'string'}

Creates a new Changeset in a FinSpace Dataset.

See also: AWS API Documentation

Request Syntax

client.create_changeset(
    clientToken='string',
    datasetId='string',
    changeType='REPLACE'|'APPEND'|'MODIFY',
    sourceParams={
        'string': 'string'
    },
    formatParams={
        'string': 'string'
    }
)
type clientToken:

string

param clientToken:

A token used to ensure idempotency.

This field is autopopulated if not provided.

type datasetId:

string

param datasetId:

[REQUIRED]

The unique identifier for the FinSpace Dataset where the Changeset will be created.

type changeType:

string

param changeType:

[REQUIRED]

Option to indicate how a Changeset will be applied to a Dataset.

  • REPLACE - Changeset will be considered as a replacement to all prior loaded Changesets.

  • APPEND - Changeset will be considered as an addition to the end of all prior loaded Changesets.

  • MODIFY - Changeset is considered as a replacement to a specific prior ingested Changeset.

type sourceParams:

dict

param sourceParams:

[REQUIRED]

Options that define the location of the data being ingested.

  • (string) --

    • (string) --

type formatParams:

dict

param formatParams:

[REQUIRED]

Options that define the structure of the source file(s) including the format type ( formatType), header row ( withHeader), data separation character ( separator) and the type of compression ( compression).

formatType is a required attribute and can have the following values:

  • PARQUET - Parquet source file format.

  • CSV - CSV source file format.

  • JSON - JSON source file format.

  • XML - XML source file format.

For example, you could specify the following for formatParams:

  • (string) --

    • (string) --

rtype:

dict

returns:

Response Syntax

{
    'datasetId': 'string',
    'changesetId': 'string'
}

Response Structure

  • (dict) -- The response from a CreateChangeset operation.

    • datasetId (string) --

      The unique identifier for the FinSpace Dataset where the Changeset is created.

    • changesetId (string) --

      The unique identifier of the Changeset that is created.