FinSpace User Environment Management service

2024/03/19 - FinSpace User Environment Management service - 7 updated api methods

Changes  Adding new attributes readWrite and onDemand to dataview models for Database Maintenance operations.

CreateKxCluster (updated) Link ¶
Changes (both)
{'databases': {'dataviewConfiguration': {'segmentConfigurations': {'onDemand': 'boolean'}}}}

Creates a new kdb cluster.

See also: AWS API Documentation

Request Syntax

client.create_kx_cluster(
    clientToken='string',
    environmentId='string',
    clusterName='string',
    clusterType='HDB'|'RDB'|'GATEWAY'|'GP'|'TICKERPLANT',
    tickerplantLogConfiguration={
        'tickerplantLogVolumes': [
            'string',
        ]
    },
    databases=[
        {
            'databaseName': 'string',
            'cacheConfigurations': [
                {
                    'cacheType': 'string',
                    'dbPaths': [
                        'string',
                    ],
                    'dataviewName': 'string'
                },
            ],
            'changesetId': 'string',
            'dataviewName': 'string',
            'dataviewConfiguration': {
                'dataviewName': 'string',
                'dataviewVersionId': 'string',
                'changesetId': 'string',
                'segmentConfigurations': [
                    {
                        'dbPaths': [
                            'string',
                        ],
                        'volumeName': 'string',
                        'onDemand': True|False
                    },
                ]
            }
        },
    ],
    cacheStorageConfigurations=[
        {
            'type': 'string',
            'size': 123
        },
    ],
    autoScalingConfiguration={
        'minNodeCount': 123,
        'maxNodeCount': 123,
        'autoScalingMetric': 'CPU_UTILIZATION_PERCENTAGE',
        'metricTarget': 123.0,
        'scaleInCooldownSeconds': 123.0,
        'scaleOutCooldownSeconds': 123.0
    },
    clusterDescription='string',
    capacityConfiguration={
        'nodeType': 'string',
        'nodeCount': 123
    },
    releaseLabel='string',
    vpcConfiguration={
        'vpcId': 'string',
        'securityGroupIds': [
            'string',
        ],
        'subnetIds': [
            'string',
        ],
        'ipAddressType': 'IP_V4'
    },
    initializationScript='string',
    commandLineArguments=[
        {
            'key': 'string',
            'value': 'string'
        },
    ],
    code={
        's3Bucket': 'string',
        's3Key': 'string',
        's3ObjectVersion': 'string'
    },
    executionRole='string',
    savedownStorageConfiguration={
        'type': 'SDS01',
        'size': 123,
        'volumeName': 'string'
    },
    azMode='SINGLE'|'MULTI',
    availabilityZoneId='string',
    tags={
        'string': 'string'
    },
    scalingGroupConfiguration={
        'scalingGroupName': 'string',
        'memoryLimit': 123,
        'memoryReservation': 123,
        'nodeCount': 123,
        'cpu': 123.0
    }
)
type clientToken:

string

param clientToken:

A token that ensures idempotency. This token expires in 10 minutes.

This field is autopopulated if not provided.

type environmentId:

string

param environmentId:

[REQUIRED]

A unique identifier for the kdb environment.

type clusterName:

string

param clusterName:

[REQUIRED]

A unique name for the cluster that you want to create.

type clusterType:

string

param clusterType:

[REQUIRED]

Specifies the type of KDB database that is being created. The following types are available:

  • HDB – A Historical Database. The data is only accessible with read-only permissions from one of the FinSpace managed kdb databases mounted to the cluster.

  • RDB – A Realtime Database. This type of database captures all the data from a ticker plant and stores it in memory until the end of day, after which it writes all of its data to a disk and reloads the HDB. This cluster type requires local storage for temporary storage of data during the savedown process. If you specify this field in your request, you must provide the savedownStorageConfiguration parameter.

  • GATEWAY – A gateway cluster allows you to access data across processes in kdb systems. It allows you to create your own routing logic using the initialization scripts and custom code. This type of cluster does not require a writable local storage.

  • GP – A general purpose cluster allows you to quickly iterate on code during development by granting greater access to system commands and enabling a fast reload of custom code. This cluster type can optionally mount databases including cache and savedown storage. For this cluster type, the node count is fixed at 1. It does not support autoscaling and supports only SINGLE AZ mode.

  • Tickerplant – A tickerplant cluster allows you to subscribe to feed handlers based on IAM permissions. It can publish to RDBs, other Tickerplants, and real-time subscribers (RTS). Tickerplants can persist messages to log, which is readable by any RDB environment. It supports only single-node that is only one kdb process.

type tickerplantLogConfiguration:

dict

param tickerplantLogConfiguration:

A configuration to store Tickerplant logs. It consists of a list of volumes that will be mounted to your cluster. For the cluster type Tickerplant, the location of the TP volume on the cluster will be available by using the global variable .aws.tp_log_path.

  • tickerplantLogVolumes (list) --

    The name of the volumes for tickerplant logs.

    • (string) --

type databases:

list

param databases:

A list of databases that will be available for querying.

  • (dict) --

    The configuration of data that is available for querying from this database.

    • databaseName (string) -- [REQUIRED]

      The name of the kdb database. When this parameter is specified in the structure, S3 with the whole database is included by default.

    • cacheConfigurations (list) --

      Configuration details for the disk cache used to increase performance reading from a kdb database mounted to the cluster.

      • (dict) --

        The structure of database cache configuration that is used for mapping database paths to cache types in clusters.

        • cacheType (string) -- [REQUIRED]

          The type of disk cache. This parameter is used to map the database path to cache storage. The valid values are:

          • CACHE_1000 – This type provides at least 1000 MB/s disk access throughput.

        • dbPaths (list) -- [REQUIRED]

          Specifies the portions of database that will be loaded into the cache for access.

          • (string) --

        • dataviewName (string) --

          The name of the dataview to be used for caching historical data on disk.

    • changesetId (string) --

      A unique identifier of the changeset that is associated with the cluster.

    • dataviewName (string) --

      The name of the dataview to be used for caching historical data on disk.

    • dataviewConfiguration (dict) --

      The configuration of the dataview to be used with specified cluster.

      • dataviewName (string) --

        The unique identifier of the dataview.

      • dataviewVersionId (string) --

        The version of the dataview corresponding to a given changeset.

      • changesetId (string) --

        A unique identifier for the changeset.

      • segmentConfigurations (list) --

        The db path and volume configuration for the segmented database.

        • (dict) --

          The configuration that contains the database path of the data that you want to place on each selected volume. Each segment must have a unique database path for each volume. If you do not explicitly specify any database path for a volume, they are accessible from the cluster through the default S3/object store segment.

          • dbPaths (list) -- [REQUIRED]

            The database path of the data that you want to place on each selected volume for the segment. Each segment must have a unique database path for each volume.

            • (string) --

          • volumeName (string) -- [REQUIRED]

            The name of the volume where you want to add data.

          • onDemand (boolean) --

            Enables on-demand caching on the selected database path when a particular file or a column of the database is accessed. When on demand caching is True, dataviews perform minimal loading of files on the filesystem as needed. When it is set to False, everything is cached. The default value is False.

type cacheStorageConfigurations:

list

param cacheStorageConfigurations:

The configurations for a read only cache storage associated with a cluster. This cache will be stored as an FSx Lustre that reads from the S3 store.

  • (dict) --

    The configuration for read only disk cache associated with a cluster.

    • type (string) -- [REQUIRED]

      The type of cache storage. The valid values are:

      • CACHE_1000 – This type provides at least 1000 MB/s disk access throughput.

      • CACHE_250 – This type provides at least 250 MB/s disk access throughput.

      • CACHE_12 – This type provides at least 12 MB/s disk access throughput.

      For cache type CACHE_1000 and CACHE_250 you can select cache size as 1200 GB or increments of 2400 GB. For cache type CACHE_12 you can select the cache size in increments of 6000 GB.

    • size (integer) -- [REQUIRED]

      The size of cache in Gigabytes.

type autoScalingConfiguration:

dict

param autoScalingConfiguration:

The configuration based on which FinSpace will scale in or scale out nodes in your cluster.

  • minNodeCount (integer) --

    The lowest number of nodes to scale. This value must be at least 1 and less than the maxNodeCount. If the nodes in a cluster belong to multiple availability zones, then minNodeCount must be at least 3.

  • maxNodeCount (integer) --

    The highest number of nodes to scale. This value cannot be greater than 5.

  • autoScalingMetric (string) --

    The metric your cluster will track in order to scale in and out. For example, CPU_UTILIZATION_PERCENTAGE is the average CPU usage across all the nodes in a cluster.

  • metricTarget (float) --

    The desired value of the chosen autoScalingMetric. When the metric drops below this value, the cluster will scale in. When the metric goes above this value, the cluster will scale out. You can set the target value between 1 and 100 percent.

  • scaleInCooldownSeconds (float) --

    The duration in seconds that FinSpace will wait after a scale in event before initiating another scaling event.

  • scaleOutCooldownSeconds (float) --

    The duration in seconds that FinSpace will wait after a scale out event before initiating another scaling event.

type clusterDescription:

string

param clusterDescription:

A description of the cluster.

type capacityConfiguration:

dict

param capacityConfiguration:

A structure for the metadata of a cluster. It includes information like the CPUs needed, memory of instances, and number of instances.

  • nodeType (string) --

    The type that determines the hardware of the host computer used for your cluster instance. Each node type offers different memory and storage capabilities. Choose a node type based on the requirements of the application or software that you plan to run on your instance.

    You can only specify one of the following values:

    • kx.s.large – The node type with a configuration of 12 GiB memory and 2 vCPUs.

    • kx.s.xlarge – The node type with a configuration of 27 GiB memory and 4 vCPUs.

    • kx.s.2xlarge – The node type with a configuration of 54 GiB memory and 8 vCPUs.

    • kx.s.4xlarge – The node type with a configuration of 108 GiB memory and 16 vCPUs.

    • kx.s.8xlarge – The node type with a configuration of 216 GiB memory and 32 vCPUs.

    • kx.s.16xlarge – The node type with a configuration of 432 GiB memory and 64 vCPUs.

    • kx.s.32xlarge – The node type with a configuration of 864 GiB memory and 128 vCPUs.

  • nodeCount (integer) --

    The number of instances running in a cluster.

type releaseLabel:

string

param releaseLabel:

[REQUIRED]

The version of FinSpace managed kdb to run.

type vpcConfiguration:

dict

param vpcConfiguration:

[REQUIRED]

Configuration details about the network where the Privatelink endpoint of the cluster resides.

  • vpcId (string) --

    The identifier of the VPC endpoint.

  • securityGroupIds (list) --

    The unique identifier of the VPC security group applied to the VPC endpoint ENI for the cluster.

    • (string) --

  • subnetIds (list) --

    The identifier of the subnet that the Privatelink VPC endpoint uses to connect to the cluster.

    • (string) --

  • ipAddressType (string) --

    The IP address type for cluster network configuration parameters. The following type is available:

    • IP_V4 – IP address version 4

type initializationScript:

string

param initializationScript:

Specifies a Q program that will be run at launch of a cluster. It is a relative path within .zip file that contains the custom code, which will be loaded on the cluster. It must include the file name itself. For example, somedir/init.q.

type commandLineArguments:

list

param commandLineArguments:

Defines the key-value pairs to make them available inside the cluster.

  • (dict) --

    Defines the key-value pairs to make them available inside the cluster.

    • key (string) --

      The name of the key.

    • value (string) --

      The value of the key.

type code:

dict

param code:

The details of the custom code that you want to use inside a cluster when analyzing a data. It consists of the S3 source bucket, location, S3 object version, and the relative path from where the custom code is loaded into the cluster.

  • s3Bucket (string) --

    A unique name for the S3 bucket.

  • s3Key (string) --

    The full S3 path (excluding bucket) to the .zip file. This file contains the code that is loaded onto the cluster when it's started.

  • s3ObjectVersion (string) --

    The version of an S3 object.

type executionRole:

string

param executionRole:

An IAM role that defines a set of permissions associated with a cluster. These permissions are assumed when a cluster attempts to access another cluster.

type savedownStorageConfiguration:

dict

param savedownStorageConfiguration:

The size and type of the temporary storage that is used to hold data during the savedown process. This parameter is required when you choose clusterType as RDB. All the data written to this storage space is lost when the cluster node is restarted.

  • type (string) --

    The type of writeable storage space for temporarily storing your savedown data. The valid values are:

    • SDS01 – This type represents 3000 IOPS and io2 ebs volume type.

  • size (integer) --

    The size of temporary storage in gibibytes.

  • volumeName (string) --

    The name of the kdb volume that you want to use as writeable save-down storage for clusters.

type azMode:

string

param azMode:

[REQUIRED]

The number of availability zones you want to assign per cluster. This can be one of the following

  • SINGLE – Assigns one availability zone per cluster.

  • MULTI – Assigns all the availability zones per cluster.

type availabilityZoneId:

string

param availabilityZoneId:

The availability zone identifiers for the requested regions.

type tags:

dict

param tags:

A list of key-value pairs to label the cluster. You can add up to 50 tags to a cluster.

  • (string) --

    • (string) --

type scalingGroupConfiguration:

dict

param scalingGroupConfiguration:

The structure that stores the configuration details of a scaling group.

  • scalingGroupName (string) -- [REQUIRED]

    A unique identifier for the kdb scaling group.

  • memoryLimit (integer) --

    An optional hard limit on the amount of memory a kdb cluster can use.

  • memoryReservation (integer) -- [REQUIRED]

    A reservation of the minimum amount of memory that should be available on the scaling group for a kdb cluster to be successfully placed in a scaling group.

  • nodeCount (integer) -- [REQUIRED]

    The number of kdb cluster nodes.

  • cpu (float) --

    The number of vCPUs that you want to reserve for each node of this kdb cluster on the scaling group host.

rtype:

dict

returns:

Response Syntax

{
    'environmentId': 'string',
    'status': 'PENDING'|'CREATING'|'CREATE_FAILED'|'RUNNING'|'UPDATING'|'DELETING'|'DELETED'|'DELETE_FAILED',
    'statusReason': 'string',
    'clusterName': 'string',
    'clusterType': 'HDB'|'RDB'|'GATEWAY'|'GP'|'TICKERPLANT',
    'tickerplantLogConfiguration': {
        'tickerplantLogVolumes': [
            'string',
        ]
    },
    'volumes': [
        {
            'volumeName': 'string',
            'volumeType': 'NAS_1'
        },
    ],
    'databases': [
        {
            'databaseName': 'string',
            'cacheConfigurations': [
                {
                    'cacheType': 'string',
                    'dbPaths': [
                        'string',
                    ],
                    'dataviewName': 'string'
                },
            ],
            'changesetId': 'string',
            'dataviewName': 'string',
            'dataviewConfiguration': {
                'dataviewName': 'string',
                'dataviewVersionId': 'string',
                'changesetId': 'string',
                'segmentConfigurations': [
                    {
                        'dbPaths': [
                            'string',
                        ],
                        'volumeName': 'string',
                        'onDemand': True|False
                    },
                ]
            }
        },
    ],
    'cacheStorageConfigurations': [
        {
            'type': 'string',
            'size': 123
        },
    ],
    'autoScalingConfiguration': {
        'minNodeCount': 123,
        'maxNodeCount': 123,
        'autoScalingMetric': 'CPU_UTILIZATION_PERCENTAGE',
        'metricTarget': 123.0,
        'scaleInCooldownSeconds': 123.0,
        'scaleOutCooldownSeconds': 123.0
    },
    'clusterDescription': 'string',
    'capacityConfiguration': {
        'nodeType': 'string',
        'nodeCount': 123
    },
    'releaseLabel': 'string',
    'vpcConfiguration': {
        'vpcId': 'string',
        'securityGroupIds': [
            'string',
        ],
        'subnetIds': [
            'string',
        ],
        'ipAddressType': 'IP_V4'
    },
    'initializationScript': 'string',
    'commandLineArguments': [
        {
            'key': 'string',
            'value': 'string'
        },
    ],
    'code': {
        's3Bucket': 'string',
        's3Key': 'string',
        's3ObjectVersion': 'string'
    },
    'executionRole': 'string',
    'lastModifiedTimestamp': datetime(2015, 1, 1),
    'savedownStorageConfiguration': {
        'type': 'SDS01',
        'size': 123,
        'volumeName': 'string'
    },
    'azMode': 'SINGLE'|'MULTI',
    'availabilityZoneId': 'string',
    'createdTimestamp': datetime(2015, 1, 1),
    'scalingGroupConfiguration': {
        'scalingGroupName': 'string',
        'memoryLimit': 123,
        'memoryReservation': 123,
        'nodeCount': 123,
        'cpu': 123.0
    }
}

Response Structure

  • (dict) --

    • environmentId (string) --

      A unique identifier for the kdb environment.

    • status (string) --

      The status of cluster creation.

      • PENDING – The cluster is pending creation.

      • CREATING – The cluster creation process is in progress.

      • CREATE_FAILED – The cluster creation process has failed.

      • RUNNING – The cluster creation process is running.

      • UPDATING – The cluster is in the process of being updated.

      • DELETING – The cluster is in the process of being deleted.

      • DELETED – The cluster has been deleted.

      • DELETE_FAILED – The cluster failed to delete.

    • statusReason (string) --

      The error message when a failed state occurs.

    • clusterName (string) --

      A unique name for the cluster.

    • clusterType (string) --

      Specifies the type of KDB database that is being created. The following types are available:

      • HDB – A Historical Database. The data is only accessible with read-only permissions from one of the FinSpace managed kdb databases mounted to the cluster.

      • RDB – A Realtime Database. This type of database captures all the data from a ticker plant and stores it in memory until the end of day, after which it writes all of its data to a disk and reloads the HDB. This cluster type requires local storage for temporary storage of data during the savedown process. If you specify this field in your request, you must provide the savedownStorageConfiguration parameter.

      • GATEWAY – A gateway cluster allows you to access data across processes in kdb systems. It allows you to create your own routing logic using the initialization scripts and custom code. This type of cluster does not require a writable local storage.

      • GP – A general purpose cluster allows you to quickly iterate on code during development by granting greater access to system commands and enabling a fast reload of custom code. This cluster type can optionally mount databases including cache and savedown storage. For this cluster type, the node count is fixed at 1. It does not support autoscaling and supports only SINGLE AZ mode.

      • Tickerplant – A tickerplant cluster allows you to subscribe to feed handlers based on IAM permissions. It can publish to RDBs, other Tickerplants, and real-time subscribers (RTS). Tickerplants can persist messages to log, which is readable by any RDB environment. It supports only single-node that is only one kdb process.

    • tickerplantLogConfiguration (dict) --

      A configuration to store the Tickerplant logs. It consists of a list of volumes that will be mounted to your cluster. For the cluster type Tickerplant, the location of the TP volume on the cluster will be available by using the global variable .aws.tp_log_path.

      • tickerplantLogVolumes (list) --

        The name of the volumes for tickerplant logs.

        • (string) --

    • volumes (list) --

      A list of volumes mounted on the cluster.

      • (dict) --

        The structure that consists of name and type of volume.

        • volumeName (string) --

          A unique identifier for the volume.

        • volumeType (string) --

          The type of file system volume. Currently, FinSpace only supports NAS_1 volume type.

    • databases (list) --

      A list of databases that will be available for querying.

      • (dict) --

        The configuration of data that is available for querying from this database.

        • databaseName (string) --

          The name of the kdb database. When this parameter is specified in the structure, S3 with the whole database is included by default.

        • cacheConfigurations (list) --

          Configuration details for the disk cache used to increase performance reading from a kdb database mounted to the cluster.

          • (dict) --

            The structure of database cache configuration that is used for mapping database paths to cache types in clusters.

            • cacheType (string) --

              The type of disk cache. This parameter is used to map the database path to cache storage. The valid values are:

              • CACHE_1000 – This type provides at least 1000 MB/s disk access throughput.

            • dbPaths (list) --

              Specifies the portions of database that will be loaded into the cache for access.

              • (string) --

            • dataviewName (string) --

              The name of the dataview to be used for caching historical data on disk.

        • changesetId (string) --

          A unique identifier of the changeset that is associated with the cluster.

        • dataviewName (string) --

          The name of the dataview to be used for caching historical data on disk.

        • dataviewConfiguration (dict) --

          The configuration of the dataview to be used with specified cluster.

          • dataviewName (string) --

            The unique identifier of the dataview.

          • dataviewVersionId (string) --

            The version of the dataview corresponding to a given changeset.

          • changesetId (string) --

            A unique identifier for the changeset.

          • segmentConfigurations (list) --

            The db path and volume configuration for the segmented database.

            • (dict) --

              The configuration that contains the database path of the data that you want to place on each selected volume. Each segment must have a unique database path for each volume. If you do not explicitly specify any database path for a volume, they are accessible from the cluster through the default S3/object store segment.

              • dbPaths (list) --

                The database path of the data that you want to place on each selected volume for the segment. Each segment must have a unique database path for each volume.

                • (string) --

              • volumeName (string) --

                The name of the volume where you want to add data.

              • onDemand (boolean) --

                Enables on-demand caching on the selected database path when a particular file or a column of the database is accessed. When on demand caching is True, dataviews perform minimal loading of files on the filesystem as needed. When it is set to False, everything is cached. The default value is False.

    • cacheStorageConfigurations (list) --

      The configurations for a read only cache storage associated with a cluster. This cache will be stored as an FSx Lustre that reads from the S3 store.

      • (dict) --

        The configuration for read only disk cache associated with a cluster.

        • type (string) --

          The type of cache storage. The valid values are:

          • CACHE_1000 – This type provides at least 1000 MB/s disk access throughput.

          • CACHE_250 – This type provides at least 250 MB/s disk access throughput.

          • CACHE_12 – This type provides at least 12 MB/s disk access throughput.

          For cache type CACHE_1000 and CACHE_250 you can select cache size as 1200 GB or increments of 2400 GB. For cache type CACHE_12 you can select the cache size in increments of 6000 GB.

        • size (integer) --

          The size of cache in Gigabytes.

    • autoScalingConfiguration (dict) --

      The configuration based on which FinSpace will scale in or scale out nodes in your cluster.

      • minNodeCount (integer) --

        The lowest number of nodes to scale. This value must be at least 1 and less than the maxNodeCount. If the nodes in a cluster belong to multiple availability zones, then minNodeCount must be at least 3.

      • maxNodeCount (integer) --

        The highest number of nodes to scale. This value cannot be greater than 5.

      • autoScalingMetric (string) --

        The metric your cluster will track in order to scale in and out. For example, CPU_UTILIZATION_PERCENTAGE is the average CPU usage across all the nodes in a cluster.

      • metricTarget (float) --

        The desired value of the chosen autoScalingMetric. When the metric drops below this value, the cluster will scale in. When the metric goes above this value, the cluster will scale out. You can set the target value between 1 and 100 percent.

      • scaleInCooldownSeconds (float) --

        The duration in seconds that FinSpace will wait after a scale in event before initiating another scaling event.

      • scaleOutCooldownSeconds (float) --

        The duration in seconds that FinSpace will wait after a scale out event before initiating another scaling event.

    • clusterDescription (string) --

      A description of the cluster.

    • capacityConfiguration (dict) --

      A structure for the metadata of a cluster. It includes information like the CPUs needed, memory of instances, and number of instances.

      • nodeType (string) --

        The type that determines the hardware of the host computer used for your cluster instance. Each node type offers different memory and storage capabilities. Choose a node type based on the requirements of the application or software that you plan to run on your instance.

        You can only specify one of the following values:

        • kx.s.large – The node type with a configuration of 12 GiB memory and 2 vCPUs.

        • kx.s.xlarge – The node type with a configuration of 27 GiB memory and 4 vCPUs.

        • kx.s.2xlarge – The node type with a configuration of 54 GiB memory and 8 vCPUs.

        • kx.s.4xlarge – The node type with a configuration of 108 GiB memory and 16 vCPUs.

        • kx.s.8xlarge – The node type with a configuration of 216 GiB memory and 32 vCPUs.

        • kx.s.16xlarge – The node type with a configuration of 432 GiB memory and 64 vCPUs.

        • kx.s.32xlarge – The node type with a configuration of 864 GiB memory and 128 vCPUs.

      • nodeCount (integer) --

        The number of instances running in a cluster.

    • releaseLabel (string) --

      A version of the FinSpace managed kdb to run.

    • vpcConfiguration (dict) --

      Configuration details about the network where the Privatelink endpoint of the cluster resides.

      • vpcId (string) --

        The identifier of the VPC endpoint.

      • securityGroupIds (list) --

        The unique identifier of the VPC security group applied to the VPC endpoint ENI for the cluster.

        • (string) --

      • subnetIds (list) --

        The identifier of the subnet that the Privatelink VPC endpoint uses to connect to the cluster.

        • (string) --

      • ipAddressType (string) --

        The IP address type for cluster network configuration parameters. The following type is available:

        • IP_V4 – IP address version 4

    • initializationScript (string) --

      Specifies a Q program that will be run at launch of a cluster. It is a relative path within .zip file that contains the custom code, which will be loaded on the cluster. It must include the file name itself. For example, somedir/init.q.

    • commandLineArguments (list) --

      Defines the key-value pairs to make them available inside the cluster.

      • (dict) --

        Defines the key-value pairs to make them available inside the cluster.

        • key (string) --

          The name of the key.

        • value (string) --

          The value of the key.

    • code (dict) --

      The details of the custom code that you want to use inside a cluster when analyzing a data. It consists of the S3 source bucket, location, S3 object version, and the relative path from where the custom code is loaded into the cluster.

      • s3Bucket (string) --

        A unique name for the S3 bucket.

      • s3Key (string) --

        The full S3 path (excluding bucket) to the .zip file. This file contains the code that is loaded onto the cluster when it's started.

      • s3ObjectVersion (string) --

        The version of an S3 object.

    • executionRole (string) --

      An IAM role that defines a set of permissions associated with a cluster. These permissions are assumed when a cluster attempts to access another cluster.

    • lastModifiedTimestamp (datetime) --

      The last time that the cluster was modified. The value is determined as epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

    • savedownStorageConfiguration (dict) --

      The size and type of the temporary storage that is used to hold data during the savedown process. This parameter is required when you choose clusterType as RDB. All the data written to this storage space is lost when the cluster node is restarted.

      • type (string) --

        The type of writeable storage space for temporarily storing your savedown data. The valid values are:

        • SDS01 – This type represents 3000 IOPS and io2 ebs volume type.

      • size (integer) --

        The size of temporary storage in gibibytes.

      • volumeName (string) --

        The name of the kdb volume that you want to use as writeable save-down storage for clusters.

    • azMode (string) --

      The number of availability zones you want to assign per cluster. This can be one of the following

      • SINGLE – Assigns one availability zone per cluster.

      • MULTI – Assigns all the availability zones per cluster.

    • availabilityZoneId (string) --

      The availability zone identifiers for the requested regions.

    • createdTimestamp (datetime) --

      The timestamp at which the cluster was created in FinSpace. The value is determined as epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

    • scalingGroupConfiguration (dict) --

      The structure that stores the configuration details of a scaling group.

      • scalingGroupName (string) --

        A unique identifier for the kdb scaling group.

      • memoryLimit (integer) --

        An optional hard limit on the amount of memory a kdb cluster can use.

      • memoryReservation (integer) --

        A reservation of the minimum amount of memory that should be available on the scaling group for a kdb cluster to be successfully placed in a scaling group.

      • nodeCount (integer) --

        The number of kdb cluster nodes.

      • cpu (float) --

        The number of vCPUs that you want to reserve for each node of this kdb cluster on the scaling group host.

CreateKxDataview (updated) Link ¶
Changes (both)
{'readWrite': 'boolean', 'segmentConfigurations': {'onDemand': 'boolean'}}

Creates a snapshot of kdb database with tiered storage capabilities and a pre-warmed cache, ready for mounting on kdb clusters. Dataviews are only available for clusters running on a scaling group. They are not supported on dedicated clusters.

See also: AWS API Documentation

Request Syntax

client.create_kx_dataview(
    environmentId='string',
    databaseName='string',
    dataviewName='string',
    azMode='SINGLE'|'MULTI',
    availabilityZoneId='string',
    changesetId='string',
    segmentConfigurations=[
        {
            'dbPaths': [
                'string',
            ],
            'volumeName': 'string',
            'onDemand': True|False
        },
    ],
    autoUpdate=True|False,
    readWrite=True|False,
    description='string',
    tags={
        'string': 'string'
    },
    clientToken='string'
)
type environmentId:

string

param environmentId:

[REQUIRED]

A unique identifier for the kdb environment, where you want to create the dataview.

type databaseName:

string

param databaseName:

[REQUIRED]

The name of the database where you want to create a dataview.

type dataviewName:

string

param dataviewName:

[REQUIRED]

A unique identifier for the dataview.

type azMode:

string

param azMode:

[REQUIRED]

The number of availability zones you want to assign per volume. Currently, FinSpace only supports SINGLE for volumes. This places dataview in a single AZ.

type availabilityZoneId:

string

param availabilityZoneId:

The identifier of the availability zones.

type changesetId:

string

param changesetId:

A unique identifier of the changeset that you want to use to ingest data.

type segmentConfigurations:

list

param segmentConfigurations:

The configuration that contains the database path of the data that you want to place on each selected volume. Each segment must have a unique database path for each volume. If you do not explicitly specify any database path for a volume, they are accessible from the cluster through the default S3/object store segment.

  • (dict) --

    The configuration that contains the database path of the data that you want to place on each selected volume. Each segment must have a unique database path for each volume. If you do not explicitly specify any database path for a volume, they are accessible from the cluster through the default S3/object store segment.

    • dbPaths (list) -- [REQUIRED]

      The database path of the data that you want to place on each selected volume for the segment. Each segment must have a unique database path for each volume.

      • (string) --

    • volumeName (string) -- [REQUIRED]

      The name of the volume where you want to add data.

    • onDemand (boolean) --

      Enables on-demand caching on the selected database path when a particular file or a column of the database is accessed. When on demand caching is True, dataviews perform minimal loading of files on the filesystem as needed. When it is set to False, everything is cached. The default value is False.

type autoUpdate:

boolean

param autoUpdate:

The option to specify whether you want to apply all the future additions and corrections automatically to the dataview, when you ingest new changesets. The default value is false.

type readWrite:

boolean

param readWrite:

The option to specify whether you want to make the dataview writable to perform database maintenance. The following are some considerations related to writable dataviews.

  • You cannot create partial writable dataviews. When you create writeable dataviews you must provide the entire database path.

  • You cannot perform updates on a writeable dataview. Hence, autoUpdate must be set as False if readWrite is True for a dataview.

  • You must also use a unique volume for creating a writeable dataview. So, if you choose a volume that is already in use by another dataview, the dataview creation fails.

  • Once you create a dataview as writeable, you cannot change it to read-only. So, you cannot update the readWrite parameter later.

type description:

string

param description:

A description of the dataview.

type tags:

dict

param tags:

A list of key-value pairs to label the dataview. You can add up to 50 tags to a dataview.

  • (string) --

    • (string) --

type clientToken:

string

param clientToken:

[REQUIRED]

A token that ensures idempotency. This token expires in 10 minutes.

This field is autopopulated if not provided.

rtype:

dict

returns:

Response Syntax

{
    'dataviewName': 'string',
    'databaseName': 'string',
    'environmentId': 'string',
    'azMode': 'SINGLE'|'MULTI',
    'availabilityZoneId': 'string',
    'changesetId': 'string',
    'segmentConfigurations': [
        {
            'dbPaths': [
                'string',
            ],
            'volumeName': 'string',
            'onDemand': True|False
        },
    ],
    'description': 'string',
    'autoUpdate': True|False,
    'readWrite': True|False,
    'createdTimestamp': datetime(2015, 1, 1),
    'lastModifiedTimestamp': datetime(2015, 1, 1),
    'status': 'CREATING'|'ACTIVE'|'UPDATING'|'FAILED'|'DELETING'
}

Response Structure

  • (dict) --

    • dataviewName (string) --

      A unique identifier for the dataview.

    • databaseName (string) --

      The name of the database where you want to create a dataview.

    • environmentId (string) --

      A unique identifier for the kdb environment, where you want to create the dataview.

    • azMode (string) --

      The number of availability zones you want to assign per volume. Currently, FinSpace only supports SINGLE for volumes. This places dataview in a single AZ.

    • availabilityZoneId (string) --

      The identifier of the availability zones.

    • changesetId (string) --

      A unique identifier for the changeset.

    • segmentConfigurations (list) --

      The configuration that contains the database path of the data that you want to place on each selected volume. Each segment must have a unique database path for each volume. If you do not explicitly specify any database path for a volume, they are accessible from the cluster through the default S3/object store segment.

      • (dict) --

        The configuration that contains the database path of the data that you want to place on each selected volume. Each segment must have a unique database path for each volume. If you do not explicitly specify any database path for a volume, they are accessible from the cluster through the default S3/object store segment.

        • dbPaths (list) --

          The database path of the data that you want to place on each selected volume for the segment. Each segment must have a unique database path for each volume.

          • (string) --

        • volumeName (string) --

          The name of the volume where you want to add data.

        • onDemand (boolean) --

          Enables on-demand caching on the selected database path when a particular file or a column of the database is accessed. When on demand caching is True, dataviews perform minimal loading of files on the filesystem as needed. When it is set to False, everything is cached. The default value is False.

    • description (string) --

      A description of the dataview.

    • autoUpdate (boolean) --

      The option to select whether you want to apply all the future additions and corrections automatically to the dataview when you ingest new changesets. The default value is false.

    • readWrite (boolean) --

      Returns True if the dataview is created as writeable and False otherwise.

    • createdTimestamp (datetime) --

      The timestamp at which the dataview was created in FinSpace. The value is determined as epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

    • lastModifiedTimestamp (datetime) --

      The last time that the dataview was updated in FinSpace. The value is determined as epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

    • status (string) --

      The status of dataview creation.

      • CREATING – The dataview creation is in progress.

      • UPDATING – The dataview is in the process of being updated.

      • ACTIVE – The dataview is active.

GetKxCluster (updated) Link ¶
Changes (response)
{'databases': {'dataviewConfiguration': {'segmentConfigurations': {'onDemand': 'boolean'}}}}

Retrieves information about a kdb cluster.

See also: AWS API Documentation

Request Syntax

client.get_kx_cluster(
    environmentId='string',
    clusterName='string'
)
type environmentId:

string

param environmentId:

[REQUIRED]

A unique identifier for the kdb environment.

type clusterName:

string

param clusterName:

[REQUIRED]

The name of the cluster that you want to retrieve.

rtype:

dict

returns:

Response Syntax

{
    'status': 'PENDING'|'CREATING'|'CREATE_FAILED'|'RUNNING'|'UPDATING'|'DELETING'|'DELETED'|'DELETE_FAILED',
    'statusReason': 'string',
    'clusterName': 'string',
    'clusterType': 'HDB'|'RDB'|'GATEWAY'|'GP'|'TICKERPLANT',
    'tickerplantLogConfiguration': {
        'tickerplantLogVolumes': [
            'string',
        ]
    },
    'volumes': [
        {
            'volumeName': 'string',
            'volumeType': 'NAS_1'
        },
    ],
    'databases': [
        {
            'databaseName': 'string',
            'cacheConfigurations': [
                {
                    'cacheType': 'string',
                    'dbPaths': [
                        'string',
                    ],
                    'dataviewName': 'string'
                },
            ],
            'changesetId': 'string',
            'dataviewName': 'string',
            'dataviewConfiguration': {
                'dataviewName': 'string',
                'dataviewVersionId': 'string',
                'changesetId': 'string',
                'segmentConfigurations': [
                    {
                        'dbPaths': [
                            'string',
                        ],
                        'volumeName': 'string',
                        'onDemand': True|False
                    },
                ]
            }
        },
    ],
    'cacheStorageConfigurations': [
        {
            'type': 'string',
            'size': 123
        },
    ],
    'autoScalingConfiguration': {
        'minNodeCount': 123,
        'maxNodeCount': 123,
        'autoScalingMetric': 'CPU_UTILIZATION_PERCENTAGE',
        'metricTarget': 123.0,
        'scaleInCooldownSeconds': 123.0,
        'scaleOutCooldownSeconds': 123.0
    },
    'clusterDescription': 'string',
    'capacityConfiguration': {
        'nodeType': 'string',
        'nodeCount': 123
    },
    'releaseLabel': 'string',
    'vpcConfiguration': {
        'vpcId': 'string',
        'securityGroupIds': [
            'string',
        ],
        'subnetIds': [
            'string',
        ],
        'ipAddressType': 'IP_V4'
    },
    'initializationScript': 'string',
    'commandLineArguments': [
        {
            'key': 'string',
            'value': 'string'
        },
    ],
    'code': {
        's3Bucket': 'string',
        's3Key': 'string',
        's3ObjectVersion': 'string'
    },
    'executionRole': 'string',
    'lastModifiedTimestamp': datetime(2015, 1, 1),
    'savedownStorageConfiguration': {
        'type': 'SDS01',
        'size': 123,
        'volumeName': 'string'
    },
    'azMode': 'SINGLE'|'MULTI',
    'availabilityZoneId': 'string',
    'createdTimestamp': datetime(2015, 1, 1),
    'scalingGroupConfiguration': {
        'scalingGroupName': 'string',
        'memoryLimit': 123,
        'memoryReservation': 123,
        'nodeCount': 123,
        'cpu': 123.0
    }
}

Response Structure

  • (dict) --

    • status (string) --

      The status of cluster creation.

      • PENDING – The cluster is pending creation.

      • CREATING – The cluster creation process is in progress.

      • CREATE_FAILED – The cluster creation process has failed.

      • RUNNING – The cluster creation process is running.

      • UPDATING – The cluster is in the process of being updated.

      • DELETING – The cluster is in the process of being deleted.

      • DELETED – The cluster has been deleted.

      • DELETE_FAILED – The cluster failed to delete.

    • statusReason (string) --

      The error message when a failed state occurs.

    • clusterName (string) --

      A unique name for the cluster.

    • clusterType (string) --

      Specifies the type of KDB database that is being created. The following types are available:

      • HDB – A Historical Database. The data is only accessible with read-only permissions from one of the FinSpace managed kdb databases mounted to the cluster.

      • RDB – A Realtime Database. This type of database captures all the data from a ticker plant and stores it in memory until the end of day, after which it writes all of its data to a disk and reloads the HDB. This cluster type requires local storage for temporary storage of data during the savedown process. If you specify this field in your request, you must provide the savedownStorageConfiguration parameter.

      • GATEWAY – A gateway cluster allows you to access data across processes in kdb systems. It allows you to create your own routing logic using the initialization scripts and custom code. This type of cluster does not require a writable local storage.

      • GP – A general purpose cluster allows you to quickly iterate on code during development by granting greater access to system commands and enabling a fast reload of custom code. This cluster type can optionally mount databases including cache and savedown storage. For this cluster type, the node count is fixed at 1. It does not support autoscaling and supports only SINGLE AZ mode.

      • Tickerplant – A tickerplant cluster allows you to subscribe to feed handlers based on IAM permissions. It can publish to RDBs, other Tickerplants, and real-time subscribers (RTS). Tickerplants can persist messages to log, which is readable by any RDB environment. It supports only single-node that is only one kdb process.

    • tickerplantLogConfiguration (dict) --

      A configuration to store the Tickerplant logs. It consists of a list of volumes that will be mounted to your cluster. For the cluster type Tickerplant, the location of the TP volume on the cluster will be available by using the global variable .aws.tp_log_path.

      • tickerplantLogVolumes (list) --

        The name of the volumes for tickerplant logs.

        • (string) --

    • volumes (list) --

      A list of volumes attached to the cluster.

      • (dict) --

        The structure that consists of name and type of volume.

        • volumeName (string) --

          A unique identifier for the volume.

        • volumeType (string) --

          The type of file system volume. Currently, FinSpace only supports NAS_1 volume type.

    • databases (list) --

      A list of databases mounted on the cluster.

      • (dict) --

        The configuration of data that is available for querying from this database.

        • databaseName (string) --

          The name of the kdb database. When this parameter is specified in the structure, S3 with the whole database is included by default.

        • cacheConfigurations (list) --

          Configuration details for the disk cache used to increase performance reading from a kdb database mounted to the cluster.

          • (dict) --

            The structure of database cache configuration that is used for mapping database paths to cache types in clusters.

            • cacheType (string) --

              The type of disk cache. This parameter is used to map the database path to cache storage. The valid values are:

              • CACHE_1000 – This type provides at least 1000 MB/s disk access throughput.

            • dbPaths (list) --

              Specifies the portions of database that will be loaded into the cache for access.

              • (string) --

            • dataviewName (string) --

              The name of the dataview to be used for caching historical data on disk.

        • changesetId (string) --

          A unique identifier of the changeset that is associated with the cluster.

        • dataviewName (string) --

          The name of the dataview to be used for caching historical data on disk.

        • dataviewConfiguration (dict) --

          The configuration of the dataview to be used with specified cluster.

          • dataviewName (string) --

            The unique identifier of the dataview.

          • dataviewVersionId (string) --

            The version of the dataview corresponding to a given changeset.

          • changesetId (string) --

            A unique identifier for the changeset.

          • segmentConfigurations (list) --

            The db path and volume configuration for the segmented database.

            • (dict) --

              The configuration that contains the database path of the data that you want to place on each selected volume. Each segment must have a unique database path for each volume. If you do not explicitly specify any database path for a volume, they are accessible from the cluster through the default S3/object store segment.

              • dbPaths (list) --

                The database path of the data that you want to place on each selected volume for the segment. Each segment must have a unique database path for each volume.

                • (string) --

              • volumeName (string) --

                The name of the volume where you want to add data.

              • onDemand (boolean) --

                Enables on-demand caching on the selected database path when a particular file or a column of the database is accessed. When on demand caching is True, dataviews perform minimal loading of files on the filesystem as needed. When it is set to False, everything is cached. The default value is False.

    • cacheStorageConfigurations (list) --

      The configurations for a read only cache storage associated with a cluster. This cache will be stored as an FSx Lustre that reads from the S3 store.

      • (dict) --

        The configuration for read only disk cache associated with a cluster.

        • type (string) --

          The type of cache storage. The valid values are:

          • CACHE_1000 – This type provides at least 1000 MB/s disk access throughput.

          • CACHE_250 – This type provides at least 250 MB/s disk access throughput.

          • CACHE_12 – This type provides at least 12 MB/s disk access throughput.

          For cache type CACHE_1000 and CACHE_250 you can select cache size as 1200 GB or increments of 2400 GB. For cache type CACHE_12 you can select the cache size in increments of 6000 GB.

        • size (integer) --

          The size of cache in Gigabytes.

    • autoScalingConfiguration (dict) --

      The configuration based on which FinSpace will scale in or scale out nodes in your cluster.

      • minNodeCount (integer) --

        The lowest number of nodes to scale. This value must be at least 1 and less than the maxNodeCount. If the nodes in a cluster belong to multiple availability zones, then minNodeCount must be at least 3.

      • maxNodeCount (integer) --

        The highest number of nodes to scale. This value cannot be greater than 5.

      • autoScalingMetric (string) --

        The metric your cluster will track in order to scale in and out. For example, CPU_UTILIZATION_PERCENTAGE is the average CPU usage across all the nodes in a cluster.

      • metricTarget (float) --

        The desired value of the chosen autoScalingMetric. When the metric drops below this value, the cluster will scale in. When the metric goes above this value, the cluster will scale out. You can set the target value between 1 and 100 percent.

      • scaleInCooldownSeconds (float) --

        The duration in seconds that FinSpace will wait after a scale in event before initiating another scaling event.

      • scaleOutCooldownSeconds (float) --

        The duration in seconds that FinSpace will wait after a scale out event before initiating another scaling event.

    • clusterDescription (string) --

      A description of the cluster.

    • capacityConfiguration (dict) --

      A structure for the metadata of a cluster. It includes information like the CPUs needed, memory of instances, and number of instances.

      • nodeType (string) --

        The type that determines the hardware of the host computer used for your cluster instance. Each node type offers different memory and storage capabilities. Choose a node type based on the requirements of the application or software that you plan to run on your instance.

        You can only specify one of the following values:

        • kx.s.large – The node type with a configuration of 12 GiB memory and 2 vCPUs.

        • kx.s.xlarge – The node type with a configuration of 27 GiB memory and 4 vCPUs.

        • kx.s.2xlarge – The node type with a configuration of 54 GiB memory and 8 vCPUs.

        • kx.s.4xlarge – The node type with a configuration of 108 GiB memory and 16 vCPUs.

        • kx.s.8xlarge – The node type with a configuration of 216 GiB memory and 32 vCPUs.

        • kx.s.16xlarge – The node type with a configuration of 432 GiB memory and 64 vCPUs.

        • kx.s.32xlarge – The node type with a configuration of 864 GiB memory and 128 vCPUs.

      • nodeCount (integer) --

        The number of instances running in a cluster.

    • releaseLabel (string) --

      The version of FinSpace managed kdb to run.

    • vpcConfiguration (dict) --

      Configuration details about the network where the Privatelink endpoint of the cluster resides.

      • vpcId (string) --

        The identifier of the VPC endpoint.

      • securityGroupIds (list) --

        The unique identifier of the VPC security group applied to the VPC endpoint ENI for the cluster.

        • (string) --

      • subnetIds (list) --

        The identifier of the subnet that the Privatelink VPC endpoint uses to connect to the cluster.

        • (string) --

      • ipAddressType (string) --

        The IP address type for cluster network configuration parameters. The following type is available:

        • IP_V4 – IP address version 4

    • initializationScript (string) --

      Specifies a Q program that will be run at launch of a cluster. It is a relative path within .zip file that contains the custom code, which will be loaded on the cluster. It must include the file name itself. For example, somedir/init.q.

    • commandLineArguments (list) --

      Defines key-value pairs to make them available inside the cluster.

      • (dict) --

        Defines the key-value pairs to make them available inside the cluster.

        • key (string) --

          The name of the key.

        • value (string) --

          The value of the key.

    • code (dict) --

      The details of the custom code that you want to use inside a cluster when analyzing a data. It consists of the S3 source bucket, location, S3 object version, and the relative path from where the custom code is loaded into the cluster.

      • s3Bucket (string) --

        A unique name for the S3 bucket.

      • s3Key (string) --

        The full S3 path (excluding bucket) to the .zip file. This file contains the code that is loaded onto the cluster when it's started.

      • s3ObjectVersion (string) --

        The version of an S3 object.

    • executionRole (string) --

      An IAM role that defines a set of permissions associated with a cluster. These permissions are assumed when a cluster attempts to access another cluster.

    • lastModifiedTimestamp (datetime) --

      The last time that the cluster was modified. The value is determined as epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

    • savedownStorageConfiguration (dict) --

      The size and type of the temporary storage that is used to hold data during the savedown process. This parameter is required when you choose clusterType as RDB. All the data written to this storage space is lost when the cluster node is restarted.

      • type (string) --

        The type of writeable storage space for temporarily storing your savedown data. The valid values are:

        • SDS01 – This type represents 3000 IOPS and io2 ebs volume type.

      • size (integer) --

        The size of temporary storage in gibibytes.

      • volumeName (string) --

        The name of the kdb volume that you want to use as writeable save-down storage for clusters.

    • azMode (string) --

      The number of availability zones you want to assign per cluster. This can be one of the following

      • SINGLE – Assigns one availability zone per cluster.

      • MULTI – Assigns all the availability zones per cluster.

    • availabilityZoneId (string) --

      The availability zone identifiers for the requested regions.

    • createdTimestamp (datetime) --

      The timestamp at which the cluster was created in FinSpace. The value is determined as epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

    • scalingGroupConfiguration (dict) --

      The structure that stores the capacity configuration details of a scaling group.

      • scalingGroupName (string) --

        A unique identifier for the kdb scaling group.

      • memoryLimit (integer) --

        An optional hard limit on the amount of memory a kdb cluster can use.

      • memoryReservation (integer) --

        A reservation of the minimum amount of memory that should be available on the scaling group for a kdb cluster to be successfully placed in a scaling group.

      • nodeCount (integer) --

        The number of kdb cluster nodes.

      • cpu (float) --

        The number of vCPUs that you want to reserve for each node of this kdb cluster on the scaling group host.

GetKxDataview (updated) Link ¶
Changes (response)
{'activeVersions': {'segmentConfigurations': {'onDemand': 'boolean'}},
 'readWrite': 'boolean',
 'segmentConfigurations': {'onDemand': 'boolean'}}

Retrieves details of the dataview.

See also: AWS API Documentation

Request Syntax

client.get_kx_dataview(
    environmentId='string',
    databaseName='string',
    dataviewName='string'
)
type environmentId:

string

param environmentId:

[REQUIRED]

A unique identifier for the kdb environment, from where you want to retrieve the dataview details.

type databaseName:

string

param databaseName:

[REQUIRED]

The name of the database where you created the dataview.

type dataviewName:

string

param dataviewName:

[REQUIRED]

A unique identifier for the dataview.

rtype:

dict

returns:

Response Syntax

{
    'databaseName': 'string',
    'dataviewName': 'string',
    'azMode': 'SINGLE'|'MULTI',
    'availabilityZoneId': 'string',
    'changesetId': 'string',
    'segmentConfigurations': [
        {
            'dbPaths': [
                'string',
            ],
            'volumeName': 'string',
            'onDemand': True|False
        },
    ],
    'activeVersions': [
        {
            'changesetId': 'string',
            'segmentConfigurations': [
                {
                    'dbPaths': [
                        'string',
                    ],
                    'volumeName': 'string',
                    'onDemand': True|False
                },
            ],
            'attachedClusters': [
                'string',
            ],
            'createdTimestamp': datetime(2015, 1, 1),
            'versionId': 'string'
        },
    ],
    'description': 'string',
    'autoUpdate': True|False,
    'readWrite': True|False,
    'environmentId': 'string',
    'createdTimestamp': datetime(2015, 1, 1),
    'lastModifiedTimestamp': datetime(2015, 1, 1),
    'status': 'CREATING'|'ACTIVE'|'UPDATING'|'FAILED'|'DELETING',
    'statusReason': 'string'
}

Response Structure

  • (dict) --

    • databaseName (string) --

      The name of the database where you created the dataview.

    • dataviewName (string) --

      A unique identifier for the dataview.

    • azMode (string) --

      The number of availability zones you want to assign per volume. Currently, FinSpace only supports SINGLE for volumes. This places dataview in a single AZ.

    • availabilityZoneId (string) --

      The identifier of the availability zones.

    • changesetId (string) --

      A unique identifier of the changeset that you want to use to ingest data.

    • segmentConfigurations (list) --

      The configuration that contains the database path of the data that you want to place on each selected volume. Each segment must have a unique database path for each volume. If you do not explicitly specify any database path for a volume, they are accessible from the cluster through the default S3/object store segment.

      • (dict) --

        The configuration that contains the database path of the data that you want to place on each selected volume. Each segment must have a unique database path for each volume. If you do not explicitly specify any database path for a volume, they are accessible from the cluster through the default S3/object store segment.

        • dbPaths (list) --

          The database path of the data that you want to place on each selected volume for the segment. Each segment must have a unique database path for each volume.

          • (string) --

        • volumeName (string) --

          The name of the volume where you want to add data.

        • onDemand (boolean) --

          Enables on-demand caching on the selected database path when a particular file or a column of the database is accessed. When on demand caching is True, dataviews perform minimal loading of files on the filesystem as needed. When it is set to False, everything is cached. The default value is False.

    • activeVersions (list) --

      The current active changeset versions of the database on the given dataview.

      • (dict) --

        The active version of the dataview that is currently in use by this cluster.

        • changesetId (string) --

          A unique identifier for the changeset.

        • segmentConfigurations (list) --

          The configuration that contains the database path of the data that you want to place on each selected volume. Each segment must have a unique database path for each volume. If you do not explicitly specify any database path for a volume, they are accessible from the cluster through the default S3/object store segment.

          • (dict) --

            The configuration that contains the database path of the data that you want to place on each selected volume. Each segment must have a unique database path for each volume. If you do not explicitly specify any database path for a volume, they are accessible from the cluster through the default S3/object store segment.

            • dbPaths (list) --

              The database path of the data that you want to place on each selected volume for the segment. Each segment must have a unique database path for each volume.

              • (string) --

            • volumeName (string) --

              The name of the volume where you want to add data.

            • onDemand (boolean) --

              Enables on-demand caching on the selected database path when a particular file or a column of the database is accessed. When on demand caching is True, dataviews perform minimal loading of files on the filesystem as needed. When it is set to False, everything is cached. The default value is False.

        • attachedClusters (list) --

          The list of clusters that are currently using this dataview.

          • (string) --

        • createdTimestamp (datetime) --

          The timestamp at which the dataview version was active. The value is determined as epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

        • versionId (string) --

          A unique identifier of the active version.

    • description (string) --

      A description of the dataview.

    • autoUpdate (boolean) --

      The option to specify whether you want to apply all the future additions and corrections automatically to the dataview when new changesets are ingested. The default value is false.

    • readWrite (boolean) --

      Returns True if the dataview is created as writeable and False otherwise.

    • environmentId (string) --

      A unique identifier for the kdb environment, from where you want to retrieve the dataview details.

    • createdTimestamp (datetime) --

      The timestamp at which the dataview was created in FinSpace. The value is determined as epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

    • lastModifiedTimestamp (datetime) --

      The last time that the dataview was updated in FinSpace. The value is determined as epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

    • status (string) --

      The status of dataview creation.

      • CREATING – The dataview creation is in progress.

      • UPDATING – The dataview is in the process of being updated.

      • ACTIVE – The dataview is active.

    • statusReason (string) --

      The error message when a failed state occurs.

ListKxDataviews (updated) Link ¶
Changes (response)
{'kxDataviews': {'activeVersions': {'segmentConfigurations': {'onDemand': 'boolean'}},
                 'readWrite': 'boolean',
                 'segmentConfigurations': {'onDemand': 'boolean'}}}

Returns a list of all the dataviews in the database.

See also: AWS API Documentation

Request Syntax

client.list_kx_dataviews(
    environmentId='string',
    databaseName='string',
    nextToken='string',
    maxResults=123
)
type environmentId:

string

param environmentId:

[REQUIRED]

A unique identifier for the kdb environment, for which you want to retrieve a list of dataviews.

type databaseName:

string

param databaseName:

[REQUIRED]

The name of the database where the dataviews were created.

type nextToken:

string

param nextToken:

A token that indicates where a results page should begin.

type maxResults:

integer

param maxResults:

The maximum number of results to return in this request.

rtype:

dict

returns:

Response Syntax

{
    'kxDataviews': [
        {
            'environmentId': 'string',
            'databaseName': 'string',
            'dataviewName': 'string',
            'azMode': 'SINGLE'|'MULTI',
            'availabilityZoneId': 'string',
            'changesetId': 'string',
            'segmentConfigurations': [
                {
                    'dbPaths': [
                        'string',
                    ],
                    'volumeName': 'string',
                    'onDemand': True|False
                },
            ],
            'activeVersions': [
                {
                    'changesetId': 'string',
                    'segmentConfigurations': [
                        {
                            'dbPaths': [
                                'string',
                            ],
                            'volumeName': 'string',
                            'onDemand': True|False
                        },
                    ],
                    'attachedClusters': [
                        'string',
                    ],
                    'createdTimestamp': datetime(2015, 1, 1),
                    'versionId': 'string'
                },
            ],
            'status': 'CREATING'|'ACTIVE'|'UPDATING'|'FAILED'|'DELETING',
            'description': 'string',
            'autoUpdate': True|False,
            'readWrite': True|False,
            'createdTimestamp': datetime(2015, 1, 1),
            'lastModifiedTimestamp': datetime(2015, 1, 1),
            'statusReason': 'string'
        },
    ],
    'nextToken': 'string'
}

Response Structure

  • (dict) --

    • kxDataviews (list) --

      The list of kdb dataviews that are currently active for the given database.

      • (dict) --

        A collection of kdb dataview entries.

        • environmentId (string) --

          A unique identifier for the kdb environment.

        • databaseName (string) --

          A unique identifier of the database.

        • dataviewName (string) --

          A unique identifier of the dataview.

        • azMode (string) --

          The number of availability zones you want to assign per volume. Currently, FinSpace only supports SINGLE for volumes. This places dataview in a single AZ.

        • availabilityZoneId (string) --

          The identifier of the availability zones.

        • changesetId (string) --

          A unique identifier for the changeset.

        • segmentConfigurations (list) --

          The configuration that contains the database path of the data that you want to place on each selected volume. Each segment must have a unique database path for each volume. If you do not explicitly specify any database path for a volume, they are accessible from the cluster through the default S3/object store segment.

          • (dict) --

            The configuration that contains the database path of the data that you want to place on each selected volume. Each segment must have a unique database path for each volume. If you do not explicitly specify any database path for a volume, they are accessible from the cluster through the default S3/object store segment.

            • dbPaths (list) --

              The database path of the data that you want to place on each selected volume for the segment. Each segment must have a unique database path for each volume.

              • (string) --

            • volumeName (string) --

              The name of the volume where you want to add data.

            • onDemand (boolean) --

              Enables on-demand caching on the selected database path when a particular file or a column of the database is accessed. When on demand caching is True, dataviews perform minimal loading of files on the filesystem as needed. When it is set to False, everything is cached. The default value is False.

        • activeVersions (list) --

          The active changeset versions for the given dataview entry.

          • (dict) --

            The active version of the dataview that is currently in use by this cluster.

            • changesetId (string) --

              A unique identifier for the changeset.

            • segmentConfigurations (list) --

              The configuration that contains the database path of the data that you want to place on each selected volume. Each segment must have a unique database path for each volume. If you do not explicitly specify any database path for a volume, they are accessible from the cluster through the default S3/object store segment.

              • (dict) --

                The configuration that contains the database path of the data that you want to place on each selected volume. Each segment must have a unique database path for each volume. If you do not explicitly specify any database path for a volume, they are accessible from the cluster through the default S3/object store segment.

                • dbPaths (list) --

                  The database path of the data that you want to place on each selected volume for the segment. Each segment must have a unique database path for each volume.

                  • (string) --

                • volumeName (string) --

                  The name of the volume where you want to add data.

                • onDemand (boolean) --

                  Enables on-demand caching on the selected database path when a particular file or a column of the database is accessed. When on demand caching is True, dataviews perform minimal loading of files on the filesystem as needed. When it is set to False, everything is cached. The default value is False.

            • attachedClusters (list) --

              The list of clusters that are currently using this dataview.

              • (string) --

            • createdTimestamp (datetime) --

              The timestamp at which the dataview version was active. The value is determined as epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

            • versionId (string) --

              A unique identifier of the active version.

        • status (string) --

          The status of a given dataview entry.

        • description (string) --

          A description for the dataview list entry.

        • autoUpdate (boolean) --

          The option to specify whether you want to apply all the future additions and corrections automatically to the dataview when you ingest new changesets. The default value is false.

        • readWrite (boolean) --

          Returns True if the dataview is created as writeable and False otherwise.

        • createdTimestamp (datetime) --

          The timestamp at which the dataview list entry was created in FinSpace. The value is determined as epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

        • lastModifiedTimestamp (datetime) --

          The last time that the dataview list was updated in FinSpace. The value is determined as epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

        • statusReason (string) --

          The error message when a failed state occurs.

    • nextToken (string) --

      A token that indicates where a results page should begin.

UpdateKxClusterDatabases (updated) Link ¶
Changes (request)
{'databases': {'dataviewConfiguration': {'segmentConfigurations': {'onDemand': 'boolean'}}}}

Updates the databases mounted on a kdb cluster, which includes the changesetId and all the dbPaths to be cached. This API does not allow you to change a database name or add a database if you created a cluster without one.

Using this API you can point a cluster to a different changeset and modify a list of partitions being cached.

See also: AWS API Documentation

Request Syntax

client.update_kx_cluster_databases(
    environmentId='string',
    clusterName='string',
    clientToken='string',
    databases=[
        {
            'databaseName': 'string',
            'cacheConfigurations': [
                {
                    'cacheType': 'string',
                    'dbPaths': [
                        'string',
                    ],
                    'dataviewName': 'string'
                },
            ],
            'changesetId': 'string',
            'dataviewName': 'string',
            'dataviewConfiguration': {
                'dataviewName': 'string',
                'dataviewVersionId': 'string',
                'changesetId': 'string',
                'segmentConfigurations': [
                    {
                        'dbPaths': [
                            'string',
                        ],
                        'volumeName': 'string',
                        'onDemand': True|False
                    },
                ]
            }
        },
    ],
    deploymentConfiguration={
        'deploymentStrategy': 'NO_RESTART'|'ROLLING'
    }
)
type environmentId:

string

param environmentId:

[REQUIRED]

The unique identifier of a kdb environment.

type clusterName:

string

param clusterName:

[REQUIRED]

A unique name for the cluster that you want to modify.

type clientToken:

string

param clientToken:

A token that ensures idempotency. This token expires in 10 minutes.

This field is autopopulated if not provided.

type databases:

list

param databases:

[REQUIRED]

The structure of databases mounted on the cluster.

  • (dict) --

    The configuration of data that is available for querying from this database.

    • databaseName (string) -- [REQUIRED]

      The name of the kdb database. When this parameter is specified in the structure, S3 with the whole database is included by default.

    • cacheConfigurations (list) --

      Configuration details for the disk cache used to increase performance reading from a kdb database mounted to the cluster.

      • (dict) --

        The structure of database cache configuration that is used for mapping database paths to cache types in clusters.

        • cacheType (string) -- [REQUIRED]

          The type of disk cache. This parameter is used to map the database path to cache storage. The valid values are:

          • CACHE_1000 – This type provides at least 1000 MB/s disk access throughput.

        • dbPaths (list) -- [REQUIRED]

          Specifies the portions of database that will be loaded into the cache for access.

          • (string) --

        • dataviewName (string) --

          The name of the dataview to be used for caching historical data on disk.

    • changesetId (string) --

      A unique identifier of the changeset that is associated with the cluster.

    • dataviewName (string) --

      The name of the dataview to be used for caching historical data on disk.

    • dataviewConfiguration (dict) --

      The configuration of the dataview to be used with specified cluster.

      • dataviewName (string) --

        The unique identifier of the dataview.

      • dataviewVersionId (string) --

        The version of the dataview corresponding to a given changeset.

      • changesetId (string) --

        A unique identifier for the changeset.

      • segmentConfigurations (list) --

        The db path and volume configuration for the segmented database.

        • (dict) --

          The configuration that contains the database path of the data that you want to place on each selected volume. Each segment must have a unique database path for each volume. If you do not explicitly specify any database path for a volume, they are accessible from the cluster through the default S3/object store segment.

          • dbPaths (list) -- [REQUIRED]

            The database path of the data that you want to place on each selected volume for the segment. Each segment must have a unique database path for each volume.

            • (string) --

          • volumeName (string) -- [REQUIRED]

            The name of the volume where you want to add data.

          • onDemand (boolean) --

            Enables on-demand caching on the selected database path when a particular file or a column of the database is accessed. When on demand caching is True, dataviews perform minimal loading of files on the filesystem as needed. When it is set to False, everything is cached. The default value is False.

type deploymentConfiguration:

dict

param deploymentConfiguration:

The configuration that allows you to choose how you want to update the databases on a cluster.

  • deploymentStrategy (string) -- [REQUIRED]

    The type of deployment that you want on a cluster.

    • ROLLING – This options updates the cluster by stopping the exiting q process and starting a new q process with updated configuration.

    • NO_RESTART – This option updates the cluster without stopping the running q process. It is only available for HDB type cluster. This option is quicker as it reduces the turn around time to update configuration on a cluster. With this deployment mode, you cannot update the initializationScript and commandLineArguments parameters.

rtype:

dict

returns:

Response Syntax

{}

Response Structure

  • (dict) --

UpdateKxDataview (updated) Link ¶
Changes (request, response)
Request
{'segmentConfigurations': {'onDemand': 'boolean'}}
Response
{'activeVersions': {'segmentConfigurations': {'onDemand': 'boolean'}},
 'readWrite': 'boolean',
 'segmentConfigurations': {'onDemand': 'boolean'}}

Updates the specified dataview. The dataviews get automatically updated when any new changesets are ingested. Each update of the dataview creates a new version, including changeset details and cache configurations

See also: AWS API Documentation

Request Syntax

client.update_kx_dataview(
    environmentId='string',
    databaseName='string',
    dataviewName='string',
    description='string',
    changesetId='string',
    segmentConfigurations=[
        {
            'dbPaths': [
                'string',
            ],
            'volumeName': 'string',
            'onDemand': True|False
        },
    ],
    clientToken='string'
)
type environmentId:

string

param environmentId:

[REQUIRED]

A unique identifier for the kdb environment, where you want to update the dataview.

type databaseName:

string

param databaseName:

[REQUIRED]

The name of the database.

type dataviewName:

string

param dataviewName:

[REQUIRED]

The name of the dataview that you want to update.

type description:

string

param description:

The description for a dataview.

type changesetId:

string

param changesetId:

A unique identifier for the changeset.

type segmentConfigurations:

list

param segmentConfigurations:

The configuration that contains the database path of the data that you want to place on each selected volume. Each segment must have a unique database path for each volume. If you do not explicitly specify any database path for a volume, they are accessible from the cluster through the default S3/object store segment.

  • (dict) --

    The configuration that contains the database path of the data that you want to place on each selected volume. Each segment must have a unique database path for each volume. If you do not explicitly specify any database path for a volume, they are accessible from the cluster through the default S3/object store segment.

    • dbPaths (list) -- [REQUIRED]

      The database path of the data that you want to place on each selected volume for the segment. Each segment must have a unique database path for each volume.

      • (string) --

    • volumeName (string) -- [REQUIRED]

      The name of the volume where you want to add data.

    • onDemand (boolean) --

      Enables on-demand caching on the selected database path when a particular file or a column of the database is accessed. When on demand caching is True, dataviews perform minimal loading of files on the filesystem as needed. When it is set to False, everything is cached. The default value is False.

type clientToken:

string

param clientToken:

[REQUIRED]

A token that ensures idempotency. This token expires in 10 minutes.

This field is autopopulated if not provided.

rtype:

dict

returns:

Response Syntax

{
    'environmentId': 'string',
    'databaseName': 'string',
    'dataviewName': 'string',
    'azMode': 'SINGLE'|'MULTI',
    'availabilityZoneId': 'string',
    'changesetId': 'string',
    'segmentConfigurations': [
        {
            'dbPaths': [
                'string',
            ],
            'volumeName': 'string',
            'onDemand': True|False
        },
    ],
    'activeVersions': [
        {
            'changesetId': 'string',
            'segmentConfigurations': [
                {
                    'dbPaths': [
                        'string',
                    ],
                    'volumeName': 'string',
                    'onDemand': True|False
                },
            ],
            'attachedClusters': [
                'string',
            ],
            'createdTimestamp': datetime(2015, 1, 1),
            'versionId': 'string'
        },
    ],
    'status': 'CREATING'|'ACTIVE'|'UPDATING'|'FAILED'|'DELETING',
    'autoUpdate': True|False,
    'readWrite': True|False,
    'description': 'string',
    'createdTimestamp': datetime(2015, 1, 1),
    'lastModifiedTimestamp': datetime(2015, 1, 1)
}

Response Structure

  • (dict) --

    • environmentId (string) --

      A unique identifier for the kdb environment, where you want to update the dataview.

    • databaseName (string) --

      The name of the database.

    • dataviewName (string) --

      The name of the database under which the dataview was created.

    • azMode (string) --

      The number of availability zones you want to assign per volume. Currently, FinSpace only supports SINGLE for volumes. This places dataview in a single AZ.

    • availabilityZoneId (string) --

      The identifier of the availability zones.

    • changesetId (string) --

      A unique identifier for the changeset.

    • segmentConfigurations (list) --

      The configuration that contains the database path of the data that you want to place on each selected volume. Each segment must have a unique database path for each volume. If you do not explicitly specify any database path for a volume, they are accessible from the cluster through the default S3/object store segment.

      • (dict) --

        The configuration that contains the database path of the data that you want to place on each selected volume. Each segment must have a unique database path for each volume. If you do not explicitly specify any database path for a volume, they are accessible from the cluster through the default S3/object store segment.

        • dbPaths (list) --

          The database path of the data that you want to place on each selected volume for the segment. Each segment must have a unique database path for each volume.

          • (string) --

        • volumeName (string) --

          The name of the volume where you want to add data.

        • onDemand (boolean) --

          Enables on-demand caching on the selected database path when a particular file or a column of the database is accessed. When on demand caching is True, dataviews perform minimal loading of files on the filesystem as needed. When it is set to False, everything is cached. The default value is False.

    • activeVersions (list) --

      The current active changeset versions of the database on the given dataview.

      • (dict) --

        The active version of the dataview that is currently in use by this cluster.

        • changesetId (string) --

          A unique identifier for the changeset.

        • segmentConfigurations (list) --

          The configuration that contains the database path of the data that you want to place on each selected volume. Each segment must have a unique database path for each volume. If you do not explicitly specify any database path for a volume, they are accessible from the cluster through the default S3/object store segment.

          • (dict) --

            The configuration that contains the database path of the data that you want to place on each selected volume. Each segment must have a unique database path for each volume. If you do not explicitly specify any database path for a volume, they are accessible from the cluster through the default S3/object store segment.

            • dbPaths (list) --

              The database path of the data that you want to place on each selected volume for the segment. Each segment must have a unique database path for each volume.

              • (string) --

            • volumeName (string) --

              The name of the volume where you want to add data.

            • onDemand (boolean) --

              Enables on-demand caching on the selected database path when a particular file or a column of the database is accessed. When on demand caching is True, dataviews perform minimal loading of files on the filesystem as needed. When it is set to False, everything is cached. The default value is False.

        • attachedClusters (list) --

          The list of clusters that are currently using this dataview.

          • (string) --

        • createdTimestamp (datetime) --

          The timestamp at which the dataview version was active. The value is determined as epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

        • versionId (string) --

          A unique identifier of the active version.

    • status (string) --

      The status of dataview creation.

      • CREATING – The dataview creation is in progress.

      • UPDATING – The dataview is in the process of being updated.

      • ACTIVE – The dataview is active.

    • autoUpdate (boolean) --

      The option to specify whether you want to apply all the future additions and corrections automatically to the dataview when new changesets are ingested. The default value is false.

    • readWrite (boolean) --

      Returns True if the dataview is created as writeable and False otherwise.

    • description (string) --

      A description of the dataview.

    • createdTimestamp (datetime) --

      The timestamp at which the dataview was created in FinSpace. The value is determined as epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.

    • lastModifiedTimestamp (datetime) --

      The last time that the dataview was updated in FinSpace. The value is determined as epoch time in milliseconds. For example, the value for Monday, November 1, 2021 12:00:00 PM UTC is specified as 1635768000000.