AMP Project Specification

YAML File Specification ‒ Version 1.0

The project metadata file is a YAML file. It must be placed in your project's root directory and must be named .project-metadata.yaml. The specification for this file is given below. For a concrete example, see the .project-metadata.yaml file of one of the Cloudera AMPs.

Fields

Fields for this YAML file are in snake_case. String fields are generally limited to a fixed number of characters; for example, string(64) can contain at most 64 characters.

Field Name	Type	Example	Description
name	string(200)	ML Demo	Required: The name of this project prototype. Prototype names do not need to be unique.
description	string(2048)	This demo shows off some cool applications of ML.	Required: A description for this project prototype.
author	string(64)	Cloudera Engineer	Required: The author of this prototype (can be the name of an individual, team, or organization).
date	date string	"2020-08-11"	The date this project prototype was last modified. It should be in the format: "YYYY-MM-DD" (quotation marks are required).
specification_version	string(16)	0.1	Required: The version of the YAML file specification to use.
prototype_version	string(16)	1.0	Required: The version of this project prototype.
shared_memory_limit	number	`0.0625`	Additional shared memory in GB available to sessions running in this project. The default is 0.0625 GB (64 MB).
environment_variables	environment variables object	See below	Global environment variables for this project prototype.
feature_dependencies	feature_dependencies	See below	A list of feature dependencies of this AMP. A missing dependency in the workspace blocks the creation of the AMP.
engine_images	engine_images	See below	Engine images to be used with the AMP. The images specified here are a recommendation only; they do not prevent the user from launching the AMP with non-recommended engine images.
runtimes	runtimes	See below	Runtimes to be used with the AMP. The runtimes specified here are a recommendation only; they do not prevent the user from launching the AMP with non-recommended runtimes.
tasks	task list	See below	A sequence of tasks, such as running Jobs or deploying Models, to be run after project import.
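
As an illustration of the top-level fields in the table above, a minimal sketch of the start of a .project-metadata.yaml file might look like the following. The values are copied from the Example column; the layout as a flat set of top-level keys is an assumption, and the nested fields (environment_variables, runtimes, tasks, and so on) are left out.

```yaml
# Sketch only: top-level keys from the Fields table, values from its Example column.
name: ML Demo
description: This demo shows off some cool applications of ML.
author: Cloudera Engineer
date: "2020-08-11"            # quotation marks are required for the date
specification_version: 0.1    # written as shown in the Example column
prototype_version: 1.0        # written as shown in the Example column
shared_memory_limit: 0.0625   # GB; 0.0625 is also the documented default
```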

Field Name	Type	Example	Description
default	string	"3"	The default value for this environment variable. Users may override this value when importing this project prototype.
description	string	The number of Model replicas, 3 is standard for redundancy.	A short description explaining this environment variable.
required	boolean	`true`	Whether the environment variable must have a non-empty value. The default is `false`.
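
The three fields above describe a single environment variable. A sketch of how one variable might be declared is shown below. It assumes that each variable is keyed by its name under environment_variables, which this table does not state, and the variable name MODEL_REPLICAS is hypothetical.

```yaml
# Sketch only: assumes variables are keyed by name under environment_variables.
environment_variables:
  MODEL_REPLICAS:             # hypothetical variable name
    default: "3"              # users may override this when importing the prototype
    description: "The number of Model replicas, 3 is standard for redundancy."
    required: true            # must end up with a non-empty value
```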

Field Name	Type	Example	Description
type	string	`create_job`	Required: The type of task to be executed. See below for a list of allowed types.
short_summary	string	Creating a Job that will do a task.	A short summary of what this task is doing.
long_summary	string	Creating a Job that will do this specific task. This is important because it leads up to this next task.	A long summary of what this task is doing.

Field Name	Type	Example	Description
type	string	`run_job`	Required: Must be `run_job`.
entity_label	string	howdy	Required: Must match an `entity_label` of a previous `create_job` task.

Field Name	Type	Example	Description
script	string	greeting.py	Required: Script for this Job to run.
kernel	string	`python3`	Required: What kernel this Job should use. Acceptable values are `python2`, `python3`, `r`, and `scala`. Note that `scala` might not be supported for every cluster.
arguments	string	Ofek 21	Command line arguments to be given to this Job when running.
environment_variables	environment variables object	See above	See above
cpu	number	`1.0`	The number of CPU virtual cores to allocate for this Job. The default is `1.0`.
memory	number	`1.0`	The amount of memory in GB to allocate for this Job. The default is `1.0`.
gpu	integer	`0`	The number of GPUs to allocate for this Job. The default is `0`.
shared_memory_limit	number	`0.0625`	Limits the additional shared memory in GB that can be used by this Job. The default is 0.0625 GB (64 MB).
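
To show how the task and Job tables fit together, here is a sketch of a task list that creates a Job and then runs it. It assumes that the field table directly above applies to `create_job` tasks and that tasks are written as a YAML sequence under the top-level tasks key. The tables here may not list every required `create_job` field, so treat this as illustrative rather than complete.

```yaml
# Sketch only: values come from the Example columns of the task and Job tables.
tasks:
  - type: create_job
    entity_label: howdy             # referenced by the run_job task below
    script: greeting.py
    kernel: python3
    arguments: Ofek 21
    cpu: 1.0
    memory: 1.0
    gpu: 0
    shared_memory_limit: 0.0625
    short_summary: Creating a Job that will do a task.

  - type: run_job
    entity_label: howdy             # must match the create_job label above
```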

Field Name	Type	Example	Description
cpu	number	`1.0`	The number of CPU virtual cores to allocate per Model deployment.
memory	number	`2.0`	The amount of memory in GB to allocate per Model deployment.
gpu	integer	`0`	The number of GPUs to allocate per Model deployment.

Field Name	Type	Example	Description
type	string	`fixed`	Must be `fixed` if present.
num_replicas	integer	`1`	The number of replicas to create per Model deployment.

Field Name	Type	Example	Description
request	string	See above	Required: An example request object.
response	string	See above	Required: The response to the above example request object.

Field Name	Type	Example	Description
type	string	`create_model`	Required: Must be `create_model`.
name	string	Say hello to me	Required: Model name.
entity_label	string	says-hello	Required: Uniquely identifies this model for future tasks, i.e. `build_model` and `deploy_model` tasks. Entity labels must be lowercase alphanumeric, and may contain hyphens or underscores.
access_key_environment_variable	string	SHTM_ACCESS_KEY	Saves the model's access key to an environment variable with the specified name.
default_resources	resources object	See above	The default amount of resources to allocate per Model deployment.
default_replication_policy	replication policy object	See above	The default replication policy for Model deployments.
description	string	This model says hello to you	Model description.
visibility	string	`private`	The default visibility for this Model.
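
A `create_model` task combining the fields above with the resources and replication policy objects could be sketched as follows. All values are taken from the Example columns; the nesting of the two objects under default_resources and default_replication_policy follows their field types, and placing the task under a top-level tasks key is an assumption.

```yaml
# Sketch only: a create_model task built from the Example columns.
tasks:
  - type: create_model
    name: Say hello to me
    entity_label: says-hello        # reused by later build_model and deploy_model tasks
    description: This model says hello to you
    visibility: private
    access_key_environment_variable: SHTM_ACCESS_KEY
    default_resources:              # resources object
      cpu: 1.0
      memory: 2.0
      gpu: 0
    default_replication_policy:     # replication policy object
      type: fixed
      num_replicas: 1
```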

Field Name	Type	Example	Description
type	string	`build_model`	Required: Must be `build_model`.
entity_label	string	says-hello	Required: Must match an `entity_label` of a previous `create_model` task.
target_file_path	string	greeting.py	Required: Path to file that will be run by Model.
target_function_name	string	greet_me	Required: Name of function to be called by Model.
kernel	string	`python3`	What kernel this Model should use. Acceptable values are `python2`, `python3`, `r`, and `scala`. Note that `scala` might not be supported for every cluster.
comment	string	Some comment about the model	A comment about the Model.
examples	model examples list	See above	A list of request/response example objects.
environment_variables	environment variables object	See above	See above
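
A `build_model` task might be sketched as below. The request and response payloads are hypothetical, and writing them as nested mappings (rather than strings) is an assumption, since the example the tables refer to is not reproduced here.

```yaml
# Sketch only: assumes a create_model task with entity_label says-hello ran earlier.
tasks:
  - type: build_model
    entity_label: says-hello
    target_file_path: greeting.py
    target_function_name: greet_me
    kernel: python3
    comment: Some comment about the model
    examples:                       # list of request/response example objects
      - request:                    # hypothetical payload
          name: Ofek
        response:                   # hypothetical payload
          greeting: Hello Ofek
```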

Field Name	Type	Example	Description
type	string	`deploy_model`	Required: Must be `deploy_model`.
entity_label	string	says-hello	Required: Must match an `entity_label` of a previous `create_model` task.

Field Name	Type	Example	Description
cpu	number	`1.0`	The number of CPU virtual cores to allocate for this Model deployment.
memory	number	`2.0`	The amount of memory in GB to allocate for this Model deployment.
gpu	integer	`0`	The number of GPUs to allocate for this Model deployment.
replication_policy	replication policy object	See above	The replication policy for this Model deployment.
environment_variables	environment variables object	See above	Overrides environment variables for this Model deployment.
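
A `deploy_model` task using the fields in the two tables above could be sketched like this. It assumes that a Model labeled says-hello was created and built by earlier tasks, and that the resource and replication fields are optional per-deployment settings.

```yaml
# Sketch only: assumes earlier tasks created and built the model labeled says-hello.
tasks:
  - type: deploy_model
    entity_label: says-hello
    cpu: 1.0
    memory: 2.0
    gpu: 0
    replication_policy:             # replication policy object
      type: fixed
      num_replicas: 1
```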

Field Name	Type	Example	Description
type	string	`start_application`	Required: Must be `start_application`.
subdomain	string	greet	Required: Application subdomain, which must be unique per Application, and must be alphanumeric and hyphen-delimited. Application subdomains are also converted to lowercase.
kernel	string	`python3`	Required: What kernel this Application should use. Acceptable values are `python2`, `python3`, `r`, and `scala`. Note that `scala` might not be supported for every cluster.
entity_label	string	greeter	Uniquely identifies this application for future tasks. Entity labels must be lowercase alphanumeric, and may contain hyphens or underscores.
script	string	greeting.py	Script for this Application to run.
name	string	Greeter	Application name, defaults to 'Untitled application'.
description	string	Some description about the Application	Application description, defaults to 'No description for the app'.
cpu	number	`1.0`	The number of CPU virtual cores to allocate for this Application.
memory	number	`1.0`	The amount of memory in GB to allocate for this Application.
gpu	integer	`0`	The number of GPUs to allocate for this Application.
shared_memory_limit	number	`0.0625`	Limits the additional shared memory in GB that can be used by this Application. The default is 0.0625 GB (64 MB).
environment_variables	environment variables object	See above	See above
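
Putting the Application fields together, a `start_application` task might look like the sketch below. The values come from the Example column, and the placement under a top-level tasks key is an assumption.

```yaml
# Sketch only: a start_application task built from the Example column.
tasks:
  - type: start_application
    subdomain: greet                # must be unique per Application
    kernel: python3
    entity_label: greeter
    script: greeting.py
    name: Greeter
    description: Some description about the Application
    cpu: 1.0
    memory: 1.0
    gpu: 0
    shared_memory_limit: 0.0625
```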

Field Name	Type	Example	Description
type	string	`run_experiment`	Required: Must be `run_experiment`.
script	string	greeting.py	Required: Script for this Experiment to run.
entity_label	string	test-greeter	Uniquely identifies this experiment for future tasks. Entity labels must be lowercase alphanumeric, and may contain hyphens or underscores.
arguments	string	Ofek 21	Command line arguments to be given to this Experiment when running.
kernel	string	`python3`	What kernel this Experiment should use. Acceptable values are `python2`, `python3`, `r`, and `scala`. Note that `scala` might not be supported for every cluster.
comment	string	Comment about the experiment	A comment about the Experiment.
cpu	number	`1.0`	The number of CPU virtual cores to allocate for this Experiment.
memory	number	`1.0`	The amount of memory in GB to allocate for this Experiment.
gpu	number	`0`	The number of GPUs to allocate for this Experiment.
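
A `run_experiment` task based on the table above could be sketched as follows, again using the Example column values and assuming the task sits in a sequence under the top-level tasks key.

```yaml
# Sketch only: a run_experiment task built from the Example column.
tasks:
  - type: run_experiment
    script: greeting.py
    entity_label: test-greeter
    arguments: Ofek 21
    kernel: python3
    comment: Comment about the experiment
    cpu: 1.0
    memory: 1.0
    gpu: 0
```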

Field Name	Type	Example	Description
type	string	`run_session`	Required: Must be `run_session`.
`code` or `script`	string	See above for code, greeting.py for script	Required: Exactly one of the `code` and `script` fields must be present for the run Session task, not both. `code` is a direct block of code that will be run by the Session, while `script` is a script file that will be executed by the Session.
kernel	string	`python3`	Required: What kernel this Session should use. Acceptable values are `python2`, `python3`, `r`, and `scala`. Note that `scala` might not be supported for every cluster.
cpu	number	`1.0`	Required: The number of CPU virtual cores to allocate for this Session.
memory	number	`1.0`	Required: The amount of memory in GB to allocate for this Session.
entity_label	string	greeter	Uniquely identifies this session for future tasks. Entity labels must be lowercase alphanumeric, and may contain hyphens or underscores.
name	string	How to be greeted interactively	Session name.
gpu	integer	`0`	The number of GPUs to allocate for this Session.
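
Because a `run_session` task takes either `code` or `script` but not both, the two variants can be sketched side by side. The inline code, and the entity label greeter-script, are hypothetical; the remaining values come from the Example column.

```yaml
# Sketch only: two run_session tasks, one using code and one using script.
tasks:
  # Variant 1: inline code (the code itself is a hypothetical placeholder)
  - type: run_session
    name: How to be greeted interactively
    entity_label: greeter
    kernel: python3
    cpu: 1.0
    memory: 1.0
    gpu: 0
    code: |
      print("Hello!")

  # Variant 2: a script file instead of inline code
  - type: run_session
    entity_label: greeter-script    # hypothetical label
    kernel: python3
    cpu: 1.0
    memory: 1.0
    script: greeting.py
```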

