Bug #18562
closed[api] should not change the preemptible flag across the board
Added by Ward Vandewege about 3 years ago. Updated almost 3 years ago.
100%
Description
The RailsAPI changes applies the default config value for the preemptible flag whenever a container record is loaded. This causes problems when the config value is changed after a container record has been created and before it is done.
Case in point, if a-d-c tries to cancel (sends State=Cancelled) a container that was created when the config said `UsePreemptible: true` and meanwhile the config is changed to `UsePreemptible: false`, RailsAPI fails validation and returns a 422 on the cancel request.
RailsAPI should only set/change the `Preemptible` flag in one place, maybe when the container is created or when it is committed.
Updated by Ward Vandewege about 3 years ago
- Description updated (diff)
- Target version changed from 2021-12-08 sprint to 2022-01-05 sprint
Updated by Tom Clegg about 3 years ago
- If there are no configured
InstanceTypes
withPreemptible: true
, it is an error for a caller to specify preemptible=true when creating a CR, or change preemptible from false to true during an update. - If config has
UsePreemptibleInstances: true
and someInstanceTypes
withPreemptible: true
, a newly created container request with a non-emptyrequesting_container_uuid
has preemptible set to true, even if the caller specified false. - In all other cases, preemptible is set to whatever the client specifies.
- Default config is
UsePreemptibleInstances: true
, i.e., if you configure some preemptible instance types, arvados will automatically use preemptible instances for child containers. If you don't, it won't.
Updated by Peter Amstutz about 3 years ago
Tom Clegg wrote:
- If config has
UsePreemptibleInstances: true
and someInstanceTypes
withPreemptible: true
, a newly created container request with a non-emptyrequesting_container_uuid
has preemptible set to true, even if the caller specified false.- In all other cases, preemptible is set to whatever the client specifies.
Could we tweak that to
- If config has
InstanceTypes
withPreemptible: true
, then a newly created container request with a non-emptyrequesting_container_uuid
and empty/unspecifiedpreemptible
haspreemptible
on the container set to the value ofUsePreemptibleInstances
.
- In all other cases, preemptible is set to whatever the client specifies or left unchanged in unspecified.
Updated by Ward Vandewege about 3 years ago
I would very much like a shortcut that makes all node types available as both preemptible or not, if no node types have an explicit preemptible flag set in the config. The current situation is very tedious, having to duplicate all these records.
I am not aware of node types that don't exist as spot.
Updated by Tom Clegg about 3 years ago
Peter Amstutz wrote:
Could we tweak that to
- [...] empty/unspecified
preemptible
haspreemptible
on the container set to the value ofUsePreemptibleInstances
.
So, change the meaning of UsePreemptibleInstances from "override user's choice" to "configure default choice"?
Currently, a-c-r never specifies anything, so we could update controller to propagate null/missing, and interpret null in rails the way it currently interprets false, and the end result would be the same behavior we have now. In future, when a-c-r lets the user choose, it would still be able to choose "I don't know, whatever cluster config says".
The configurable default could be useful, but IMO it would be better to implement by having a-c-r look up the default in the exported config. For one thing, it can tell the user what the default is.
(I also don't love making tri-state fields with bool pointers etc.)
Updated by Peter Amstutz about 3 years ago
Tom Clegg wrote:
Peter Amstutz wrote:
Could we tweak that to
- [...] empty/unspecified
preemptible
haspreemptible
on the container set to the value ofUsePreemptibleInstances
.So, change the meaning of UsePreemptibleInstances from "override user's choice" to "configure default choice"?
Currently, a-c-r never specifies anything, so we could update controller to propagate null/missing, and interpret null in rails the way it currently interprets false, and the end result would be the same behavior we have now. In future, when a-c-r lets the user choose, it would still be able to choose "I don't know, whatever cluster config says".
The configurable default could be useful, but IMO it would be better to implement by having a-c-r look up the default in the exported config. For one thing, it can tell the user what the default is.
- If config has InstanceTypes with Preemptible: true, then a newly created container request with a non-empty requesting_container_uuid and value of "false" for preemptible, has preemptible on the container set to the value of UsePreemptibleInstances.
- In all other cases, preemptible is set to whatever the client specifies or left unchanged if unspecified.
(I also don't love making tri-state fields with bool pointers etc.)
That's definitely an impedance mismatch between Go and Python/Ruby.
How do you do "left unchanged if unspecified"? If "false" is treated as default/unspecified then it's impossible to set a preemptible flag that's been set to true back to false?
Updated by Tom Clegg about 3 years ago
How do you do "left unchanged if unspecified"? If "false" is treated as default/unspecified then it's impossible to set a preemptible flag that's been set to true back to false?
Right, I just don't think we need a value that means "unspecified". It can just be true or false, and you get what you ask for, unless the "force preemptible child containers even if client specifies false" config is enabled.
Updated by Tom Clegg about 3 years ago
- Related to Feature #18596: Config option to enable preemptible variants of all instance types added
Updated by Tom Clegg about 3 years ago
- Do not automatically set preemptible=true if there are no preemptible instance types available.
- Do not automatically set preemptible=true on a container request that has already reached "Committed" state with preemptible=false.
- If a container request reaches "Committed" state with preemptible=true, and then all preemptible instance types are removed from config, do not reject subsequent updates to the container request. (Was: refuse all updates, even cancel, if config UsePreemptibleInstances changes to false after a CR is created with preemptible=true.)
- UsePreemptibleInstances=true means enable preemptible=true on child CRs if any preemptible instance types are configured. (Was: set preemptible flag even if that means the container will never run because there are no preemptible instance types configured.)
- UsePreemptibleInstances=false means leave preemptible flag alone, do whatever the client asks. (Was: impossible to set preemptible=true)
The doc page says "To use preemptible instances, set UsePreemptibleInstances: true and add entries to InstanceTypes with Preemptible: true to config.yml." Now that the default for UsePreemptibleInstances is true, should we remove "set UsePreemptibleInstances: true and" or change it to "make sure UsePreemptibleInstances isn't set to false"? Or just leave it the way it is?
Updated by Ward Vandewege about 3 years ago
Tom Clegg wrote:
18562-preemptible @ 534b7df510b99923a7dc273a2f8cacfd0c599800 -- developer-run-tests: #2853
- Do not automatically set preemptible=true if there are no preemptible instance types available.
- Do not automatically set preemptible=true on a container request that has already reached "Committed" state with preemptible=false.
- If a container request reaches "Committed" state with preemptible=true, and then all preemptible instance types are removed from config, do not reject subsequent updates to the container request. (Was: refuse all updates, even cancel, if config UsePreemptibleInstances changes to false after a CR is created with preemptible=true.)
- UsePreemptibleInstances=true means enable preemptible=true on child CRs if any preemptible instance types are configured. (Was: set preemptible flag even if that means the container will never run because there are no preemptible instance types configured.)
- UsePreemptibleInstances=false means leave preemptible flag alone, do whatever the client asks. (Was: impossible to set preemptible=true)
This is a bit weird. As it is now, the name of the flag is misleading. Maybe we should rename the flag to 'ForcePreemptibleInstances' or 'AlwaysUsePreemptibleInstances' ? Also, the meaning of 'false' should be documented in config.default.yml.
The doc page says "To use preemptible instances, set UsePreemptibleInstances: true and add entries to InstanceTypes with Preemptible: true to config.yml." Now that the default for UsePreemptibleInstances is true, should we remove "set UsePreemptibleInstances: true and" or change it to "make sure UsePreemptibleInstances isn't set to false"? Or just leave it the way it is?
It definitely needs to be updated to reflect the default and the meaning of both 'true' and 'false'.
Otherwise, LGTM thanks.
Updated by Tom Clegg about 3 years ago
- update doc/config comments
- rename config to AlwaysUsePreemptibleInstances
Updated by Ward Vandewege about 3 years ago
Tom Clegg wrote:
18562-preemptible @ 0a08f54c405dff0dbda5d6dbc14c1f1c6eeecd39 -- developer-run-tests: #2855
- update doc/config comments
- rename config to AlwaysUsePreemptibleInstances
LGTM thanks. Will you also add a note to the upgrade guide about the renaming of the preemptible global config flag?
Updated by Tom Clegg about 3 years ago
18562-preemptible @ 87f5edeeb1ec8c03a71c2cfa1656176735bfedca
Added upgrade note:
Preemptible instance types are used automatically, if any are configured¶
The default behavior for selecting "preemptible instances":{{site.baseurl}}/admin/spot-instances.html has changed. If your configuration lists any instance types with Preemptible: true
, all child (non-top-level) containers will automatically be scheduled on preemptible instances. To avoid using preemptible instances except when explicitly requested by clients, add AlwaysUsePreemptibleInstances: false
in the Containers
config section. (Previously, preemptible instance types were never used unless the configuration specified UsePreemptibleInstances: true
. That flag has been removed.)
Updated by Ward Vandewege about 3 years ago
Tom Clegg wrote:
18562-preemptible @ 87f5edeeb1ec8c03a71c2cfa1656176735bfedca
Added upgrade note:
Preemptible instance types are used automatically, if any are configured¶
The default behavior for selecting "preemptible instances":{{site.baseurl}}/admin/spot-instances.html has changed. If your configuration lists any instance types with
Preemptible: true
, all child (non-top-level) containers will automatically be scheduled on preemptible instances. To avoid using preemptible instances except when explicitly requested by clients, addAlwaysUsePreemptibleInstances: false
in theContainers
config section. (Previously, preemptible instance types were never used unless the configuration specifiedUsePreemptibleInstances: true
. That flag has been removed.)
Yeah that's good, thanks!
Updated by Tom Clegg about 3 years ago
- Status changed from In Progress to Resolved
Applied in changeset arvados|00f1f05789316936db75b4723b1c3d99196c252a.
Updated by Ward Vandewege almost 3 years ago
- Related to Story #18179: Better spot instance support added