Keep storage tiers » History » Version 3
Tom Clegg, 04/28/2017 03:43 PM
| 1 | 1 | Tom Clegg | h1. Keep storage tiers |
|---|---|---|---|
| 2 | |||
| 3 | 2 | Tom Clegg | Typically, an Arvados cluster has access to multiple storage devices with different cost/performance trade-offs. |
| 4 | 1 | Tom Clegg | |
| 5 | Examples: |
||
| 6 | * Local SSD |
||
| 7 | * Local HDD |
||
| 8 | * Object storage service provided by cloud vendor |
||
| 9 | * Slower or less reliable object storage service provided by same cloud vendor |
||
| 10 | 2 | Tom Clegg | |
| 11 | Users should be able to specify a minimum storage tier for each collection. Arvados should ensure that every data block referenced by a collection is stored at the specified tier _or better_. |
||
| 12 | |||
| 13 | The cluster administrator should be able to specify a default tier, and assign a tier number to each storage device. |
||
| 14 | |||
| 15 | 3 | Tom Clegg | It should be possible to configure multiple storage devices at the same tier: for example, this allows blocks to be distributed more or less uniformly across several (equivalent) cloud storage buckets for performance reasons. |
| 16 | |||
| 17 | h1. Implementation (proposal) |
||
| 18 | |||
| 19 | Storage tier features (and implementation) are similar to replication-level features. |
||
| 20 | |||
| 21 | h2. Configuration |
||
| 22 | |||
| 23 | Each Keep volume has an integer parameter, "tier". Interpretation is site-specific, except that when M≤N, tier M can satisfy a requirement for tier N, i.e., smaller tier numbers are better. |
||
| 24 | |||
| 25 | There is a site-wide default tier number which is used for collections that do not specify a desired tier. |
||
| 26 | |||
| 27 | h2. Storing data at a non-default tier |
||
| 28 | |||
| 29 | Tools that write data to Keep should allow the caller to specify a storage tier. The desired tier is sent to Keep services as a header (X-Keep-Desired-Tier) with each write request. Keep services return an error when the data cannot be written to the requested tier (or better). |
||
| 30 | |||
| 31 | h2. Moving data between tiers |
||
| 32 | |||
| 33 | Each collection has an integer field, "tier_desired". If tier_desired is not null, all blocks referenced by the collection should be stored at the given tier (or better). |
||
| 34 | |||
| 35 | Keep-balance tracks the maximum allowed tier for each block, and moves blocks between tiers as needed. The strategy is similar to fixing rendezvous probe order: if a block is stored at the wrong tier, a new copy is made at the correct tier; then, in a subsequent balancing operation, the redundant copy is detected and deleted. _This increases the danger of data loss due to races between concurrent keep-balance processes. Keep-balance should have a reliable way to detect/avoid concurrent balancing operations._ |
||
| 36 | |||
| 37 | h2. Reporting |
||
| 38 | |||
| 39 | Each collection has an integer field, "tier_confirmed", and a timestamp field, "tier_confirmed_at". These indicate the most recent state of stored blocks: if tier_confirmed=2 then (as of tier_confirmed_at) every block in the collection was stored at tier 2 or better. |