Keep Proxy Specification » History » Revision 13

Peter Amstutz, 07/23/2014 09:12 AM


Reverse Keep Proxy

Archived for informational purposes. The proposal described here is now implemented in arvados/services/keep/src/arvados.org/keepproxy

Problem

We need to be able to automatically upload huge (larger than 1 TiB) datasets into Arvados. The currently proposed solution is to upload the data to a staging area and then put the data into Keep. On further consideration, this solution is inadequate for a number of reasons:
  • Must set aside staging area big enough to accommodate large uploads.
  • When uploads are not occurring, this empty space just sits around, costing money.
  • Amazon has a 1 TiB limit on EBS volumes, which means we can't accept datasets larger than 1 TiB unless we create volume-spanning partitions.
  • Multiple users uploading to the same staging partition can end up in a starvation deadlock if the volume fills up.
  • Some of these problems could be addressed by allocating/deallocating volumes on the fly, but this adds significant complexity.
  • Once data is uploaded, it still needs to be copied into Keep, which adds wait time between the end of the upload and the point when the data is actually ready to use.

Solution

Provide a Keep client that sends blocks to a reverse Keep proxy, which forwards the blocks to appropriate internal Keep servers.
  • Doesn't require staging except in RAM of the Keep proxy.
  • No dataset size limit except Keep's overall capacity.
  • Fewer contention problems (although many uploaders could overwhelm the proxy node...)
  • Data is available immediately once upload is completed
  • This is the right thing to do in the long term anyway. We shouldn't waste our time with messy hacks.
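
As a rough illustration of the forwarding path described above, the following Go sketch accepts a block over HTTP, buffers it in RAM only, checks its hash, and hands it to a forwarding hook. It is a minimal sketch under stated assumptions, not the keepproxy implementation: the `Proxy` type, the `CanWrite` and `Forward` hooks, the use of the URL path as an MD5 hex digest, the `Authorization` header handling, and the `hash+size` response are all hypothetical choices made for this example.

```go
package main

import (
	"crypto/md5"
	"fmt"
	"io"
	"log"
	"net/http"
	"net/http/httptest"
	"strings"
)

// Proxy holds the hypothetical hooks this sketch depends on.
type Proxy struct {
	// CanWrite reports whether the presented credential may write (assumed hook).
	CanWrite func(authHeader string) bool
	// Forward sends a verified block to the internal Keep servers (assumed hook).
	Forward func(hash string, block []byte) error
	// MaxBlock caps how many bytes of a request body are buffered in RAM.
	MaxBlock int64
}

// VerifyBlock checks that block hashes to the hex MD5 digest given in hash.
// Using MD5 here is an assumption of this sketch.
func VerifyBlock(hash string, block []byte) error {
	if got := fmt.Sprintf("%x", md5.Sum(block)); got != hash {
		return fmt.Errorf("hash mismatch: got %s, want %s", got, hash)
	}
	return nil
}

// ServeHTTP handles "PUT /<hash>": check permission, buffer, verify, forward.
func (p *Proxy) ServeHTTP(w http.ResponseWriter, r *http.Request) {
	if r.Method != http.MethodPut {
		http.Error(w, "only PUT is supported", http.StatusMethodNotAllowed)
		return
	}
	if !p.CanWrite(r.Header.Get("Authorization")) {
		http.Error(w, "forbidden", http.StatusForbidden)
		return
	}
	hash := strings.TrimPrefix(r.URL.Path, "/")
	// The block is held in memory only; nothing is staged on disk.
	block, err := io.ReadAll(http.MaxBytesReader(w, r.Body, p.MaxBlock))
	if err != nil {
		http.Error(w, "block too large or unreadable", http.StatusRequestEntityTooLarge)
		return
	}
	if err := VerifyBlock(hash, block); err != nil {
		http.Error(w, err.Error(), http.StatusBadRequest)
		return
	}
	if err := p.Forward(hash, block); err != nil {
		http.Error(w, "could not store block", http.StatusBadGateway)
		return
	}
	fmt.Fprintf(w, "%s+%d\n", hash, len(block))
}

func main() {
	p := &Proxy{
		// Placeholder policy: any non-empty credential is accepted.
		CanWrite: func(h string) bool { return h != "" },
		// Placeholder forwarder: log instead of contacting real Keep servers.
		Forward: func(hash string, b []byte) error {
			log.Printf("would forward %s (%d bytes)", hash, len(b))
			return nil
		},
		MaxBlock: 64 << 20,
	}
	srv := httptest.NewServer(p) // throwaway local server for the demo
	defer srv.Close()

	block := "foo"
	url := fmt.Sprintf("%s/%x", srv.URL, md5.Sum([]byte(block)))
	req, err := http.NewRequest(http.MethodPut, url, strings.NewReader(block))
	if err != nil {
		log.Fatal(err)
	}
	req.Header.Set("Authorization", "OAuth2 example-token") // placeholder value
	resp, err := http.DefaultClient.Do(req)
	if err != nil {
		log.Fatal(err)
	}
	defer resp.Body.Close()
	body, _ := io.ReadAll(resp.Body)
	fmt.Printf("%s %s", resp.Status, body)
}
```

Keeping the permission check and the forwarding step behind function hooks keeps the handler independent of how tokens are validated and how the internal Keep servers are chosen, neither of which this sketch attempts to model.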

Approach

  1. Develop a subset of the Arvados Go SDK that supports accessing the API server and can write to Keep servers (reading from Keep is out of scope).
    • Read files in 64 MiB blocks and calculate hashes
    • Pack small files into a single block
    • Put 64 MiB blocks to Keep server over HTTPS
    • Create manifest (should be normalized form)
    • Write manifest to Keep
    • Use Google API client to talk to API server to create collection, metadata links
  2. Develop uploader program in Go to recursively upload a directory structure
    • Take API server, API token, directory path on the command line (+ additional metadata links to set on the collection after it is completed)
    • Should be self-contained static x64 ELF binary with minimal dependencies that will run on any modern x64 Linux.
    • Use Go Keep client library to upload blocks, create manifest, upload manifest to API server, add metadata links.
    • Should checkpoint during upload so that upload can be canceled and resumed.
  3. Reverse Keep Proxy
    • Publicly accessible head node providing write access into Keep (read access is out of scope for this task)
    • List proxy contact info in discovery document
    • Check API token to ensure client has permission to write
    • Accept blocks from client, forward them to internal Keep cluster. Extend existing Keep Go server by writing a new volume backend that writes to the appropriate internal Keep servers instead of to the disk.
    • Log the block hash and user UUID for each block to the API server
    • Writing to internal Keep servers and API server will use Arvados Go SDK
  4. API server
    • API call allowing normal users to create special user accounts that use a combination of limited permissions and scopes to restrict to uploading tasks. Scopes alone are not powerful enough because a scope cannot restrict the uploader to only creating links about collections known to the uploader.
    • Restricted to a few tasks, such as creating collections, creating metadata links about that collection.
    • Restricted account is owned by the Arvados user, so user can see and change everything the uploader account owns.
    • Can deactivate uploader account when done with it.
    • (This task can probably be separated from tasks 1-3 but is necessary to support delegation)
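
The client-side part of step 1 (reading data in fixed-size blocks, hashing each block, and building a manifest line) can be sketched in Go as follows. This is an illustrative sketch, not the SDK's API: the names `SplitBlocks`, `Locator`, and `ManifestLine` are hypothetical, the `<md5 hex>+<size>` locator and the `stream locators... position:size:name` line layout are assumptions of this example, and the `put` callback stands in for the HTTPS upload to a Keep server. Packing several small files into one block and escaping special characters in file names are left out.

```go
package main

import (
	"crypto/md5"
	"errors"
	"fmt"
	"io"
	"strings"
)

// BlockSize is the 64 MiB block size named in the approach.
const BlockSize = 64 << 20

// Locator returns an assumed "<md5 hex>+<size>" locator for a block.
func Locator(block []byte) string {
	return fmt.Sprintf("%x+%d", md5.Sum(block), len(block))
}

// SplitBlocks reads r in chunks of at most blockSize bytes, passes each chunk
// to put (a stand-in for the upload to Keep), and returns the locators in order.
// The slice given to put is reused and is only valid during the call.
func SplitBlocks(r io.Reader, blockSize int, put func(block []byte) error) ([]string, error) {
	if blockSize <= 0 {
		return nil, errors.New("blockSize must be positive")
	}
	var locators []string
	buf := make([]byte, blockSize)
	for {
		n, err := io.ReadFull(r, buf)
		if n > 0 {
			if perr := put(buf[:n]); perr != nil {
				return nil, perr
			}
			locators = append(locators, Locator(buf[:n]))
		}
		// io.EOF: no more data; io.ErrUnexpectedEOF: final short block was read.
		if err == io.EOF || err == io.ErrUnexpectedEOF {
			return locators, nil
		}
		if err != nil {
			return nil, err
		}
	}
}

// ManifestLine builds one line describing a single file that spans the given
// blocks: "<stream> <locator>... 0:<size>:<name>".
func ManifestLine(stream string, locators []string, size int64, name string) string {
	return fmt.Sprintf("%s %s 0:%d:%s\n", stream, strings.Join(locators, " "), size, name)
}

func main() {
	data := "hello keep"
	// A tiny block size keeps the demo readable; real uploads would use BlockSize.
	locators, err := SplitBlocks(strings.NewReader(data), 4, func(b []byte) error {
		fmt.Printf("would PUT %d bytes as %s\n", len(b), Locator(b))
		return nil
	})
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Print(ManifestLine(".", locators, int64(len(data)), "hello.txt"))
}
```

Because `SplitBlocks` holds only one block buffer at a time, the uploader's memory use stays near one block regardless of file size, which matters for the multi-terabyte datasets this proposal targets.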

Updated by Peter Amstutz over 10 years ago · 14 revisions