Feature #16519

open

[keepstore] optimize md5sum calculations

Added by Ward Vandewege over 4 years ago. Updated almost 2 years ago.

Status: New
Priority: Normal
Assigned To: -
Category: -
Target version: -
Start date:
Due date:
% Done: 0%
Estimated time:
Story points: -
Release:
Release relationship: Auto

Description

There is now a Go package that speeds up md5sum calculations when the hardware supports it (AVX2 or AVX512 extensions; AVX2 is common):

https://github.com/minio/md5-simd

which is described here:

https://blog.min.io/accelerating-aggregate-md5-hashing-up-to-800-with-avx512-2/

Keepstore should leverage this library to speed up its hashing, if the hardware it runs on supports the necessary extensions.

Ideally, this goes into our codebase in such a way that all our Go code that calculates md5sums leverages it automatically.


Related issues 3 (2 open, 1 closed)

Related to Arvados - Feature #16518: [keep] Allow clients to set a header to disable md5sum calculations in keepstore | New

Related to Arvados - Feature #16513: Get reference Keep performance numbers for Keep-on-S3 | Resolved | Ward Vandewege | 06/15/2020

Related to Arvados Epics - Story #18342: Stream Keep data to minimize latency and memory usage | New | 03/01/2023 | 07/31/2023

#1

Updated by Ward Vandewege over 4 years ago

  • Related to Story #16516: Run Keepstore on local compute nodes added
#2

Updated by Ward Vandewege over 4 years ago

  • Description updated (diff)
#3

Updated by Ward Vandewege over 4 years ago

  • Description updated (diff)
#4

Updated by Ward Vandewege over 4 years ago

  • Related to Feature #16518: [keep] Allow clients to set a header to disable md5sum calculations in keepstore added
#5

Updated by Ward Vandewege over 4 years ago

  • Description updated (diff)
#6

Updated by Ward Vandewege over 4 years ago

  • Related to Feature #16513: Get reference Keep performance numbers for Keep-on-S3 added
#7

Updated by Peter Amstutz about 3 years ago

  • Related to Story #18342: Stream Keep data to minimize latency and memory usage added
#8

Updated by Peter Amstutz about 3 years ago

  • Related to deleted (Story #16516: Run Keepstore on local compute nodes)
#9

Updated by Lucas Di Pentima almost 2 years ago

  • Release set to 60