Feature #18369
Updated by Ward Vandewege about 3 years ago
Describe the differences, the reason for them (e.g. information is not available due to access patterns), which write patterns would be most affected (many small files, presumably?).
Are there possible mitigation strategies to avoid duplication, e.g.:
* can we improve some of our clients to write normalized collections from the get-go?
* can we measure/predict which collections could be normalized?
* can we make a normalization tool?