Feature #18369

Updated by Ward Vandewege about 3 years ago

 
 Describe the differences, the reasons for them (e.g. information is not available due to access patterns), and which write patterns would be most affected (presumably many small files?).

 Are there mitigation strategies that would avoid duplication? For example:

 * can we improve some of our clients to write normalized collections from the get-go? 
 * can we measure/predict which collections could be normalized? 
 * can we make a normalization tool?  
