Actions
Bug #5713
open[FUSE] File access sometimes takes too long on su92l
Status:
New
Priority:
Normal
Assigned To:
Tom Morris
Category:
FUSE
Target version:
-
Start date:
04/13/2015
Due date:
% Done:
0%
Estimated time:
Story points:
-
Description
Every once in a while, accessing files or data from the keep mount becomes unusably slow from the shell nodes.
Sometimes file access can take anywhere from 5-15 minutes. I've mostly seen it on abe.shell.su92l and shell.su92l. The problem is intermittent and once the file successfully opens subsequence access is fast.
Updated by Brett Smith over 9 years ago
- Subject changed from Accessing file from my shell node keep mount becomes unusuably slow to [FUSE] File access sometimes takes too long on su92l
- Category set to FUSE
Updated by Abram Connelly over 9 years ago
abram@abe.su92l:~/keep/home/dbGaP Release v2/dbGap Release v2 23andMe data/b37.list$ time file b37.list b37.list: ASCII text real 22m17.655s user 0m0.008s sys 0m0.004s abram@abe.su92l:~/keep/home/dbGaP Release v2/dbGap Release v2 23andMe data/b37.list$ ls -lah b37.list -r-xr-xr-x 1 abram abram 5.9K Apr 22 16:33 b37.list
The keep mount was setup as follows:
arv-mount --debug --logfile .bcsdebug/log.20150423 keep
Here's what I think are the relevant log lines from the above log file:
arv-mount handle.entry [('.', <arvados_fuse.CollectionDirectory object at 0x26ad8d0>), ('..', <arvados_fuse.ProjectDirectory object at 0x22e0310> ), ('b37.list', <arvados_fuse.StreamReaderFile object at 0x22de550>)] Request fail: GET http://[keep7.su92l.arvadosapi.com]:25107/89434b3a149d83793ea56dd53951e97e+6006+Ab79cb458b34f47292416a3d83e73cd1e875b1505@554bb 3e2 => <class 'requests.exceptions.ReadTimeout'>: HTTPConnectionPool(host='keep7.su92l.arvadosapi.com', port=25107): Read timed out. (read timeou t=300) Request: GET http://[keep5.su92l.arvadosapi.com]:25107/89434b3a149d83793ea56dd53951e97e+6006+Ab79cb458b34f47292416a3d83e73cd1e875b1505@554bb3e2 404 response: 10 bytes in 35.9370708466 msec (0.000 MiB/sec) Request: GET http://[keep6.su92l.arvadosapi.com]:25107/89434b3a149d83793ea56dd53951e97e+6006+Ab79cb458b34f47292416a3d83e73cd1e875b1505@554bb3e2 404 response: 10 bytes in 23.3190059662 msec (0.000 MiB/sec) Request: GET http://[keep8.su92l.arvadosapi.com]:25107/89434b3a149d83793ea56dd53951e97e+6006+Ab79cb458b34f47292416a3d83e73cd1e875b1505@554bb3e2 Request fail: GET http://[keep8.su92l.arvadosapi.com]:25107/89434b3a149d83793ea56dd53951e97e+6006+Ab79cb458b34f47292416a3d83e73cd1e875b1505@554bb 3e2 => <class 'requests.exceptions.ReadTimeout'>: HTTPConnectionPool(host='keep8.su92l.arvadosapi.com', port=25107): Read timed out. (read timeou t=300) Request: GET http://[keep3.su92l.arvadosapi.com]:25107/89434b3a149d83793ea56dd53951e97e+6006+Ab79cb458b34f47292416a3d83e73cd1e875b1505@554bb3e2 Request fail: GET http://[keep3.su92l.arvadosapi.com]:25107/89434b3a149d83793ea56dd53951e97e+6006+Ab79cb458b34f47292416a3d83e73cd1e875b1505@554bb 3e2 => <class 'requests.exceptions.ReadTimeout'>: HTTPConnectionPool(host='keep3.su92l.arvadosapi.com', port=25107): Read timed out. (read timeou t=300) Request: GET http://[keep2.su92l.arvadosapi.com]:25107/89434b3a149d83793ea56dd53951e97e+6006+Ab79cb458b34f47292416a3d83e73cd1e875b1505@554bb3e2 404 response: 10 bytes in 276.965856552 msec (0.000 MiB/sec) Request: GET http://[keep1.su92l.arvadosapi.com]:25107/89434b3a149d83793ea56dd53951e97e+6006+Ab79cb458b34f47292416a3d83e73cd1e875b1505@554bb3e2 Request fail: GET http://[keep1.su92l.arvadosapi.com]:25107/89434b3a149d83793ea56dd53951e97e+6006+Ab79cb458b34f47292416a3d83e73cd1e875b1505@554bb 3e2 => <class 'requests.exceptions.ReadTimeout'>: HTTPConnectionPool(host='keep1.su92l.arvadosapi.com', port=25107): Read timed out. (read timeou t=300) Request: GET http://[keep4.su92l.arvadosapi.com]:25107/89434b3a149d83793ea56dd53951e97e+6006+Ab79cb458b34f47292416a3d83e73cd1e875b1505@554bb3e2 404 response: 10 bytes in 175.436019897 msec (0.000 MiB/sec) Request: GET http://[keep6.su92l.arvadosapi.com]:25108/89434b3a149d83793ea56dd53951e97e+6006+Ab79cb458b34f47292416a3d83e73cd1e875b1505@554bb3e2 Request fail: GET http://[keep6.su92l.arvadosapi.com]:25108/89434b3a149d83793ea56dd53951e97e+6006+Ab79cb458b34f47292416a3d83e73cd1e875b1505@554bb 3e2 => <class 'requests.exceptions.ReadTimeout'>: HTTPConnectionPool(host='keep6.su92l.arvadosapi.com', port=25108): Read timed out. (read timeou t=300)
Updated by Abram Connelly over 9 years ago
abram@abe.su92l:~/keepid/0581ad21ee2382f71f06f7249f4d676c+80/out-data$ time file get-evidence.json get-evidence.json: ASCII text, with very long lines real 49m4.585s user 0m0.024s sys 0m0.004s abram@abe.su92l:~/keepid/0581ad21ee2382f71f06f7249f4d676c+80/out-data$ ls -lah total 3.9M dr-xr-xr-x 1 abram abram 0 May 5 18:01 . dr-xr-xr-x 1 abram abram 0 Jan 1 1970 .. -r-xr-xr-x 1 abram abram 3.9M Jan 1 1970 get-evidence.json abram@abe.su92l:~/keepid/0581ad21ee2382f71f06f7249f4d676c+80/out-data$ date Tue May 5 19:01:20 UTC 2015
Updated by Brett Smith over 9 years ago
- Target version changed from Bug Triage to Arvados Future Sprints
Our current thinking is that #5748 is the main culprit here. We're very interested to hear how things look after the next deploy.
Updated by Tom Morris almost 8 years ago
- Assigned To set to Tom Morris
Has this problem been seen recently? It was reported over 2 years ago and a lot has changed since then, include a bunch of improvements to the performance of various components.
Updated by Ward Vandewege over 3 years ago
- Target version deleted (
Arvados Future Sprints)
Actions