Peter;
Thanks for the tour of installing and testing this on Friday. I've got cwl-runner set up from your branch and was able to kick off an initial test run. Awesome.
I'm running into an issue where I think the bcbio/bcbio image on qr1hi (https://cloud.curoverse.com/collections/qr1hi-4zz18-doidmcskcmhn2bm) is out of date. How do we refresh it to the latest version?
The run I got started is here:
https://cloud.curoverse.com/pipeline_instances/qr1hi-d1hrv-nybexwq0vehhuu4
and was failing with this error:
2016-02-27 18:42:32 arvados.cwl-runner[5027] ERROR: Got exception while collecting job outputs:
Traceback (most recent call last):
  File "build/bdist.linux-x86_64/egg/arvados_cwl/__init__.py", line 197, in done
    outputs = self.collect_outputs(self.builder.outdir)
  File "build/bdist.linux-x86_64/egg/cwltool/draft2tool.py", line 235, in collect_output_ports
    ret[fragment] = self.collect_output(port, builder, outdir)
  File "build/bdist.linux-x86_64/egg/cwltool/draft2tool.py", line 316, in collect_output
    adjustFileObjs(r, revmap)
  File "build/bdist.linux-x86_64/egg/cwltool/process.py", line 154, in adjustFileObjs
    adjustFileObjs(d, op)
  File "build/bdist.linux-x86_64/egg/cwltool/process.py", line 154, in adjustFileObjs
    adjustFileObjs(d, op)
  File "build/bdist.linux-x86_64/egg/cwltool/process.py", line 149, in adjustFileObjs
    op(rec)
  File "build/bdist.linux-x86_64/egg/cwltool/draft2tool.py", line 78, in revmap_file
    raise WorkflowException("Output file path %s must be within designated output directory (%s) or an input file pass through." % (f["path"], builder.outdir))
WorkflowException: Output file path align_prep/7_100326_FC6107FAAXX-1.fq.gz must be within designated output directory (keep:d586abc216dd7011f2e57eecc674f804+469) or an input file pass through.
I believe this is due to relative paths in the output JSON. This was fixed in bcbio a couple of weeks back, along with the corresponding fix to cwltool (https://github.com/common-workflow-language/cwltool/pull/40), so I hope refreshing the container will fix it. It would also be great if I could update the container on demand, since the latest version also contains a lot of new functionality for the Microsoft work (variant calling, validation, SNAP support) that will probably need a few more iterations.
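To illustrate what I think is happening, here is a minimal sketch of the kind of containment check that rejects a relative output path. This is not the actual cwltool code: the function names is_within and resolve_output and the /tmp/job_output directory are made up for the example, and it only assumes that the check compares the path against the output directory as a prefix.

```python
import posixpath


def is_within(path, outdir):
    """Return True if `path` is `outdir` itself or lies beneath it."""
    path = posixpath.normpath(path)
    outdir = posixpath.normpath(outdir)
    return path == outdir or path.startswith(outdir + "/")


def resolve_output(path, outdir):
    """Anchor a relative output path at `outdir`; leave absolute paths alone."""
    if posixpath.isabs(path):
        return posixpath.normpath(path)
    return posixpath.normpath(posixpath.join(outdir, path))


# Hypothetical output directory, standing in for builder.outdir.
outdir = "/tmp/job_output"
# Relative path of the form reported in the error message.
relative = "align_prep/7_100326_FC6107FAAXX-1.fq.gz"

# The relative path as written does not start with the output directory.
print(is_within(relative, outdir))
# Once anchored at the output directory, the same path passes the check.
print(is_within(resolve_output(relative, outdir), outdir))
```

With a prefix-style check like this, a path such as align_prep/... fails simply because it was never joined to the output directory, which matches the message above.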
For reference, the up-to-date CWL I'm running is here:
https://s3.amazonaws.com/bcbio/cwl/test_bcbio_cwl.tar.gz
I've also written up skeleton documentation on running this and will push that out once I've got a working run. Thanks again for this, I'm excited to have this so close to running.