Revision 88 (Lisa Knox, 10/01/2025 07:26 PM) → Revision 89/91 (Lisa Knox, 10/01/2025 07:27 PM)

{{>toc}} 

 h1. Hacking prerequisites 

 This page describes how to install all the software necessary to develop Arvados and run tests. 

 h2. Host options 

 You must have a system running a supported distribution. That system can run directly on hardware, on a cloud instance, or in a virtual machine. 

 h3. Supported distributions 

 As of June 2025/Arvados 3.1, these instructions and the entire test suite are known to work on Debian 12 "bookworm." 

 You may try these instructions and tests on Ubuntu 22.04 "jammy" or 24.04 "noble," but they have not been tested there and you may run into bugs. 

 These instructions are not suitable for any Red Hat-based distribution. Our Ansible playbook will refuse to run on them. 
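As a rough illustration of the support matrix above, the sketch below reads @/etc/os-release@ and prints one of four labels. The function name @classify_distro@, the labels, and the use of @ID_LIKE@ to spot Red Hat-based systems are assumptions made for this example; the playbook performs its own check and may use different logic.

```shell
#!/bin/sh
# Hypothetical helper: map os-release fields to the support levels
# described on this page. Labels are illustrative only.
classify_distro() {
  id="$1"; version="$2"; like="${3:-}"
  # Assumption: Red Hat-based systems identify as rhel, fedora, or centos
  # in either ID or ID_LIKE.
  case " $id $like " in
    *" rhel "*|*" fedora "*|*" centos "*) echo unsupported; return ;;
  esac
  case "$id:$version" in
    debian:12) echo tested ;;
    ubuntu:22.04|ubuntu:24.04) echo untested ;;
    *) echo unknown ;;
  esac
}

# Source os-release in a subshell so its variables do not leak into the caller.
if [ -r /etc/os-release ]; then
  (
    . /etc/os-release
    classify_distro "${ID:-}" "${VERSION_ID:-}" "${ID_LIKE:-}"
  )
fi
```

Anything other than @tested@ means you are outside the configuration this page was verified against.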

 h3. Base configuration 

 On your development system, you should have a user account with full permission to use sudo. 

 You can also run the Ansible playbook from one system to set up a development environment on another. To do this, you must be able to SSH into your user account on the development system you're installing (the "target node") from the system running Ansible (the "control node"). 

 h3. Virtual machine requirements 

 If you run your development system in a virtual machine, the guest needs permission to perform the operations below. Many environments allow them by default, but your virtual machine setup may restrict them. 

 * It must be able to create and manage FUSE mounts (@/dev/fuse@) 
 * It must be able to create and run Docker containers 
 * It must be able to create and run Singularity containers—this requires creating and managing block loopback devices (@/dev/block-loop@) 
 * It must have the @fs.inotify.max_user_watches@ sysctl set to at least 524288. Our Ansible playbook will try to set this on the managed host, but if it is unable to do so, you may need to set it on the parent host instead. 
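Two of the requirements above can be probed before installing anything. The following is a minimal sketch, not part of the Arvados tooling: the function names @watches_ok@ and @preflight@ are made up for this example, and it only checks for the FUSE device and the current inotify limit. Docker and Singularity are left out because they may not be installed yet.

```shell
#!/bin/sh
# Minimum inotify watch limit named in the list above.
MIN_WATCHES=524288

# Succeed when the given value is a number at or above the minimum.
watches_ok() {
  [ "$1" -ge "$MIN_WATCHES" ] 2>/dev/null
}

# Print one status line per check; never fails, it only reports.
preflight() {
  if [ -c /dev/fuse ]; then
    echo "/dev/fuse: present"
  else
    echo "/dev/fuse: missing"
  fi

  # Fall back to 0 when the sysctl file cannot be read.
  current=$(cat /proc/sys/fs/inotify/max_user_watches 2>/dev/null || echo 0)
  if watches_ok "$current"; then
    echo "fs.inotify.max_user_watches: $current (ok)"
  else
    echo "fs.inotify.max_user_watches: $current (below $MIN_WATCHES)"
  fi
}

preflight
```

If the inotify limit is reported as too low inside a guest and cannot be raised there, that is the case where it has to be set on the parent host.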

 h2. Install development environment with Ansible 

 h3. Clone Arvados source 

 You will need the Arvados source code to follow this process. 

 <pre><code class="sh">$ git clone https://git.arvados.org/arvados.git</code></pre> 

 If you want to switch to a specific branch or revision like @3.1-release@, do that here. 
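A small sketch of switching an existing clone to a given ref follows. The wrapper @checkout_ref@ is a hypothetical name used only here; it runs plain @git checkout@ and prints the short commit ID so you can confirm where you ended up.

```shell
#!/bin/sh
# Hypothetical helper: check out a branch, tag, or commit in an existing
# clone, then print the short ID of the commit now checked out.
checkout_ref() {
  repo="$1"; ref="$2"
  git -C "$repo" checkout --quiet "$ref" &&
    git -C "$repo" rev-parse --short HEAD
}

# Only act when a clone is present in the current directory.
if [ -d arvados/.git ]; then
  checkout_ref arvados 3.1-release
fi
```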

 h3. Install Ansible 

 Install Ansible following the instructions in @arvados/tools/ansible/README.md@. This ensures you get the right versions of everything. 

 Make sure you have your Ansible virtualenv activated when you run the steps below. 
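One way to confirm the virtualenv is active is sketched below. It relies on the @VIRTUAL_ENV@ variable that virtualenv activation scripts export; the function name @ansible_env_ready@ and its return codes are assumptions for this example.

```shell
#!/bin/sh
# Hypothetical helper: return 1 when no virtualenv is active, 2 when
# ansible-playbook is not on PATH, otherwise print its version line.
ansible_env_ready() {
  if [ -z "${VIRTUAL_ENV:-}" ]; then
    echo "no virtualenv is active" >&2
    return 1
  fi
  if ! command -v ansible-playbook >/dev/null 2>&1; then
    echo "ansible-playbook is not on PATH" >&2
    return 2
  fi
  ansible-playbook --version | head -n 1
}

ansible_env_ready || echo "Activate your Ansible virtualenv before continuing."
```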

 h3. Write an Arvados database configuration 

 Make a copy of the default test configuration: 

 <pre><code class="sh">$ cp arvados/tools/ansible/files/default-test-config.yml ~/zzzzz-config.yml</code></pre> 

 You can copy the file to a different location if you like. This page uses @~/zzzzz-config.yml@ as the placeholder path throughout. 

 Edit this file with the database configuration you'd like to use. The cluster ID *must* be @zzzzz@. You can change the @user@, @password@, and @dbname@ settings freely. Our Ansible playbook will configure PostgreSQL so your settings here work. 

 The playbook will always install the @postgresql@ server package. It will *not* change any PostgreSQL configuration except to add @pg_hba.conf@ entries for this user. You should only change @host@ and @port@ if you need to use a PostgreSQL server that is already installed and running somewhere else. 
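After editing, a quick sanity check that the cluster ID is still @zzzzz@ might look like the sketch below. It assumes the file follows the usual Arvados layout in which the cluster ID appears as an indented @zzzzz:@ key on its own line; the helper name @has_zzzzz_cluster@ is made up for this example, and the check does not validate anything else in the file.

```shell
#!/bin/sh
# Hypothetical helper: succeed when the file has an indented "zzzzz:" key
# on a line of its own. Assumes the cluster ID is written that way.
has_zzzzz_cluster() {
  grep -Eq '^[[:space:]]+zzzzz:[[:space:]]*$' "$1"
}

config="$HOME/zzzzz-config.yml"
if [ -r "$config" ]; then
  if has_zzzzz_cluster "$config"; then
    echo "cluster ID zzzzz found in $config"
  else
    echo "cluster ID zzzzz not found in $config"
  fi
fi
```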

 h3. Write an Ansible inventory 

 An inventory file tells Ansible what host(s) to manage, how to connect to them, and what settings they use. Write an inventory file to @~/zzzzz-inventory.yml@ like this: 

 <pre><code class="yaml">arvados_test_all: 
   # This is the list of host(s) where we're installing the test environment. 
   # This example installs on the same system running Ansible. 
   # If you want to manage remote hosts, you can write your own host list: 
   # <https://docs.ansible.com/ansible/latest/getting_started/get_started_inventory.html> 
   hosts: 
     localhost: 
       ansible_connection: local 
   vars: 
     # The path to the Arvados cluster configuration you wrote in the previous section. 
     arvados_config_file: "{{ lookup('env', 'HOME') }}/zzzzz-config.yml" 

     # The primary user doing Arvados development and tests. 
     # This user will be added to the `docker` group. 
     # It defaults to the name of the user running `ansible-playbook`. 
     # If you want to configure a different user, set that here: 
     #arvados_dev_user: USERNAME 

     # The authentication mechanism to allow in `pg_hba.conf`. 
     # The default is `scram-sha-256`, which is the most secure method on the most 
     # recent versions of PostgreSQL. 
     # If your development system is running Debian 11, set this to `md5` here. 
     #arvados_postgresql_hba_method: md5 
 </code></pre> 
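Before running the playbook, you can ask Ansible to parse the inventory and show what it found. The sketch below wraps @ansible-inventory --graph@, which prints the group and host tree; the wrapper name @check_inventory@ and its return codes are assumptions for this example.

```shell
#!/bin/sh
# Hypothetical helper: return 1 when the inventory file is unreadable,
# 2 when ansible-inventory is not on PATH, otherwise print the host tree.
check_inventory() {
  inv="$1"
  if [ ! -r "$inv" ]; then
    echo "cannot read inventory: $inv" >&2
    return 1
  fi
  if ! command -v ansible-inventory >/dev/null 2>&1; then
    echo "ansible-inventory is not on PATH" >&2
    return 2
  fi
  ansible-inventory -i "$inv" --graph
}

check_inventory "$HOME/zzzzz-inventory.yml" || echo "Inventory check did not pass."
```

With the example inventory above, the tree should show @localhost@ under the @arvados_test_all@ group.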

 h3. Run the playbook 

 The basic command to run the playbook is: 

 <pre><code class="sh">$ source ~/ansible/bin/activate 
 $ cd arvados/tools/ansible 
 $ ansible-playbook -K -i ~/zzzzz-inventory.yml install-dev-tools.yml 
 </code></pre> 

 When you are prompted for the @BECOME password:@, enter the password for your user account on the development host that lets you run @sudo@ commands. 

 @ansible-playbook@ has many options to control how it runs that you can add if you like. Refer to "the @ansible-playbook@ documentation":https://docs.ansible.com/ansible/latest/cli/ansible-playbook.html for more information. 

 After the playbook runs successfully, you should be able to run the Arvados tests from a source checkout on your development host. For example: 

 <pre><code class="sh">$ cd arvados 
 $ WORKSPACE="$PWD" build/run-tests.sh --temp ~/arvados-test --interactive 
 </code></pre> 

 Refer to [[Running tests]] for details. 

 h3. Troubleshooting 

 The playbook writes your database configuration to @~/.config/arvados/config.yml@ and installs a hook, @/etc/profile.d/arvados-test.sh@, that sets your @CONFIGSRC@ environment variable to the directory containing that file (@~/.config/arvados@). If most tests fail with a database connection error, check that this variable is set: 

 <pre><code class="sh">$ echo "${CONFIGSRC:-UNSET}"</code> 
 /home/you/.config/arvados 
 </pre> 

 If that reports @UNSET@, add a line that sets @CONFIGSRC="$HOME/.config/arvados"@ to your shell configuration, or set it manually when you run @run-tests.sh@: 

 <pre><code class="sh">$ WORKSPACE="$PWD" CONFIGSRC="$HOME/.config/arvados" build/run-tests.sh ... 
 </code></pre> 
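For the shell configuration route, a guarded version of that line could look like the following sketch. The function name @ensure_configsrc@ is made up for this example. It leaves an existing @CONFIGSRC@ alone and only sets the variable when the playbook-written @config.yml@ is actually present.

```shell
#!/bin/sh
# Hypothetical helper for a shell startup file: export CONFIGSRC only when
# it is unset or empty and the expected config file exists.
ensure_configsrc() {
  dir="$HOME/.config/arvados"
  if [ -z "${CONFIGSRC:-}" ] && [ -f "$dir/config.yml" ]; then
    CONFIGSRC="$dir"
    export CONFIGSRC
  fi
}

ensure_configsrc
echo "CONFIGSRC=${CONFIGSRC:-UNSET}"
```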

 h3. Notes 

 The playbook will install symlinks for Go, Node, Singularity, and Yarn under @/usr/local/bin@. The actual tools are installed under @/opt@. If you need different versions of these tools for other work on this system, you'll need to customize your @PATH@ environment variable so the Arvados versions are found first when you're doing Arvados work.
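One possible way to put the Arvados tool versions first is to move @/usr/local/bin@ to the front of @PATH@ in the shell where you do Arvados work. The sketch below does that without leaving duplicate entries; the function name @path_prepend@ is an assumption for this example, not something the playbook installs.

```shell
#!/bin/sh
# Hypothetical helper: move (or add) a directory to the front of PATH,
# removing any other occurrences of it.
path_prepend() {
  dir="$1"
  # Wrap PATH in colons so every entry, including first and last, is ":entry:".
  p=":$PATH:"
  while :; do
    case "$p" in
      *":$dir:"*) p=${p%%":$dir:"*}:${p#*":$dir:"} ;;
      *) break ;;
    esac
  done
  # Strip the wrapping colons again.
  p=${p#:}
  p=${p%:}
  PATH="$dir${p:+:$p}"
  export PATH
}

path_prepend /usr/local/bin
```

Run this only in sessions used for Arvados work if other projects on the same system need different versions of Go, Node, Singularity, or Yarn.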