Downloading Data from Biobank Japan
how_to
The same info can be found at this page, https://www.ddbj.nig.ac.jp/jga/download-e.html, but the steps may be slightly different because we applied for Biobank Japan data before they integrated with the D-way system.
Register a user account:
- A D-way account should have been created for you, with your username and password sent in an email. If not, create an account here: https://ddbj.nig.ac.jp/D-way/contents/general/reserve_account_page
- Add UChicago as your organization: Log in with D-way account, https://ddbj.nig.ac.jp/D-way/. Click the account tab at the top right corner, and at the bottom, type “The University of Chicago” in Center Full Name. Once selected, center name autofills with “U_CHICAGO”.
- Register your public key: You may need to click update and refresh for the Public Key box to appear at the bottom. You will need to copy your public key to an unhidden folder,
cp ~/.ssh/id_rsa.pub ~/Desktop
, before selecting for upload. If you do not find a file namedid_rsa.pub
in.ssh
, create a public key withssh-keygen -t rsa
. Note: The keypair you register will be used for downloading and decrypting data, and cannot be changed. Choose a public key generated by a lab computer (Mac or Linux) or the machine where the data will be stored.
Download Data:
- Connect to JGA server:
sftp -i id_rsa -P 443 <D-way username>@jga-gw.ddbj.nig.ac.jp
cd controlled-access/download/jga/
- Download genotypes:
get -r J-DU000138/JGAS000114/JGAD000123
- Download a phenotype:
get -r J-DU000138/JGAS000114/JGAD000***
(each folder contains encrypted individual data for one phenotype) - Download decryption tools:
get -r J-DU000138/tools
I wrote a batch script and ran the following lines to download all the phenotypes:
ssh-add
cd BBJ
mkdir JGAS000114
cd JGAS000114
sftp -i ~/.ssh/id_rsa -P 443 -b get_phenotypes.txt scmi@jga-gw.ddbj.nig.ac.jp
Decrypt Data
cd BBJ
unzip tools/J-DU000138.tool.zip
chmod 754 J-DU000138.decrypt.sh
chmod 754 JGAS000114/**/**/*.decrypt.sh
- Decrypt:
./J-DU000138.decrypt.sh -k ~/.ssh/id_rsa -p <private key password>
If you find the following error message:
When I ran into this issue, the problem was that the tool folder provided was not encrypted with my public. If you email the NBDC contact with your public key, the updated tools folder should work.
Reuse
© HakyImLab and Listed Authors - CC BY 4.0 for Text and figures - MIT for code