Topics in 2023
Reanalysis dataset of public data of human whole genome analysis
5 Jun 2023.
We will share Reanalysed human whole genome analysis data that have been published in public databases as open access data on the NIG supercomputer.
These data were downloaded from public databases for analysis in the Ministry of Education, Culture, Sports, Science and Technology-JAPAN(MEXT)'s Grant-in-Aid for Scientific Research on Innovative Areas "Elucidation of the origin and establishment of the Yaponesians using genome sequences as a core" (Yaponesians genome) and re-analysed on the NIG supercomputer. These data will be stored as shared data for the users of the NIG supercomputer so that they can be widely used for research after the research group finishes (from April 2023).
All data are all open access registered with SRA. For sample background and conditions of use, refer to the original papers and use at user's own risk.
We share files in CRAM format mapped to GRCh38 and files in gVCF format analysed with the GATK4 or Parabricks HaplotypeCaller algorithm.
You can get a pipeline with equivalent analysis content at https://github.com/NCGM-genome/WGSpipeline
Also, you can get the dataset from the NIG supercomputer in the following way.
- For all compute nodes in the general analysis division, the datasets are mounted under
/usr/local/shared_data/public-human-genomes/GRCh38/
and can be accessed and used directly from the analysis programmes in the NIG supercomputer .- The personal genome analysis division is currently being prepared.
- HTTPS: https://ddbj.nig.ac.jp/public/public-human-genomes/GRCh38/
- FTP: ftp.ddbj.nig.ac.jp/public-human-genomes/GRCh38
DATASET | DATA SOURCE | URL |
International 1000Genomes Project | https://www.ebi.ac.uk/ena/browser/view/PRJEB31736 | https://doi.org/10.1016/j.cell.2022.08.004 |
Human Genome Diversity Project | https://www.ebi.ac.uk/ena/browser/view/PRJEB6463 | https://doi.org/10.1126/science.aay5012 |
Simons Genome Diversity Project | https://www.ebi.ac.uk/ena/browser/view/PRJEB9586 | https://doi.org/10.1038/nature18964 |
Korean Personal Genomics Project | https://www.ebi.ac.uk/ena/browser/view/PRJNA284338 | https://doi.org/10.1038/s41598-018-23837-x |
Archaea tools(formerly HCPtools)
10 Jan 2023.
In October 2022, the HCPtools software provider launched the brand name 'Bytix' as a brand name for data transfer systems and the product name was changed from 'HCPtools' to 'Archaea tools'.
For more information on the change, 🔗refer to the Bytix official page 'Product name change etc.'
After the product name change, you can still use the same commands that you have been using for HCPtools.
In addition to Windows and Linux, also available for macOS. If you would like to know more, 🔗refer to the Bytix official page 'Supported Platforms'.
For information on how to use Archaea tools (formerly HCPtools), 🔗see the official Bytix page 'Documents'.
When you transfer files to and from the NIG supercomputer, you can use `scp` or `sftp`, which are widely used as the file transfer software. But their transfer speed is slow when a large numbers of files are transfered over long distances. For high-speed file transfer over long distances, the file transfer software Aspera is available on the general analysis division, and Archaea tools(formerly HCPtools) is available on the personal genome analysis division on the NIG supercomputer. |