The Office of Information Technology (OIT), High Performance & Scientific Computing (HPSC) group has initiated more targeted support for the Genomics Community. We have had two annual Symposium’s to bring the community together and discuss how the infrastructure and environment can be improved to support this important community. Access to the ISAAC cluster resources is available to the UT Genomics Community in the form of storage access and compute resource access including conventional compute, large memory compute nodes and GPU nodes. Additionally, we are working with the UTK Genomics Core to support the storage and processing of data from the Genomics Core instruments, specifically the new in 2022 Illumina NovaSeq 6000 and the MiSeq instruments. We are adding additional services including PostgreSQL database access and access to a server version of CLC QIAGEN.
OIT HPSC has been working closely with the Genomics Core staff to make data from the Illumina Novaseq instrument more readily available on the ISAAC NG cluster. NovaSeq and MiSeq data are now automatically saved to the ISAAC NG Lustre file system and, when needed, converted from BCL to FASTQ format once the instrument run is complete eliminating the need and use of Illumina BaseSpace in the cloud. Access to this data can be made available to all collaborators who need the DNA sequence data. This includes UT persons and persons who collaborate but do not have a UT NetID or access to the ISAAC cluster resources. Access is provided by the methods described below.
To access data delivered to the ISAAC cluster directly from the UT Genomics Core instruments, researchers will need to create an ISAAC cluster account (see “Request an ISAAC Account” in menu to the left) if that does not already exist. Also, a project to be associated with this data needs to exist or be created. Requesting an account is self service and requesting a project can be done via submitting a service request (ticket) or self service through the ISAAC portal (https://portal.acf.utk.edu). Also, if there are additional researchers to be members of the research group for the project then each individual researcher can request access via the portal or the PI can submit the request for all members via the service request process.
Once a researcher has an ISAAC account and associate project then make the request for the DNA sequencing to UTK Genomics Core staff and give them the associated NetID and project identifier. The data will be generated on the instrument and put into a service account on the ISAAC NG cluster. The FASTQ files, if necessary, will be generated. Once those are generated the files will be copied to the user’s project directory.
Once the data is available on the ISAAC cluster can be transferred via Globus to where it needs to end up or it can be used on the ISAAC NG cluster and use available ISAAC NG resources to process the data.
OIT HPSC also provides a capability for use by researchers using the UTK Genomics Core to provide data to external organization’s (ORNL, other Universities, etc.) researchers making use of the UTK Genomics Core by using the Globus File Collection sharing capabilities. The data can be put into a special directory and the OIT HPSC staff or UT Genomics Core staff can enable sharing of this data via Globus using the Globus File Sharing capabilities with collections. More details about the capability are available from Globus at https://www.globus.org/data-sharing. If you need this service please let the UT Genomics Core staff know and they will work with OIT HPSC staff to enable access.