Skip to content Skip to main navigation Report an accessibility issue
High Performance & Scientific Computing

Data Transfer



Introduction

The ISAAC Legacy (formerly ACF) cluster provides several ways for transferring files to/from the NFS home directories, NFS project directories, Lustre project directories, and Lustre scratch directories. Data Transfer Nodes (DTNs) furnish this capability and are implemented with Globus and Secure Shell services. At the time of this writing, there are four DTNs available to ISAAC Legacy users. Table 1.1 shows these nodes.

Data Transfer NodeHostname (for SCP)Globus endpoint
datamover1datamover1.acf.utk.eduUTK ISAAC Legacy 1
datamover2datamover2.acf.utk.eduUTK ISAAC Legacy 2
datamover3datamover3.acf.utk.eduUTK ISAAC Legacy 3
datamover4datamover4.acf.utk.eduUTK ISAAC Legacy 4

Although these servers previously used an InCommon Credential, all four datamover DTNs are now running Globus Connect Server version 5.4 which uses UTK Central Authentication Service and no longer uses certificates.

To login and obtain a command line access to these DTNs, use an SSH client to login to the Hostname listed in the table above. More information on SSH usage can be found in the Access and Login document. Replace the hostname of the login node with the hostname of the DTN to which you wish to connect, then authenticate with your UT NetID, password, and Duo TFA.

The ISAAC Legacy cluster supports several data transfer protocols including SCP, SFTP, GSISCP, and Globus. SCP and SFTP are both SSH utilities available for transferring files but perform slower than Globus. At the time of this writing, Globus offers the fastest data transfers on the Open Enclave. Still, SCP and SFTP are useful for small transfers. GSISCP is used for unattended data transfers.

SCP and SFTP

SCP and SFTP are both SSH commands available for transferring files to/from ISAAC Legacy using these SSH client commands. Note SSH transfers are slower than Globus transfers. From our performance testing, Globus offers the fastest data transfers to/from ISAAC Legacy DTNs. Still, SCP and SFTP can be useful for quick, small transfers. For larger file transfers, we recommend you use Globus.

SCP and SFTP are available to Linux and MacOS systems by default. Windows 10 users with the most recent updates can use these utilities within Command Prompt or PowerShell. Windows 7 and 8 users must use a third-party utility to use SCP and SFTP. For more information on SSH in Windows, see the Access and Login document. For Windows 7 and 8 users, the third-party utilities FileZilla and WinSCP are reviewed later in this document.

The general syntax of SCP is given below. In general, SCP is useful when transferring a file on your system to a DTN in ISAAC Legacy. The <source> argument is the pathname of the file on your system that you wish to copy. The <destination> (in this case, datamover1) argument is the hostname of the datamover you wish to use. Additionally, the <directory> argument specifies the absolute pathname within the destination to place the file.

scp <source> <NetID>@datamover1.acf.utk.edu:<directory>

If you wanted to copy a file from your system and place it on the ACF, you could use 

scp ~/<filename><NetID>@acf-login.acf.utk.edu:~/Documents

For SFTP, you specify the hostname of the system to which you intend to connect. For example, to securely transfer files between your local system and a DTN in ISAAC Legacy, use the syntax below in a terminal on your local system. Ensure that you enter SFTP from the directory that contains the file(s) you wish to copy to a DTN in ISAAC Legacy. You can use the pwd command to determine your current directory before entering SFTP.

sftp <NetID>@datamover1.acf.utk.edu

Once you authenticate with your UT NetID, password, and Duo TFA, you will enter SFTP’s interactive mode. Use the put <file> command to upload a file to the Open Enclave. For example, to upload a file named JobScript.sh to DTN from your local machine, use put JobScript.sh. This syntax assumes that the JobScript.sh file is in the directory from which you entered SFTP.

To retrieve files from DTN, use the get <file> command. To download a file named ResearchResults.txt from the DTN to your local machine, use get ResearchResults.txt. SFTP will place the file in the directory from which you entered the utility. To change directories on the DTN, use the cd <directory> command. Use the lcd <directory> command to change the directory on your local system. Once you are done with SFTP, use the bye or exit commands to exit it. Other commands are available with the SFTP utility. Type help within SFTP to read more about them.

Globus Web-based Transfers

The Globus web interface allows you to conveniently perform data transfers to and from ISAAC Legacy DTNs. All four datamovers have transitioned to Globus Connect Server 5 and use regular UTK Central Authentication Service (CAS) for authentication. InCommon credentials should no longer be used.

Using the Globus Web Interface

To access the Globus interface in your browser, navigate to the Globus website. Login using the existing organizational login option. Verify that the University of Tennessee is selected, then select “Continue.” Authenticate with your UT NetID, password, and Duo TFA. You will then see the interface depicted in Figure 3.1.

Figure 3.1 – Initial Globus Interface

Before you can initiate file transfers between your local machine and the DTN, you must configure endpoints. One endpoint will reference your local system while the other will reference one of the datamover DTNs. Further instructions on these endpoints will be provided below.

To configure the endpoints in the Globus interface, select the “Endpoints” tab on the left-side of the page. You will then see a page similar to Figure 3.2. At the top-right of the page, select “Create new endpoint.” On the endpoint type selection page, choose “Globus Connect Personal.”

Figure 3.2 – Globus Endpoint Menu

On the next page, name the endpoint. The name you choose is unimportant; however, it should be something memorable. After you name the endpoint, generate a setup key for the Globus Connect Personal client software. The option to generate the key is listed under Step 2 in Figure 3.3. Copy this key. Finally, download and install the Globus Connect Personal client software. When prompted, enter the setup key you copied to configure your local machine as an endpoint. Refer to Figure 3.3 for a screenshot of the endpoint creation page.

Figure 3.3 – Globus Endpoint Creation Menu

Once you configure your local machine as a Globus endpoint, return to the “File Manager” tab on the left-side of the page. Make sure you select the double panels option in the top-right of the page (Figure 3.9 highlights this option). This will display your local machine’s filesystem in addition to the datamover’s. Once both panels are displayed, click on “Collection” in the left panel. Type the name of your endpoint in the search bar or find it under “My Collections.”

After your endpoint has been selected, you will return to the File Manager. In the right panel, click on “Collection.” Search for one of the two datamovers. The endpoint names of these DTNs are given below.

  • UTK ISAAC-NG DTN1
  • UTK ISAAC-NG DTN2
  • UTK ISAAC-NG DTN3
  • UTK ISAAC-NG DTN4
Figure 3.4 – Globus File Transfer Interface

Once both endpoints are configured, you can transfer data between the two. You can select individual files and directories for these transfers. When you select the data you wish to transfer, press the “Start” button below the endpoint from which you will transfer data. Additionally, you can navigate throughout the filesystem hierarchy in either endpoint using the Globus interface. Other options are available for your transfers, but they are usually unnecessary for most transfers. Figure 3.4 shows what the Globus interface should look like when both endpoints are selected.

Using FileZilla to Transfer Files

FileZilla will work with file transfers to the Open Enclave. Please only use the DTNs listed in Table 1.1 at the beginning of this document.

To use the FileZilla client with your NetID, password, and Duo TFA, follow these steps.

  1. Open the FileZilla client.
  2. Select File, then Site Manager.

    Figure 5.1 – FileZilla’s Site Manager Option
  3. Select “New Site,” then provide the necessary information. For the host, select one of the datamovers listed in Table 1.1. For protocol, select SFTP – SSH File Transfer Protocol. For Logon Type, select Interactive. For User, type your UT NetID. Finally, rename the entry under sites from “New Site” to something more memorable, such as the name of the datamover you chose to use. Refer to Figures 5.2 and 5.3 to identify where to find these options.

    Figure 5.2 – New Site in FileZillaFigure 5.3 – FileZilla Site Options
  4. Select Transfer Settings, then check the box for Limit the number of simultaneous connections. Make sure the value beneath this checkbox is 1.
  5. Select “Connect” in the Site Manager window.
  6. When prompted, enter your password.
  7. Figure 5.4 – FileZilla Password Prompt
  8. When prompted, type a “1” to send a Duo Push to your mobile device, then authenticate with Duo TFA. Upon successful authentication, you will be logged in to the datamover through FileZilla.

    Figure 5.5 – FileZilla Duo Prompt

Using WinSCP to Transfer Files

WinSCP can perform file transfers to and from the DTNs. Please use the DTNs listed in Table 1.1 at the beginning of this document.

To use the WinSCP client with your NetID, password, and Duo TFA, follow these steps.

  1. Open WinSCP, then click on “New Site.”
  2. Provide the hostname of the datamover for “Host name,” your UT NetID for “User name,” and your password. Leave the port number as 22.

    Figure 6.1 – WinSCP New Site Creation
  3. When warned about an unknown server, select “Yes.”

    Figure 6.2 – Initial WinSCP Key Warning
  4. The authentication banner will appear. Select “Continue.”

    Figure 6.3- WinSCP Authentication Banner
  5. When prompted, type “1” to receive a Duo Push on your mobile device. Authenticate with Duo. You will then be logged in.

    Figure 6.4 – Duo Prompt in WinSCP
  6. Once you authenticate, you will get the WinSCP application screen. On the left side of the screen, you see your local machine. On the right side of the screen, you see the remote system into which you are logged.

Preparing Data

Before you initiate any data transfers from DTN to another storage resource, consider preparing the data you wish to transfer by archiving and compressing it. When you archive data, several files and directories can be added to the same location. When you compress data, you reduce its total size. Both methods reduce the total amount of data that must be sent across the network and make it easier for you to organize the data you wish to transfer. At the time of this writing, the tar and zip utilities are the best methods for data archiving and compression for DTN users across Linux, MacOS, and Windows.

Using the tar Utility

The tar (tape archiver) utility uses simple command syntax and allows large amounts of data to be aggregated into the same archive. Linux, MacOS, and updated Windows 10 systems can use tar. Older Windows systems will be limited to the zip utility.

To create a tar archive, execute tar czvf <archive-name> <dir-to-archive>. Replace the <archive-name> argument with the name of the new archive. Be sure to follow the name with the .tar.gz extension, as in my_archive.tar.gz. Replace the <dir-to-archive> argument with the directory you wish to place within the archive. If the directory you intend to archive is not within your working directory, specify the relative or absolute path to it. By default, tar will recursively place the directory and its contents into the new archive. Figure 2.1 shows the successful creation of a tar archive.

[user@acf-login5 ~]$ tar czvf new_archive.tar.gz Documents
Documents/
Documents/IntroUnix.pdf
Documents/JobSubData.zip
Documents/MATLAB/
Documents/Scripts.zip
Documents/PyLists.py

Figure 2.1 – Creating a tar Archive

After the archive is created, execute ls -l to verify that the archive exists. You can view its contents with the tar tvf <archive-name> command. You may then transfer the archive using one of the data transfer methods described in this document. In general, Globus is the best method. Please refer to the Globus section to learn how to configure it for your system. On the remote system, execute tar xvf <archive-name> to extract the contents of the archive. The files will be extracted into your working directory.

Using the zip Utility

On older Windows systems, the zip utility should be used to archive and compress your data.

To create a zip archive, execute zip -r <archive-name>.zip <dir-to-archive>Be sure that the directory you wish to archive is in your working directory. Otherwise, specify the relative or absolute path to the directory you wish to archive. Replace the <archive-name> argument with the name of the new zip archive. You may or may not include the .zip file extension to the archive’s name; if you do not, the zip utility will add it automatically. Replace the <dir-to-archive> argument with the directory you wish to place in the zip archive. The -r option ensures that the directory and its contents are archived and compressed. Figure 2.2 shows the successful creation of a zip archive.

[user@acf-login5 ~]$ zip -r Documents Documents
  adding: Documents/ (stored 0%)
  adding: Documents/IntroUnix.pdf (deflated 4%)
  adding: Documents/MATLAB/ (stored 0%)
  adding: Documents/PyLists.py (deflated 61%)

Figure 2.2 – Creating a zip Archive

After the zip archive has been created, execute ls -l in the directory from which you created it to ensure the archive exists. It will appear with the name you gave to the archive followed by the .zip extension.

With the zip archive created and verified, transfer it to your system using one of the data transfer methods described in this document. In most cases, Globus is the most convenient method. Please refer to the Globus section to learn how to configure it for use on your system. Once you transfer the zip archive to your system, open the File Explorer and navigate to the directory in which you placed the archive. Right-click on the archive and select the “Extract All…” option in the submenu. Figure 2.3 shows where to locate this option. Specify the directory in which the contents should be extracted, then select “Extract.” You may then open the archive and peruse its contents.

Figure 2.3 – Extracting the Contents of a zip Archive in Windows