FSx &Transfer Family & DataSync

FSx

  • Allows us to launch 3rd party high-performance file systems on AWS

  • Useful when we don’t want to use an AWS managed file system like S3

  • Can be accessed from your on-premise infrastructure

Types of FSx

  • FSx for Windows File Server

  • FSx for Lustre

  • FSx for NetApp ONTAP

  • FSx for OpenZFS

FSx for Windows

  • FSx for Windows is a Shared File System for Windows (like EFS but for Windows)

  • Supports SMB protocol, Windows NTFS, Microsoft Active Directory integration, ACLs, user quotas

  • Can be mounted on Linux EC2 instances

  • Built on SSD, scale up to 10s of GB/s, millions of IOPS, 100s PB of data

  • Supports Multi-AZ (high availability)

  • Data is backed-up daily to S3

  • Can be accessed from your on-premises infrastructure (VPN or Direct Connect)

  • Does not integrate with S3 (cannot store cold data)

  • Storage Options:

    • SSD – latency sensitive workloads (databases, media processing, data analytics, …)

    • HDD – broad spectrum of workloads (home directory, CMS, …)

FSx for Lustre

  • Parallel distributed file system for HPC (like EFS but for HPC)

  • Scales up to 100s GB/s, millions of IOPS, sub-ms latencies

  • Only works with Linux

  • Seamless integration with S3

    • Can read S3 buckets as a file system (through FSx)

    • Can write the output back to S3 (through FSx)

  • Ability to both process the hot data in a parallel and distributed fashion as well as easily store the cold data on Amazon S3

  • Storage Options:

  • SSD – low-latency, IOPS intensive workloads, small & random file operations

  • HDD – throughput-intensive workloads, large & sequential file operations

FSx Lustre - File System Deployment Options

  • Scratch File System

    • Temporary storage (cheaper)

    • Data is not replicated (data lost if the file server fails)

    • High burst (6x faster than persistent file system)

    • Usage: short-term processing

  • Persistent File System

    • Long-term storage (expensive)

    • Data is replicated within same AZ

    • Failed files are replaced within minutes

    • Usage: long-term processing, sensitive data

FSx for NetApp ONTAP

  • Managed NetApp ONTAP on AWS

  • File System compatible with NFS, SMB, iSCSI protocol

  • Move workloads running on ONTAP or NAS to AWS

  • Works with:

    • Linux

    • Windows

    • MacOS

    • VMware Cloud on AWS

    • Amazon Workspaces & AppStream 2.0

    • Amazon EC2, ECS and EKS

  • Storage shrinks or grows automatically

  • Snapshots, replication, low-cost, compression and data de-duplication

  • Point-in-time instantaneous cloning (helpful for testing new workloads)

FSx for OpenZFS

  • Managed OpenZFS file system on AWS

  • File System compatible with NFS (v3, v4, v4.1, v4.2)

  • Move workloads running on ZFS to AWS

  • Works with:

    • Linux

    • Windows

    • MacOS

    • VMware Cloud on AWS

    • Amazon Workspaces & AppStream 2.0

    • Amazon EC2, ECS and EKS

  • Up to 1,000,000 IOPS with < 0.5ms latency

  • Snapshots, compression and low-cost

  • Point-in-time instantaneous cloning (helpful for testing new workloads)

Transfer Family

  • AWS managed service to transfer files in and out of Amazon S3 or Amazon EFS using FTP protocol (instead of using proprietary methods)

  • Supported Protocols

    • FTP (File Transfer Protocol) - unencrypted in flight

    • FTPS (File Transfer Protocol over SSL) - encrypted in flight

    • SFTP (Secure File Transfer Protocol) - encrypted in flight

  • Supports Multi AZ

  • Pay per provisioned endpoint per hour + data transfers in GB

  • Clients can either connect directly to the FTP endpoint or optionally through Route 53

  • Transfer Family will need permission to read or put data into S3 or EFS

DataSync

  • Move large amount of data to and from

    • On-premises / other cloud to AWS (NFS, SMB, HDFS, S3 API… protocols) – needs agent

    • AWS to AWS (different storage services) – no agent needed

  • Can synchronize to:

    • S3 (all storage classes)

    • EFS

    • FSx for Windows

  • Scheduled Replication (hourly, daily, weekly but not continuous)

  • File permissions and metadata are preserved (NFS POSIX, SMB…)

  • Need to install AWS DataSync Agent on premises

    attachments/Pasted image 20220513214950.jpg

  • Can also be used to transfer between two EFS in different regions

    attachments/Pasted image 20220513215131.jpg

  • Perfect to move large amounts of historical data from on-premises to S3 Glacier Deep Archive (directly).