As data volumes grow, how we manage data becomes more important. This session covers the basics of managing data in a research environment, such as those at ARC and at national facilities. Attendees will be introduced to recommended tools for sharing and transferring data on campus, off campus, and in the cloud. They will also learn how to prepare data for archiving, including high-performance versions of tar and compression tools that offer significant speedups over the standard versions.
Lastly, we will cover the properties of general-purpose storage and how to select the appropriate option for data that requires long-term preservation and active archiving, with a focus on supporting the largest data volumes while controlling costs and keeping management simple.
Requirements: basic familiarity with the command line.