Events

Introduction to the Linux Command Line

Your Desktop

OVERVIEW This course will familiarize the student with the basics of accessing and interacting with Linux computers using the GNU/Linux operating system’s Bash shell, also generically referred to as “the command line”. Topics include: a brief overview of Linux, the Bash shell, navigating the file system, basic commands, shell redirection, permissions, processes, and the command […]

Software Development For Research: Version Control Principles

Your Desktop

Software development and computer programming is increasingly a major part of scientific research. Projects can quickly grow, and it can be difficult to manage changes, especially if multiple programmers are editing the same project at once! This workshop will cover Git, a commonly-used tool for managing coding projects with multiple users, with features to make […]

Introduction to Research Computing on the Great Lakes Cluster

Your Desktop

OVERVIEW This workshop will introduce you to high performance computing on the Great Lakes cluster.  After a brief overview of the components of the cluster and the resources available there, the main body of the workshop will cover creating batch scripts and the options available to run jobs, and hands-on experience in submitting, tracking, and interpreting the […]

Advanced Research Computing on the Great Lakes Cluster

Your Desktop

OVERVIEW This workshop will cover some more advanced topics in computing on the U-M Great Lakes Cluster. Topics to be covered include a review of common parallel programming models and basic use of Great Lakes; dependent and array scheduling; workflow scripting using bash; high-throughput computing using launcher; parallel processing in one or more of Python, […]

Processing the CoreLogic Data on Great Lakes using PySpark

Your Desktop

OVERVIEW This workshop provides an introduction to processing CoreLogic data using PySpark on the Great Lakes cluster. The CoreLogic dataset contains aggregated data from individual, parcel-level real estate transactions and financial records. U-M has licensed access to Tax, Deed, and Foreclosure data at the parcel level for every county in the United States. We will […]

Machine Learning on Great Lakes

Your Desktop

OVERVIEW This workshop will go over methods and best practices for running machine learning applications on Great Lakes. We will briefly outline machine learning before stepping through a hands-on example problem to load a project and submit a job to the HPC cluster. Participants are expected to be familiar with Python, the command line, and […]