Computational Science in the Cloud Institute 2018

View on GitHub

June 4-8, 2018

Texas Advanced Computing Center

Drawing

Spend a week with TACC’s web and cloud computing experts learning about tools, techniques, and best practices for doing high throughput computing, using virtual machines, containers, and recipes to ensure that your runtime environment is reproducible, even if you need to scale out. Be prepared to take the lessons you learn at TACC back to your own research project with hands-on exercises and one-on-one time with our experts talking about the specifics of your own research project and goals. Hear about the latest tools and research in distributed and cloud computing with invited talks from TACC users, collaborators, and guests.

Participants should bring their laptops and plan to participate actively. Laptops will require a terminal application for accessing compute resources.

Instructors:

Joe Stubbs, Charlie Dey, Greg Zynda, Cody Hammock, Julia Looney, Mike Packard

Agenda

Monday

Time Topic
8:30 - 10:00 Welcome and Introduction to Cloud Concepts
10:00 - 10:15 Break
10:15 - 11:45 Command Line and Jupyter Installation
11:45 - 13:00 Lunch
13:00 - 14:30 Jupyter Crash Course
14:30 - 14:45 Break
14:45 - 16:15 Jupyter Crash Course (cont)

Tuesday

Time Topic
8:30 - 10:00 Application Programming Interfaces
10:00 - 10:15 Break
10:15 - 11:45 Introduction to HTTP Authentication
11:45 - 13:00 Lunch
13:00 - 14:30 Introduction to OAuth and the Agave API
14:30 - 14:45 Break
14:45 - 16:15 Using Agave to Store/Retrieve Files

Wednesday

Time Topic
8:30 - 10:00 Introduction to Ansible
10:00 - 10:15 Break
10:15 - 11:45 Automating Jupyter Deployment and bonus Automating Openstack Instances
11:45 - 13:00 Lunch
13:00 - 14:30 Data Ingestion and Manipulation in Pandas
14:30 - 14:45 Break
14:45 - 16:15 Hands-on: Pandas

Thursday

Time Topic
8:30 - 10:00 Introduction to OpenStack CLI
10:00 - 10:15 Break
10:15 - 11:45 Automate Openstack Instance deployment with Ansible and Discussion of SLURM clusters on Openstack (time permitting)
11:45 - 13:00 Lunch
13:00 - 14:30 Introduction to Docker
14:30 - 14:45 Break
14:45 - 16:15 Introduction to Docker Part 2

Friday

Time Topic
8:30 - 10:00 Introduction to Singularity and Analyzing Large CSVs with Dask Distributed Pt 1 and Analyzing Large CSVs with Dask Distributed Pt 2
10:00 - 10:15 Break
10:15 - 11:45 Survey and Open Discussion
11:45 - 13:00 Lunch & Closing