XSEDE HPC Workshop: Big Data April 7-8, 2020


April 7-8, 2020

XSEDE along with the Pittsburgh Supercomputing Center is pleased to present a two day Big Data workshop.

This workshop will focus on topics such as Hadoop and Spark and will be presented using the Wide Area Classroom (WAC) training platform.

Due to COVID-19, this workshop will be remote, using zoom.  Final plans are being put in place and registration pages are soon to follow.

Due to demand, this workshop will be telecast to several satellite sites, given below.

You may attend at any of the following sites.

  • Pittsburgh Supercomputing Center
  • Carnegie Mellon University
  • Georgia Institute of Technology
  • University of Cincinnati
  • Stanford University
  • George Mason University
  • Tufts University
  • Purdue University
  • Arizona State University
  • University of Houston – Clear Lake
  • East Carolina University
  • Yale University
  • University of Utah
  • New York University
  • Pennsylvania State University
  • University of California, Los Angeles
  • University of Houston
  • University of South Carolina
  • University of Delaware
  • University of Illinois at Chicago
  • Pomona College
  • Oklahoma State University
  • University of Tennessee, Knoxville – National Institute for Computational Sciences
  • National Center for Supercomputing Applications
  • Lawrence Berkeley National Laboratory


Register by going to: https://portal.xsede.org/course-calendar

Tentative Agenda

Tuesday, April 7
All times given are Eastern
11:25Intro to Big Data
12:30Intro to Spark
1:00Lunch break
3:00Spark Exercises


Wednesday, April 8
All times given are Eastern
11:00Machine Learning: Recommender System with Spark
1:00Lunch break
2:00Deep Learning with Tensorflow


Please address any questions to Tom Maiden at tmaiden@psc.edu.

XSEDE, the Extreme Science and Engineering Discovery Environment, is the most advanced, powerful, and robust collection of integrated digital resources and services in the world. It is a single virtual system that scientists and researchers can use to interactively share computing resources, data, and expertise. XSEDE integrates the resources and services, makes them easier to use, and helps more people use them.