Pittsburgh Supercomputing Center 

Advancing the state-of-the-art in high-performance computing,
communications and data analytics.

XSEDE HPC Workshop: BIG DATA

February 4, 2014

XSEDE along with the Pittsburgh Supercomputing Center are pleased to announce a one day Big Data workshop, to be held February 4, 2014.

This workshop will focus on topics such as Hadoop and SPARQL.

11:00AM – 1:00PM Eastern Time
--Intro to Hadoop--

System overview
-Map/reduce
-Interacting with HDFS (Hadoop Distributed File System)

-Writing Java Hadoop programs
-Wordcount example
-Compiling and packaging
-Submitting
-Mappers and Reducers
-Monitoring
-API Overview

-Hadoop streaming – Hadoop programs in arbitrary languages
-Bash example
-Python example

2:00PM – 5:00PM Eastern Time

--Urika Training--

o Learn the Graph Analytic approach to Data analysis, including some real-world examples.

o Gain an introduction to the RDF data format and the SPARQL query lanquage, with hands-on practice.

o Learn how to interact with the Sherlock Urika system.

Due to demand, this workshop will be telecast to several satellite sites.

This workshop is NOT available via a webcast.

You may attend at any of the following sites.

Register at https://portal.xsede.org/group/xup/course-calendar/-/training-user/class/161

  • Pittsburgh Supercomputing Center
  • Clemson University
  • Purdue University
  • Ohio Supercomputer Center
  • University of Houston – Clear Lake
  • University of Iowa
  • Georgia State University
  • University of Utah
  • Notre Dame University
  • University of Connecticut

Please address any questions to Tom Maiden at This email address is being protected from spambots. You need JavaScript enabled to view it. .

 


XSEDE, the Extreme Science and Engineering Discovery Environment, is the most advanced, powerful, and robust collection of integrated digital resources and services in the world. It is a single virtual system that scientists and researchers can use to interactively share computing resources, data, and expertise. XSEDE integrates the resources and services, makes them easier to use, and helps more people use them.