Best Practices in Data Infrastructure Workshop

May 17-18, 2016

Pittsburgh, PA

PSC hosted a workshop on Best Practices in Data Infrastructure to bring together developers and users of advanced cyberinfrastructure relating to data management and analytics. The workshop was designed with the following groups in mind: awardees of NSF DIBBs and DataNet projects, leads for acquisitions having data as a major focus, and users with challenging data requirements. The workshop was an excellent opportunity for NSF ACI developers and users to interact. Goals of the workshop included disseminating significant results, creating opportunities for collaboration between data cyberinfrastructure projects, and identifying gaps where users need additional innovation or resources.

The workshop ran for 1.5 days, allowing time for presentations and discussion, and concluded at noon on May 18th.

The workshop fee was $150.00, due at registration.

Agenda

Tuesday, May 17, 2016
7:30 Check-in and Continental Breakfast
8:30 Welcome J Ray Scott (PSC)
8:45 Best Practices for Timely and Trusted Data Acquisition, Curation and Coordination in Microscope Environment Klara Nahrstedt (UIUC)
9:15 Data Observation Network for Earth: Earth and Environmental Science Data Management and Discovery Amber Budden (UNM)
9:45 SciServer: Collaborative Analysis of Petabyte Scale Scientific Data Jordan Raddick (JHU)
10:15 Break
10:30 CIF21 DIBBs: Brown Dog Kenton McHenry (UIUC)
11:00 Data Analysis and Management Building Blocks for Multi-Campus Cyberinfrastructure through Cloud Federation Tom Furlani (University at Buffalo)
11:30 CIF21 DIBBs: An Integrated System for Public/Private Access to Large Scale Confidential Social Science Data Charley Kneifel (Duke)
12:00 Discussion
12:30 Lunch
1:30 The Data Exacell: Tools for Data Analytics on Big Data J Ray Scott (PSC)
2:00 Syndicate: Software-defined Wide-area Storage Jude Nelson (Princeton)
2:30 Globus as a Platform for Research Data Management Vas Vasiliadis (U. Chicago)
3:00 Break
3:15 SeedMe Data Sharing Building Blocks Amit Chourasia (SDSC/UCSD)
3:45 CIF21 DIBBs: Middleware and High Performance Analytics Libraries for Scalable Data Science Geoffrey Fox (Indiana U.)
4:15 Geospatial Data Building Blocks Carol Song (Purdue University)
4:45 Discussion and Daily Wrap-up
5:30 Adjourn
Wednesday, May 18, 2016
7:30 Continental Breakfast
8:30 Summary and plans for today J Ray Scott (PSC)
8:45 DataNet Federation Consortium Reagan Moore (RENCI)
9:15 Wrangler: A Transformational Data Intensive Resource for the Open Science Community Dan Stanzione (TACC)
9:45 Converging HPC and Big Data with Bridges Nick Nystrom (PSC)
10:15 Break
10:30 SEAD Data Services Jim Myers (University of Michigan)
n/a Dataverse Merce Crosas (Harvard University)
10:45 Closing Discussion, including Capabilities of DIBBs Projects
12:00 Workshop concludes & Box Lunches Provided

Local Accommodations

There are a number of hotels close to PSC. All of the properties listed below offer free shuttle service, and many are within walking distance.

Residence Inn by Marriott Pittsburgh University/Medical Center (0.7 miles)

Use this link to receive the discounted rate of $129/night + tax

Hampton Inn (1.11 miles)

Use the group code BPD when booking to receive the discounted rate of $119/night + tax

Hilton Garden Inn Pittsburgh University Place (0.8 miles)

Wyndham University Center (0.42 miles)

Courtyard by Marriott (1.2 miles)

Use this link to receive the discounted rate of $165/night + tax

Questions

Contact Michelle Devlin at 412-268-4960 or mdevlin@psc.edu with any questions.