XSEDE HPC Monthly Workshop - September 2, 2014 - Big Data

09/02/2014 11:00 - 09/02/2014 17:00 EDT
in person (West Virginia University)
Registration
Registration open date
08/08/2014 10:40 EDT
Registration close date
08/29/2014 17:00 EDT
Class size restriction
12 registrants

(6 spots left)

Waitlist

0 registrants

Contact Information
Contact
Tom Maiden
Contact phone
412-268-4960
Contact email
tmaiden@psc.edu
Location
Name
West Virginia University
Address
107 of the National Center for Coal and Energy (NRCCE)
385 Evansdale Drive
Morgantown, WV 26506
Phone
412-268-4960
URL
http://www.nrcce.wvu.edu/directions.cfm

XSEDE HPC Workshop: BIG DATA
September 2, 2014

XSEDE along with the Pittsburgh Supercomputing Center are pleased to announce a one day Big Data workshop, to be held September 2, 2014.

This workshop will focus on topics such as Hadoop and SPARQL.

11:00AM – 1:00PM Eastern Time

--Big Data Programming with Hadoop and Spark

This session will give an overview of programming big data applications focusing on Hadoop and Spark.

I. Hadoop System Overview
This section will cover the basics of the Hadoop Environment. We will discuss the Map Reduce daemons, the scheduling and monitoring environment, and interacting with the distributed file system (HDFS).

II. Hadoop Jobs
We will write a simple Java Map/Reduce program and run through the process of compiling, packaging, submitting, monitoring, and collecting the output of a Hadoop job. We will also briefly discuss other applications that run on the Hadoop platform such as HBase and Hadoop Streaming.

III. Spark
We will discuss the Spark platform and its concept of Resilient Distributed Datasets. We will cover the relationship between Spark and Hadoop, and we will write and submit an example job. We will also discuss the Spark Machine Learning API.

2:00PM – 5:00PM Eastern Time

--Urika Training--

o Learn the Graph Analytic approach to Data analysis, including some real-world examples.

o Gain an introduction to the RDF data format and the SPARQL query lanquage, with hands-on practice.

o Learn how to interact with the Sherlock Urika system.

Due to demand, this workshop will be telecast to several satellite sites.

This workshop is NOT available via a webcast.

You may attend at any of the following sites.

  • Pittsburgh Supercomputing Center
  • Florida State University
  • Ohio Supercomputer Center
  • West Virginia University
  • University of Houston, Clear-Lake
  • Georgia State University
  • Harvey Mudd College
  • Purdue University
  • The University of Michigan
  • The University of South Carolina
  • National Center for Atmospheric Research

Register at the appropriate session here:https://portal.xsede.org/web/xup/course-calendar

Please address any questions to Tom Maiden at tmaiden@psc.edu.

XSEDE, the Extreme Science and Engineering Discovery Environment, is the most advanced, powerful, and robust collection of integrated digital resources and services in the world. It is a single virtual system that scientists and researchers can use to interactively share computing resources, data, and expertise. XSEDE integrates the resources and services, makes them easier to use, and helps more people use them.

This session has ended.
Posted: 08/04/2014 13:49 UTC