Webcast - Introduction to using Hadoop on Gordon

Host Site:

San Diego Supercomputer Center

Host site URL:

http://www.sdsc.edu

This workshop introduces researchers to Hadoop for data-intensive computing and explains why they might want to use it. It is designed for researchers seeking to use Hadoop on Gordon, XSEDE’s data-intensive cluster at the San Diego Supercomputer Center (SDSC). During the 2-hour workshop, participants will get an introduction to the various options available for running Hadoop within Gordon’s normal production environment. The configuration uses SSD storage on each compute node (available via iSER) to construct the Hadoop Distributed File System (HDFS) and the IPoIB interface for network communication.

Agenda

9:00AM – 9:45AM (PT) Overview

  • Gordon Architecture
  • Details of typical Hadoop configuration
  • Gordon-specific Hadoop options

9:45AM – 10:45AM (PT) Hands-on examples

  • Interactive setup of a Hadoop cluster using iSER scratch and IPoIB
  • HDFS setup, simple operations, performance benchmarking
  • TeraSort example – used to illustrate configuration options

10:45AM – 11:00AM (PT) Questions and Discussion

More information: http://www.sdsc.edu/us/resources/gordon/gordon_hadoop.html

Sessions:

Webcast

01/31/2013 09:00 - 01/31/2013 11:00 PST (SESSION HAS ENDED)
Registration CLOSED
Registration open date
12/27/2012 09:00 PST
Registration close date
01/30/2013 15:18 PST
Class size restriction
99 registrants

(31 spots left)

Waitlist

0 registrants

Contact Information
Contact
Susan Rathbun
Contact phone
858-534-8321
Contact email
susan@sdsc.edu
Location
Name
San Diego Supercomputer Center
Address
Synthesis Center - Room E-B143
10100 Hopkins Dr., UC San Diego
La Jolla, CA 92093-0505
Phone
858-534-8321
URL
http://www.sdsc.edu
Posted: 12/27/2012 01:57 UTC