
This year the SDSC Discover Big Data Summer Institute will focus on big data analytics, helping attendees explore their data using a wide variety of predictive data analytics tools. This week-long summer institute combines both presentations and hands-on experience to introduce attendees to the latest approaches and tools to extract meaning and new insights from very large data sets. Participants will use SDSC’s Gordon data-intensive supercomputer as well as other computational resources at SDSC. An agenda and schedule will be posted in the coming months, but please hold the date now so you can attend! We also welcome expressions of interest by individuals and companies who would like to lecture, sponsor student attendees, or provide other forms of support for the institute.
Topics to be covered include:
- Overview of the Gordon and Trestles architectures
- Data intensive computing
- Developing shared memory applications
- Improving I/O performance with flash storage
- Using vSMP for large shared memory
- Visualization using Gordon
- Developing Science Gateways
- Overview of software, libraries, tools, and compiler options for achieving optimal performance
- XSEDE allocations process and writing a strong proposal
All current and potential users of SDSC resources are invited to apply, but experience working in a UNIX/Linux environment is essential. Preference will be given to those applicants who have some programming experience (e.g. C/C++, Fortran, R, Python) and a particular computational problem they are trying to solve.
The 2012 Summer Institute group of participants.