Welcome to the CBIIT HPC Wiki! This page is actively under construction and will soon serve as a central hub to provide you with the latest developments and resources related to the CBIIT HPC Program. We look forward to serving you!
Table of Contents
1. HPC Thought Leaders Presentations
2. HPC Program Management
3. Communications
4. Strategic Collaborations
5. HPC Support Inquiries
7. Precision Oncology and Computing
8. Exploratory Computing
9. Cloud Services
10. Computing and Cancer Community Development
11. New Initiatives
The High-Performance Computing Program Development effort aims to foster the expanded use of a high-performance computing ecosystem to accelerate advances in predictive oncology research and clinical applications. Both driven and enabled by the rapid growth rates of information collected and generated about cancer, the opportunity for ever increasing computational capability grows as the data is analyzed, explored and utilized to provide critical insight into cancer. The program aims to develop the computational and data science ecosystem by addressing critical needs in compute, data transfer, data management, exploration and education in these areas required to advance the mission of the NCI.
*For support inquiries, please contact us at nci-cbiit-hpc@list.nih.gov
CLICK THE BELOW IMAGE for a larger view of CBIIT's HPC Overview
Topic: Streamlined Transfer and Sharing of Large-scale Sensitive Data to Advance Cancer Research |
Speaker: Ian Foster, Ph.D. Read Dr. Foster's professional bio.
|
Presentation synopsis: Advances in genomics and data analytics create new opportunities for cancer research and personalized medical treatment via large-scale federation of genomic, clinical, imaging and other data from many thousands of patients across institutions around the world. Despite these opportunities and promising early results, cancer research is often stymied by information technology barriers. One major barrier is a lack of tools for the reliable, secure, rapid, and easy transfer, sharing, and management of large collections of human data. In the absence of such tools, security and performance concerns often prevent sharing altogether or force researchers to resort to slow and error prone shipping of physical media. If data are received, timely analysis is further impeded by the difficulties inherent in verifying data integrity and managing who can access data and for what purpose. Dr. Foster will discuss how the mature Globus data management platform addresses these obstacles to discovery and explain how its intuitive, web-based interfaces enable use by researchers without specialized IT knowledge.
|
Yuba Bhandari has developed state of art algorithm that matches experimental molecular
dynamics results with simulated code.
Eric Stahlberg has explored the optimization of this code in terms of runtime and input
interface on intel multicore processors using OpenMP.
Given the availability of GPU accelerator on Biowulf and the national labs
supercomputer, it is worth exploring if further speedup can be achieved using
GPUs.
Objectives:
1- Implement an GPU version of the time consuming hotspots of the code.
2- Explore the implementation of a web interface to allow users outside the NCI
to use the application.
3- Explore further refactoring of the code to enhance the optimization process.
CLICK HERE for the full report.
High Performance Computing (HPC) has an invaluable impact on driving advances in cancer research. Projects conducted within the context of the Precision Medicine Initiative, the National Cancer Institute, the Department of Energy pilots, and the National Strategic Computing Initiative are only examples of how new frontiers can be pushed using the next generation of supercomputers.
Areas of impact include, but are not limited to:
- predicative algorithms for cancer therapy using machine learning,
- molecular dynamics simulation modeling for drug discovery, and
- medical image analysis.
CLICK HERE for the full report
Multivariate Analysis of Transcript Splicing (MATS) is an open tool for transcript slicing that is commonly used by NIH Biowulf users. The MATS package accepts a number of pairs samples of RNA-Seq data to detect differential alternative spicing events. For four samples pairs, the package took around 5 days to generate its results. An NCI investigator wanted to use the package to analyze multiple groups of 32 sample pairs which might take over a month to complete. The HPC/DM group was approached to enable such analysis.
In this blog, we will show how we were able to get an average of 4x speedup in the total runtime of the MATS package which we pushed on Biowulf under the name MATS-NIH.
CLICK HERE for the full report.
Recent Updates
8/8/16
· Frontiers of Predictive Oncology and Computing Meeting - With over 100 attendees from across the Department of Energy, NCI, academia, industry and other government agencies, the meeting (hosted by Intel July 12-14, 2016) provided an opportunity to gain insight into challenges and opportunities for the future. A white paper summarizing the meeting is to be developed.
· New Data Services with Cleversafe – The Cleversafe storage system officially was moved into a production operational status at the beginning of August. Led by the IT Operations Group at Frederick National Laboratory and working with many stakeholders including CCR, CBIIT and NIH CIT, the new system is used within industry and in key efforts such as the Genomic Data Commons to provide a high level of data assurance for archive and stable data. Stay tuned for further information on opportunities to learn more how this new resource may benefit your scientific and operational needs.
· Education and Training - Plans are underway to develop educational opportunities to learn more about how high-performance computing (HPC) can be used to accelerate cancer research and clinical applications. Individuals and groups interested in learning more about HPC, either in general or with specific technologies and scientific challenges in mind may reach out and contact Eric Stahlberg, Miles Kimbrough or George Zaki.
· Computational Approaches for Cancer workshop - Scheduled for November 13, 2016 as part of the International Conference for High Performance Computing, Networking, Storage and Analysis. A call for papers has been issued. More information can be obtained at the link http://www.scworkshops.net/cancer2016/
Error(s) occurred.
Warning(s) occurred.
Calendars with restrictions are embedded here.
- EDIT THE CALENDAR
Customise the different types of events you'd like to manage in this calendar.
#legIndex/#totalLegs - RESTRICT THE CALENDAR
Optionally, restrict who can view or add events to the team calendar.
#legIndex/#totalLegs - SHARE WITH YOUR TEAM
Grab the calendar's URL and email it to your team, or paste it on a page to embed the calendar.
#legIndex/#totalLegs - ADD AN EVENT
The calendar is ready to go! Click any day on the calendar to add an event or use the Add event button.
#legIndex/#totalLegs - SUBSCRIBE
Subscribe to calendars using your favourite calendar client.
#legIndex/#totalLegs