NIH | National Cancer Institute | NCI Wiki  

Blog

Topic: Streamlined Transfer and Sharing of Large-scale Sensitive Data to Advance Cancer Research

When: Thursday, September 22, 2016, 2:00 PM – 3:00 PM ET

Where: NCI Shady Grove Building Room 2W910-912

Or via WebEx: https://cbiit.webex.com/cbiit/onstage/g.php?MTID=ef9811b3f94f17a6f2fae2da71526bdab

Event number: 732 767 995

Password: $Foster922


Speaker: Ian Foster, Ph.D.

Senior Fellow, Director of the Computation Institute, a joint institute of the University of Chicago and Argonne National Laboratory
Argonne Senior Scientist & Distinguished Fellow
Arthur Holly Compton Distinguished Service Professor of Computer Science

Read Dr. Foster's professional bio

 

Presentation synopsis: Advances in genomics and data analytics create new opportunities for cancer research and personalized medical treatment via large-scale federation of genomic, clinical, imaging and other data from many thousands of patients across institutions around the world. Despite these opportunities and promising early results, cancer research is often stymied by information technology barriers. One major barrier is a lack of tools for the reliable, secure, rapid, and easy transfer, sharing, and management of large collections of human data. In the absence of such tools, security and performance concerns often prevent sharing altogether or force researchers to resort to slow and error prone shipping of physical media. If data are received, timely analysis is further impeded by the difficulties inherent in verifying data integrity and managing who can access data and for what purpose. Dr. Foster will discuss how the mature Globus data management platform addresses these obstacles to discovery and explain how its intuitive, web-based interfaces enable use by researchers without specialized IT knowledge.


Anthony R. Kerlavage, Ph.D.
Chief, Cancer Informatics Branch
National Cancer Institute
Center for Biomedical Informatics & Information Technology

 

Yuba Bhandari has developed state of art algorithm that matches experimental molecular

dynamics results with simulated code.

 

Eric Stahlberg has explored the optimization of this code in terms of runtime and input

interface on intel multicore processors using OpenMP.

 

Given the availability of GPU accelerator on Biowulf and the national labs

supercomputer, it is worth exploring if further speedup can be achieved using

GPUs.

 

Objectives:

1- Implement an GPU version of the time consuming hotspots of the code.

2- Explore the implementation of a web interface to allow users outside the NCI

to use the application.

3- Explore further refactoring of the code to enhance the optimization process.

 

CLICK HERE for the full report.

High Performance Computing (HPC) has an invaluable impact on driving advances in cancer research. Projects conducted within the context of the Precision Medicine Initiative, the National Cancer Institute, the Department of Energy pilots, and the National Strategic Computing Initiative are only examples of how new frontiers can be pushed using the next generation of supercomputers.

Areas of impact include, but are not limited to:

  • predicative algorithms for cancer therapy using machine learning, 
  • molecular dynamics simulation modeling for drug discovery, and 
  • medical image analysis.

CLICK HERE for the full report

Multivariate Analysis of Transcript Splicing (MATS) is an open tool for transcript slicing that is commonly used by NIH Biowulf users. The MATS package accepts a number of pairs samples of RNA-Seq data to detect differential alternative spicing events. For four samples pairs, the package took around 5 days to generate its results. An NCI investigator wanted to use the package to analyze multiple groups of 32 sample pairs which might take over a month to complete. The HPC/DM group was approached to enable such analysis.

In this blog, we will show how we were able to get an average of 4x speedup in the total runtime of the MATS package which we pushed on Biowulf under the name MATS-NIH.

CLICK HERE for the full report.