The Integrative Cancer Research Workspace is producing modular and interoperable tools and interfaces that provide for integration between biomedical informatics applications and data. This will ultimately enable translational and integrative research by providing for the integration of clinical and basic research data. The Workspace is developing a software-engineered, well-documented and validated biomedical informatics toolset for use throughout the research community.
Additional information is also found on the Molecular Analysis Knowledge Center site.
Open Development and Community Projects Speaker Series
ICR WS Teleconference Speakers
The Integrative Cancer Research (ICR) Workspace is hosting a series of speakers on various open development and community driven projects. While the applications may be of greatest interest to ICR, the conversations about success factors for these types of projects is of general interest to the caBIG community.
You are welcome to join us on the 2nd and 4th Wednesdays of each month from 2:00 – 4:00 pm ET. Teleconference information:
800-593-0616 passcode: 2927756.
Adobe connect: http://cbiit.acrobat.com/icrws/
ALL TELECONFERENCE MATERIALS ARE POSTED BELOW.
BioPAX and GeneMANIA, Gary Bader (U. of Toronto)
I SPY/ Transcend, Sarah Davis (UCSF)
Protein Ontology, Cecilia Neomi Arighi (U. of Delaware)
Data standards and the Cardiovascular DAM at Duke University, Sal Mungal (Duke)
National Center for Biomedical Ontology - BioPortal, Trish Whetzel (Stanford)
POSTPONED: The Functional Genomics Data Society (FGED, formerly MGED), Chris Stoeckert (U. of Pennsylvania)]
caB2B releases v3.2
The caBench-to-Bedside (caB2B) development team at Georgetown University and caB2B Knowledge Center at Washington University announced the availability of the caB2B v3.2. In this release caB2B's functionality has been expanded to better support complex queries that involve multiple services from multiple institutions such as caTissue Suite services at Prostate Cancer SPORE sites.
The caB2B Web Application is available for the community's use at the following sites:
* https://cab2b.georgetown.edu/cab2b (Georgetown University instance).
* https://cab2b.wustl.edu/cab2b (Washington University instance) and
* https://cab2b.nci.nih.gov/cab2b (NCI instance).
GenePattern 3.3.2 Released
The GenePattern development team is pleased to announce the release of GenePattern version 3.3.2. This release includes several major feature enhancements such as upload of large files to support RNA-seq analysis and the launching of datasets in the Integrative Genomics Viewer (IGV). To learn more about the enhancements and bug fixes in GenePattern 3.3.2, please visit the GenePattern landing page .
New Molecular Analysis Tools Knowledge Center Resources
The Molecular Analysis Tools Knowledge Center (MATKC) has created/updated several tutorials to assist users in using geWorkbench. Please visit geWorkbench Tutorials to learn more on Grid services, Multiple Probeset ID, Multiple Gene ID, Fold Change analysis, ARACNe, CNKB, ANOVA, Sequence Retriever, Volcano Plot, Color Mosaic, and SkyBase.
geWorkbench v 2.2 released
The geWorkbench development team is pleased to announce the release of geWorkbench version 2.2.0. This release includes more than 180 new features, enhancements and bug fixes. For a full list, see http://www.geworkbench.org .
The meeting Date is hyperlinked to the notes and hyperlinks in the Executive Summary are for specific presentations
Cecilia Neomi Arighi from U of Delaware presented on Protein Ontology. (PRO) provides the ontological representation of proteins and protein complexes and shows the relationships between them. It encompasses three sub-ontologies: proteins based on evolutionary relatedness (ProEvo); protein forms produced from a given gene locus (ProForm); and protein-containing complexes (ProComp). There is a website with extensive browse and search capabilities and includes additional annotations using other ontologies like Pfam and GO.
Gary Bader gave a presentation on BioPAX, a standard language that enables integration, exchange, visualization and analysis of biological pathway data. BioPAX underlies the integration of public database content in Pathwaycommons.org, which has a comprehensive directory of interaction and pathway databases as well as search, download and visualization tools (from Cytoscape). Dr. Bader also gave a demo of GeneMANIA, a tool to predict gene function, gene sets, pathway members, complexes etc. It draws data from Pathway Commons and uses a Cytoscape webservice for visualization. Pathway Commons and GeneMANIA (and Cytoscape) are built entirely on open source software.
John Speakman, NCI CBIIT CPO kicked-off the workspace call with a caBIG Program Update. Mike Smoot, UCSD, is lead developer for Cytoscape. He provided an overview of the Cytoscape software platform for visualizing molecular interactions and pathways and integrating these with annotations and data. His talk also focused on the active Cytoscape community. Sunny Jansen is a researcher at the Mouse Cancer Genetics Program on the Ft. Detrick campus in Maryland. She is doing a mouse GBM study using caBIG tools such as NBIA and caIntegrator with imaging, pre-clinical and genomic data. Ravi Madduri from Argonne National Lab will provide an update on his work with Galaxy. He was recently in the Netherlands at the Galaxy users meeting and presented his work on highthroughput data and identity management in Galaxy.
Mark Bray from the Broad Institute gave a presentation on CellProfiler. It is a tool which integrates various algorithms. It is capable of extracting measurements for every individual cell in an image-based screen, ranging from standard assays (e.g., cell count, size, per-cell protein levels) as well as more complex morphological assays (e.g., cell/organelle shape or subcellular patterns of DNA or protein staining). Judith Blake from the Jackson Laboratory discussed the Gene Ontology (GO). GO provides a standardized representation of gene and gene product attributes and is comprised of a controlled vocabulary, gene product annotation data and tools to access and process the data (GO Browser and OBO ontology editor). GO is maintained by the GO Consortium which is comprised of organism and protein databases and the biological research community.
Rashmi Srinivasa hosted a caArray Users Meeting. She discussed features of the upcoming 2.4.1 release and the plans for the following release. Grace Stafford provided an update on The Jackson Laboratory caBIG Deployment Implementation. This included the impact of next generation sequencing technologies on the computational core tool needs and requirements emerging from new projects. Ravi Madduri provided a demonstration of caBIG workflows run using the Galaxy platform.
The ICR workspace heard reports on the recent BioIT and AACR meetings from Mark Adams (BAH) and Jens Poschet (Sapient) respectively. Jason Hipp of University of Michigan gave a presentation on the use of Laser Capture Microdissection in --omics research.
The ICR participants presented on their assigned projects. Jim McCusker, caIntegrator Community Contribution of Code, discussed his implementation experience and future recommendations. Dennis Thomas , Extending the Use of the NanoParticle Ontology, addressed challenges for using NPO with nano-TAB and caNanoLab NPO for data annotation, semantic integration of data, unambiguous interpretation and data sharing. Mukesh Sharma , HL7 Clinical Genomics WG, discussed “Considering changing mission and charter to reflect changing technology” and “Mission to include bridging semantics between clinical and research domains.” Nathan Baker reviewed the updates and future goals of the Nanotechnology Working Group. In addition, Bob Freimuth , Information Representation Working Group, reviewed the activities and future recommendations for the period.
Alex Kanous from the DSIC Knowledge Center provided a presentation on the electronic Data Use Agreement tool. The tool may be used for generating DUAs for outgoing data based on a catalog of standardized, modular contract clauses, each of which corresponds to one of the E-DSSF’s sensitivity ratings. Joshua Phillips and Ravi Madduri discussed the prototyping activities on workflows for caGrid 2.0. They described use cases and requirements for workflows, defining metadata needed for discovery, composition, and execution of workflows and their consideration of how best to use W3C technologies (e.g. RDF, SPARQL, SA-WSDL, inference). They also provided a demo of a workflow engine prototype based on SADI and Taverna. Ken Quinn spoke about the Roswell Park deployment of caGrid technology and use of caB2B to do federated queries across disparate, decentralized heterogeneous databases and clinical systems to support non-interventional clinical research. He described the process and challenges including: gathering senior leadership support, understanding the myriad research databases, gaining technical expertise, lack of common vocabularies and the excellent collaboration and support provided by the caB2B knowledge center.
Anton Nekrutenko and Daniel Blankenberg and of The Pennsylvania State University gave a presentation on Galaxy, an open-source next generation sequence (NGS) analysis software system. It addresses the need to empower the scientists without access to extensive infrastructure to do the analysis. Galaxy is a free web service, and has a plethora of analysis tools and has workflow generation capabilities
caArray Users Meeting featured upcoming plugin architecture for support the addition of new parsers and data storage mechanisms.
Jenny Kelley, NCI Population Sciences, updated the community on caLIMS v2 new features. She also provided a thorough demonstration of the tool.
The ICR Workspace is hearing reports on activities for the last period.
Looking for older notes? Access ICR Meeting Notes Archive