Welcome to our community.

In this community, you can submit ideas, vote on existing ideas, or add comments.

To submit an idea, please click the Submit New Idea button at the top of the navigation sidebar. You will then be asked to add a title and choose a campaign for the new idea. You will also have the option to add tags to the idea. To vote on an idea, simply click the up or down arrows to the right of the idea title/description. And to add a comment, click in the box below the idea.

If you would like to see all ideas created with a specific tag, you can click on the word or phrase via the tagcloud in the navigation sidebar area under "What we're discussing". You can also view ideas sorted by Campaigns from the right navigation area. To return to this page, click the All Ideas link.

Analysis Priorities

Support multiple workflow tools and data access mechanisms

Galaxy and GenePattern are examples of systems that could provide access to data sets, pipelines, and publishable, shareable, and reproducible workflows. Ideally, existing familiar and popular platforms such as these would be supported. In addition to improving or enabling interactions between these tools, effort should be directed towards facilitating programmatic access to the underlying data in order to support custom ...more »

Submitted by

Data Priorities

Bringing Tools to Data to Avoid Data migration and redundancy

Most current approaches for BigData analysis involve moving data to a server, HPC infrastructure or cloud where the software tools and reference databases are pre-configured. This is inefficient since this approach requires making redundant copies of data each time and additional costs/time associated with moving data back and forth. Since there is no single tool or workflow to analyze genomic data, multiple copies ...more »

Submitted by

Analysis Priorities

Actively support crowd-sourcing challenges

To stimulate learning as much as possible, as quickly as possible, the data cloud could have a utility where interested parties could pose "crowd-sourcing" challenges, e,g, Kaggle. Indeed, Harold Varmus, NCI & leaders in cancer & genomics could pose the leading questions they would like bright people to take a run at answering, e.g. Hilbert's 23 problems

Submitted by

Analysis Priorities

Integrative Analysis of molecular datatypes for a given sample

A sample could be analyzed for DNA sequence variations, structural variations, CNVs, Gene or transcript isoform expression, genome-wide methylation patterns, ChIP-seq for specific transcription factors, metabolomic or proteomic analysis, and other molecular profiles. A framework that allows a researcher to readily identify all molecular data types associated with a particular sample and integrate the results of such analyses ...more »

Submitted by

Data Priorities

Connect data with available specimens for follow-up studies

Mining cancer data in the cloud is great, but to enable ongoing research there should be a connection to specimens so researchers can pursue followup studies. This will require storing data about specimens from studies such as TCGA - where they are, how they can be accessed and what consent they are governed by. Just as the data from publications should be made available to allow reproduction of results, so should samples ...more »

Submitted by

Displaying 1 - 25 of 40 Ideas