OMB
No.: 0925-0775
|
The following sets of high-level questions are intended to provide an insight to CDS, into the data storage, access and secondary sharing needs and requirements of data submitters. It is requested that the submitters answer as many questions as they can. It is not required to answer all questions.
What are the principal types of data the program will be submitting (e.g., genomic, clinical, imaging)?
Will there be additional data types associated with the principal data types, not being submitted to CDS? For ex: Proteomics, Imaging etc.
Do you anticipate other additional data types to be submitted to CDS in future? For example, data type that does not fit the submission criteria to any of the present CRDC nodes.
Is the data from Humans?
NOTE: CDS accepts only Human data at his point.
What additional associated data would you be providing? For ex: Clinical/Phenomics data from study subjects (participants) and/or any other study associated metadata/searchable variables. Describe the format for each.
NOTE: CDS at this point will accept all metadata submitted.
What is the total number of samples and cases per study, being submitted?
For Genomics datasets, though CDS takes BAM files, it is preferred to submit CRAM files. Would you be able to provide CRAM files instead of BAM files?
Who is the PI on the study?
How much data are you planning to submit to CDS?
By data type (if known)?
What is the reason you are looking for storage with CDS? What are your challenges related to the storage of data?
Do you have a preference of AWS versus Google cloud for storage? CDS provides AWS storage as of now and plans to provide Google storage in near future.
Who will submit the data, the PI (or the PI’s team) or a collaborator?
Would there be multiple uploaders (ex: by data type or working groups)?
Is there a program timeline associated with the data Submission?
When do you plan to start submitting data to CDS?
Who is the primary point of contact for data submission?
Do you plan one or multiple submissions to CDS? For example, multiple studies or newer versions of the data for the same study.
If yes, do you have a timeline for the successive submissions?
If this submission has data from a newer version of the study already submitted to CDS, do you want to retain data from the older version/s at CDS?
Do you have an Amazon/Google account for data submission? CDS submissions presently require that the data uploaders have an Amazon account.
Is your data being released to broader research community for secondary sharing?
When is the data planned to be released for secondary sharing?
Is your data sensitive, i.e., require controlled access?
Has the data been registered with any public sharing repository such as dbGaP?
If not, is there a reason?
If yes, please share the associated study ID, for ex: dbGaP PHS number.
Is the study RELEASED by dbGaP?
Is data currently shared through NCBI /dbGaP or other means? What is a plausible timeline?
Has the data already been submitted to any data repository? For Ex: SRA
CDS does not allow downloads. Given this, does CDS meet your data sharing needs?
Are there any data access limitations?
Is the data embargoed? If yes, would the data reside in CDS during that time? How would it effect user access?
Is any part of this data “open-access”? for ex: VCFs from Genomics studies.
How can you assure the data does not contain PII and PHI and/or identifiable data elements?
How do the users access the data presently?
How well are these methods working today?
Will your data be made accessible through any other repository?
For what purpose(s) do the approved users access the data?
Conduct analyses / computations?
Cite in a publication?
Do they need to link this data to other data types in other repositories/CRDC nodes for analysis?
Do you know the post-CDS destination for this data? For ex: to other CRDC nodes such as GDC, PDC, IDC etc.
Is there any plan to move data out of CDS buckets, before sharing publicly?
Is there any other information you would like to share about your data?
File Type | application/vnd.openxmlformats-officedocument.wordprocessingml.document |
Author | Addepalli, Kanakadurga (NIH/NCI) [E] |
File Modified | 0000-00-00 |
File Created | 2023-08-19 |