Community Data Portal
Publish AIRR-seq Study in the AIRR Data Commons
Publishing your AIRR-seq study in the ADC with VDJServer is not a completely automated process; there are a number of manual validation steps that need to be performed. Furthermore, loading the data into the repository database can take hours, days or even a week depending upon the size of the data; therefore, the load process is initiated by a VDJServer administrator. The basic requirements include:
- Study metadata in AIRR Repertoire format.
Validation scripts are run to verify the metadata is valid and complete. If the study metadata has been provided in VDJServer’s Metadata Entry page, that metadata can be automatically converted into the AIRR Repertoire format.
- Rearrangement (annotated sequence) data in AIRR TSV format.
If your rearrangement data is not in the AIRR TSV format, it may need to be converted or run through the IgBLAST tool on VDJServer. Validation scripts are run to verify that the annotations are valid and complete.
- A VDJServer administrator loads the study into VDJServer’s repository.
Contact VDJServer (vdjserver@utsouthwestern.edu) to initiate publishing your study.
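Before contacting VDJServer, it may help to run a quick local pre-check of your files. The following is a minimal sketch, not VDJServer's validation scripts: it only checks that an AIRR TSV header contains a set of required columns and that each Repertoire object has the expected top-level sections. The column and key lists reflect one reading of the AIRR schema and are assumptions to verify against the current AIRR Standards; the function names are hypothetical.

```python
import csv
import io
import json

# Assumption: required Rearrangement columns as understood from the AIRR
# schema; check this list against the current AIRR Standards before use.
REQUIRED_REARRANGEMENT_COLUMNS = [
    "sequence_id", "sequence", "rev_comp", "productive",
    "v_call", "d_call", "j_call",
    "sequence_alignment", "germline_alignment",
    "junction", "junction_aa",
    "v_cigar", "d_cigar", "j_cigar",
]

# Assumption: top-level sections expected in each Repertoire object.
EXPECTED_REPERTOIRE_KEYS = [
    "repertoire_id", "study", "subject", "sample", "data_processing",
]


def missing_rearrangement_columns(tsv_handle):
    """Return the required columns absent from the header of a TSV stream."""
    reader = csv.reader(tsv_handle, delimiter="\t")
    header = next(reader, [])  # an empty file yields an empty header
    return [c for c in REQUIRED_REARRANGEMENT_COLUMNS if c not in header]


def repertoire_problems(metadata):
    """Return a list of problems found in parsed repertoire metadata."""
    repertoires = metadata.get("Repertoire")
    if not isinstance(repertoires, list) or not repertoires:
        return ["no 'Repertoire' list found"]
    problems = []
    for i, rep in enumerate(repertoires):
        for key in EXPECTED_REPERTOIRE_KEYS:
            if key not in rep:
                problems.append(f"Repertoire[{i}] is missing '{key}'")
    return problems


if __name__ == "__main__":
    # Tiny in-memory examples; replace with open("your_file.tsv") and
    # json.load(open("your_metadata.json")) for real files.
    tsv = io.StringIO("sequence_id\tsequence\tv_call\nseq1\tACGT\tIGHV1-2*01\n")
    print("Missing columns:", missing_rearrangement_columns(tsv))

    metadata = json.loads('{"Repertoire": [{"repertoire_id": "r1", "study": {}}]}')
    for problem in repertoire_problems(metadata):
        print(problem)
```

A clean result here does not mean the study will pass full validation; it only catches missing columns or sections early, before the manual validation steps described above.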
Data Charges for Repository Copies
VDJServer can provide customers with a complete copy of the data repository. For large quantities of data, this can be quicker than downloading through the portal. Because this requires additional time and resources from VDJServer personnel, we impose a data charge for cost recovery. The following options are available:
- (Price: $2000 USD) Upload data to an Amazon S3 bucket.
We will upload either to a customer-provided bucket or to a new bucket that we create and give the customer access to. If we create the bucket, it will only be accessible for a pre-determined amount of time (typically one month), so the customer must move the data into their own bucket.
- (Price: $2000 USD) Ship a hard disk drive with the data to the customer via FedEx or UPS.
We purchase a standard SATA hard drive but can use a customer-provided SATA hard drive if desired. The hard drive becomes the customer’s property and does not need to be returned.
- (Price: $3000 USD) Both options 1 and 2.
We will upload data to an Amazon S3 bucket and provide the customer with a hard disk drive.
- Yearly update for options 1, 2 or 3.
The price is the same as for the requested option. A repository copy is sent automatically to the customer every year.
Please contact VDJServer (vdjserver@utsouthwestern.edu) to initiate an order. Other data delivery options are available on request, as are training services covering the data and data formats. Payment is processed through an invoice sent by UT Southwestern Medical Center.