immgenT T-RBI Data Integration

Drag & Drop your files here

Required Files Description

To run T-RBI, please submit the required files below:

• matrix.mtx (.gz optional) – required – an RNA count matrix in Matrix (.mtx) format for all cells (across datasets/samples).

• genes.tsv (.gz optional) – required – a ONE-COLUMN list of NCBI Gene Symbols corresponding to the rows of your matrix.

- Do NOT submit ENSEMBL IDs, ENTREZ IDs, etc.
- Do NOT pre-filter genes (e.g., highly variable genes only). Submit all genes.
- Required genes for QC, T cell filtering, and γδ T cell identification (the run will stop if they are missing):

T cell genes: Trav14-1, Trgv2, Trac, Cd3e, Foxp3, Izumo1r, Cd40lg, Dapl1, Cd5, Trat1, Cd3d, Zfp683, Themis, Esm1, Cd3g, Il10, Sdcbp2, Olfr524, Ly6c1, Sox13, Syt13, Gzmk
γδ T cell genes: Trdv1, Trdv2-1, Trdv2-2, Trdv3, Trdv4, Trdd1, Trdd2, Trdj1, Trdj2, Trdc, Trdv5, Trgv7, Trgv4, Trgv6, Trgv5, Trgj1, Trgc1, Trgv3, Trgj3, Trgc3, Trgc2, Trgj2, Trgv2, Trgv1, Trgj4, Trgc4, Sox13

• barcodes.tsv (.gz optional) – required – a list of unique cell IDs that represent the columns in your matrix. Ensure cell IDs are unique; barcodes alone are insufficient when combining multiple 10x lanes.

• cell_batch.csv – required – Relates each cell to a batch (corresponding to an encapsulation run) – required when submitting multiple batches together. If your dataset is from one batch, enter the same batch ID for every cell.
Column names must match exactly:

- Cell_ID: must exactly match the cell IDs in barcodes.tsv
- batch: batch identification

Examples:

																			Cell_ID,batch

																			GTCGTAACAACTGGCC-1-19-0-0,18-F-50

																			AAAGTAGAGGGTCTCC-1-38-0-0,18-M-52

																			CATGACAAGTATCGAA-1-57-1-0,1-M-63

																			GCACATAAGCTAACTC-1-49-1-0,1-M-63

																			AAACCTGGTGCAGGTA-1-4-0-0,18-F-50

Common cases:

• Different 10x lanes should be considered different batches (even if the same day, technical replicates). One 10x lane = 1 batch in the csv file

The T-RBI pipeline will return the mapping and annotation of your cells in the immgenT framework. The data you upload will be retained for a few days for verification and quality control, and then be deleted from our server.

On the other hand, and if you agree, we would like to retain data submitted to T-RBI in order to build a larger reference of T cell datasets, in addition to immgenT datasets. We believe that growing the reference by incorporating external data that best reflect evolving technologies in the community is a powerful strategy to future-proof T-RBI.

If you agree, please indicate below. Needless to say, your data will not be analyzed individually, will not be distributed to third parties. We do plan to keep a list of “donors” for future recognition.

T-RBI (immgenT Reference-Based Integration) - Data Submission

Drag & Drop your files here

Required Files Description

Please tell us about your data: