T-RBI Pipeline Data Submission

To run T-RBI, please submit the required files below:

matrix.mtx (.gz optional), required – an RNA count matrix in Matrix Market (.mtx) format covering all cells (across datasets/samples).
genes.tsv (.gz optional), required – a ONE-COLUMN list of NCBI Gene Symbols corresponding to the rows of your matrix.
- Do NOT submit ENSEMBL IDs, ENTREZ IDs, etc.
- Do NOT pre-filter genes (e.g., highly variable genes only). Submit all genes.
- Required genes for QC, T cell filtering, and γδ T cell identification (the run will stop if they are missing):
T cell genes: Trav14-1, Trgv2, Trac, Cd3e, Foxp3, Izumo1r, Cd40lg, Dapl1, Cd5, Trat1, Cd3d, Zfp683, Themis, Esm1, Cd3g, Il10, Sdcbp2, Olfr524, Ly6c1, Sox13, Syt13, Gzmk
γδ T cell genes: Trdv1, Trdv2-1, Trdv2-2, Trdv3, Trdv4, Trdd1, Trdd2, Trdj1, Trdj2, Trdc, Trdv5, Trgv7, Trgv4, Trgv6, Trgv5, Trgj1, Trgc1, Trgv3, Trgj3, Trgc3, Trgc2, Trgj2, Trgv2, Trgv1, Trgj4, Trgc4, Sox13
barcodes.tsv (.gz optional), required – a list of unique cell IDs corresponding to the columns of your matrix. Make sure every cell ID is unique: raw barcodes alone are not sufficient when combining multiple 10x lanes, because the same barcode can occur in more than one lane.
cell_batch.csv, required – relates each cell to a batch (a batch corresponds to one encapsulation run). Submit this file even if your dataset comes from a single batch; in that case, enter the same batch ID for every cell.
Column names must match exactly:
- Cell_ID: must exactly match the cell IDs in barcodes.tsv
- batch: the batch identifier for that cell
Example:
Cell_ID,batch
GTCGTAACAACTGGCC-1-19-0-0,18-F-50
AAAGTAGAGGGTCTCC-1-38-0-0,18-M-52
CATGACAAGTATCGAA-1-57-1-0,1-M-63
GCACATAAGCTAACTC-1-49-1-0,1-M-63
AAACCTGGTGCAGGTA-1-4-0-0,18-F-50

Common cases:
• Different 10x lanes should be treated as different batches, even if they were run on the same day or are technical replicates. One 10x lane = one batch in the CSV file.
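
The requirements above can be checked locally before uploading. The following is a minimal sketch, not the pipeline's own validation: the function names are hypothetical, REQUIRED_GENES_SUBSET is only an illustrative subset of the two required gene lists, and the header check assumes that both Cell_ID and batch must be present as column names.

```python
import csv
import gzip
from collections import Counter

# Illustrative subset only (assumption): a real check would list every gene
# named in the "T cell genes" and "γδ T cell genes" lists above.
REQUIRED_GENES_SUBSET = {"Trac", "Cd3e", "Cd3d", "Cd3g", "Foxp3", "Trdc", "Sox13"}


def open_text(path):
    """Open a plain or gzip-compressed text file for reading."""
    if str(path).endswith(".gz"):
        return gzip.open(path, "rt", newline="")
    return open(path, "r", newline="")


def read_lines(path):
    """Return the non-empty lines of a text file without line endings."""
    with open_text(path) as handle:
        return [line.rstrip("\r\n") for line in handle if line.strip()]


def mtx_shape(path):
    """Return (rows, columns) from the size line of a Matrix Market file."""
    with open_text(path) as handle:
        for line in handle:
            # Header and comment lines start with '%'; the first other line is the size line.
            if line.strip() and not line.startswith("%"):
                rows, cols = line.split()[:2]
                return int(rows), int(cols)
    raise ValueError("no size line found in matrix file")


def check_genes(genes, required=REQUIRED_GENES_SUBSET):
    """Check that genes.tsv is one column and contains the required symbols."""
    problems = []
    if any("\t" in gene for gene in genes):
        problems.append("genes.tsv must have exactly one column")
    missing = sorted(set(required) - set(genes))
    if missing:
        problems.append("missing required genes: " + ", ".join(missing))
    return problems


def check_cells(barcodes, header, batch_rows):
    """Check cell ID uniqueness and that cell_batch.csv agrees with barcodes.tsv."""
    problems = []
    dupes = [cell for cell, count in Counter(barcodes).items() if count > 1]
    if dupes:
        problems.append(f"{len(dupes)} duplicated cell ID(s) in barcodes.tsv")
    if "Cell_ID" not in header or "batch" not in header:
        problems.append("cell_batch.csv needs columns named exactly Cell_ID and batch")
        return problems
    if {row["Cell_ID"] for row in batch_rows} != set(barcodes):
        problems.append("Cell_ID values do not match the cell IDs in barcodes.tsv")
    if any(not (row["batch"] or "").strip() for row in batch_rows):
        problems.append("some cells have an empty batch")
    return problems


def check_submission(matrix_path, genes_path, barcodes_path, batch_path):
    """Run all checks on the four files and return a list of problem messages."""
    genes = read_lines(genes_path)
    barcodes = read_lines(barcodes_path)
    with open_text(batch_path) as handle:
        reader = csv.DictReader(handle)
        batch_rows = list(reader)
        header = reader.fieldnames or []
    problems = check_genes(genes) + check_cells(barcodes, header, batch_rows)
    rows, cols = mtx_shape(matrix_path)
    if rows != len(genes):
        problems.append(f"matrix has {rows} rows but genes.tsv has {len(genes)} entries")
    if cols != len(barcodes):
        problems.append(f"matrix has {cols} columns but barcodes.tsv has {len(barcodes)} entries")
    return problems
```

Calling check_submission("matrix.mtx.gz", "genes.tsv.gz", "barcodes.tsv.gz", "cell_batch.csv") returns an empty list when none of these checks fail. Passing such a check does not guarantee that the server will accept the files.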
Please tell us about your data (up to 200 words):

The T-RBI pipeline on the ImmGen server will send you back the mapping and annotation of your cells in the immgenT framework. The data you upload will be retained for a few days in case verification or QC is needed, and will then be deleted.

We would also like to retain submitted data in order to assemble a large collection of T cell datasets, for system optimization and testing, and to integrate these data into a growing reference pool that better powers future queries. If you agree, please accept below.