High Throughput Sequencing Of T-Cell Repertoires: Statistics and Methods
Rubio Hermosa, David
MetadataShow full item record
T-cell repertoires are now easy to sequence through CDR3 sequence isolation and further High-Throughput Sequencing. In this thesis we have investigated the use of Zipf-Mandelbrot models establishing them as accurate approximations of T-cell repertoires. We have also established the biological interpretation of the parameters produced by the model. We have compared the use of these new parameters to the metrics currently used in T-cell repertoire analysis, clonality and entropy. To establish the biological relevance of Zipf-Mandelbrot models in cancer 3 case studies have been proposed. These case studies show the usefulness of the new metrics. CDR3 variability and length have been analyzed in one of the case studies and compared to previous results produced from RNAseq data.