Ishikawa T. Next-generation sequencing traces human induced pluripotent stem cell lines clonally generated from heterogeneous cancer tissue. World J Stem Cells 2017; 9(5): 77-88 [PMID: 28596815 DOI: 10.4252/wjsc.v9.i5.77]
Corresponding Author of This Article
Tetsuya Ishikawa, PhD (DMSc), Laboratory Head, Central Animal Division, Fundamental Innovative Oncology Core Center, National Cancer Center Research Institute, 1-1, Tsukiji 5-chome, Chuo-ku, Tokyo 104-0045, Japan. humanipscells@gmail.com
Research Domain of This Article
Cell Biology
Article-Type of This Article
Basic Study
Open-Access Policy of This Article
This article is an open-access article which was selected by an in-house editor and fully peer-reviewed by external reviewers. It is distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/
Tetsuya Ishikawa, Cell Biology, Core Facilities for Research and Innovative Medicine, National Cancer Center Research Institute, Tokyo 104-0045, Japan
Tetsuya Ishikawa, Central Animal Division, Fundamental Innovative Oncology Core Center, National Cancer Center Research Institute, Tokyo 104-0045, Japan
ORCID number: $[AuthorORCIDs]
Author contributions: Ishikawa T substantially contributed to the conception and design of the study as well as the acquisition, analysis and interpretation of the data, drafted the manuscript, made critical revisions related to the intellectual content of the manuscript, and approved the final version to be published.
Supported bythe JSPS KAKENHI, No. 16K07135.
Institutional review board statement: This study was conducted with the approval of the Institutional Review Boards of the National Cancer Center of Japan and the Japanese Collection of Research Bioresources, National Institutes of Biomedical Innovation, Health and Nutrition. Written informed consent was obtained from donors for the use of their tissue in this study.
Institutional animal care and use committee statement: N/A.
Conflict-of-interest statement: The Life-Science Intellectual Property Platform Fund (LSIP) supported this work. The author confirms that the LSIP had no influence over the study design, content of the article, or selection of this journal. The author discloses patents pending (WO2013081188 A1) relevant to the work presented here.
Data sharing statement: The next-generation sequencing data in this study will be available to the public through the DDBJ Sequence Read Archive (DRA).
Open-Access: This article is an open-access article which was selected by an in-house editor and fully peer-reviewed by external reviewers. It is distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/
Correspondence to: Tetsuya Ishikawa, PhD (DMSc), Laboratory Head, Central Animal Division, Fundamental Innovative Oncology Core Center, National Cancer Center Research Institute, 1-1, Tsukiji 5-chome, Chuo-ku, Tokyo 104-0045, Japan. humanipscells@gmail.com
Telephone: +81-3-35475201 Fax: +81-3-35422548
Received: January 31, 2017 Peer-review started: February 7, 2017 First decision: March 7, 2017 Revised: April 3, 2017 Accepted: May 3, 2017 Article in press: May 5, 2017 Published online: May 26, 2017 Processing time: 107 Days and 12.7 Hours
Abstract
AIM
To investigate genotype variation among induced pluripotent stem cell (iPSC) lines that were clonally generated from heterogeneous colon cancer tissues using next-generation sequencing.
METHODS
Human iPSC lines were clonally established by selecting independent single colonies expanded from heterogeneous primary cells of S-shaped colon cancer tissues by retroviral gene transfer (OCT3/4, SOX2, and KLF4). The ten iPSC lines, their starting cancer tissues, and the matched adjacent non-cancerous tissues were analyzed using next-generation sequencing and bioinformatics analysis using the human reference genome hg19. Non-synonymous single-nucleotide variants (SNVs) (missense, nonsense, and read-through) were identified within the target region of 612 genes related to cancer and the human kinome. All SNVs were annotated using dbSNP135, CCDS, RefSeq, GENCODE, and 1000 Genomes. The SNVs of the iPSC lines were compared with the genotypes of the cancerous and non-cancerous tissues. The putative genotypes were validated using allelic depth and genotype quality. For final confirmation, mutated genotypes were manually curated using the Integrative Genomics Viewer.
RESULTS
In eight of the ten iPSC lines, one or two non-synonymous SNVs in EIF2AK2, TTN, ULK4, TSSK1B, FLT4, STK19, STK31, TRRAP, WNK1, PLK1 or PIK3R5 were identified as novel SNVs and were not identical to the genotypes found in the cancer and non-cancerous tissues. This result suggests that the SNVs were de novo or pre-existing mutations that originated from minor populations, such as multifocal pre-cancer (stem) cells or pre-metastatic cancer cells from multiple, different clonal evolutions, present within the heterogeneous cancer tissue. The genotypes of all ten iPSC lines were different from the mutated ERBB2 and MKNK2 genotypes of the cancer tissues and were identical to those of the non-cancerous tissues and that found in the human reference genome hg19. Furthermore, two of the ten iPSC lines did not have any confirmed mutated genotypes, despite being derived from cancerous tissue. These results suggest that the traceability and preference of the starting single cells being derived from pre-cancer (stem) cells, stroma cells such as cancer-associated fibroblasts, and immune cells that co-existed in the tissues along with the mature cancer cells.
CONCLUSION
The genotypes of iPSC lines derived from heterogeneous cancer tissues can provide information on the type of starting cell that the iPSC line was generated from.
Core tip: Ten induced pluripotent stem cell (iPSC) lines were clonally generated from heterogeneous colon cancer tissues and analyzed with next-generation sequencing. Non-synonymous single-nucleotide variants (SNVs) of the iPSC lines were not identical to the genotypes of the cancer tissues. The SNVs were de novo or pre-existing mutations that originated from a minor population within the cancer tissue. Meanwhile, the genotypes of the iPSC lines were not mutated genotypes of the cancer tissues, suggesting that the starting cells for the iPSC lines were not mature cancer cells. Thus, the genotypes of iPSC lines can be used to trace the genomic origins of single cells within heterogeneous cancer tissue.
Citation: Ishikawa T. Next-generation sequencing traces human induced pluripotent stem cell lines clonally generated from heterogeneous cancer tissue. World J Stem Cells 2017; 9(5): 77-88
Gene transfer of OCT3/4, SOX2, KLF4, and c-Myc to somatic cells generates human induced pluripotent stem cells (iPSCs)[1-3] although c-MYC is not required for iPSC generation[4]. Human iPSCs are indistinguishable from human embryonic stem cells (ESCs) in terms of their long-term self-renewal ability and their in vivo pluripotency[3,5]. The starting cells for iPSC generation should be appropriately chosen to generate normal or aberrant iPSC lines for the purpose of regenerative medicine or cancer research/therapy. Human iPSC lines for regenerative medicine would be ideally generated from normal neonatal tissues[3] that are typically free of postnatal aberrant mutations and epigenetic changes. Human iPSCs (or iPSC-like cells) have also been generated from cancer cell lines[6,7], the somatic cells from familial cancer patients[8,9], and pancreatic ductal adenocarcinomas[10]. For cancer research/therapy, it is of great interest to generate iPSCs from heterogeneous cancer tissues. In our recent study[11], human iPSC lines were clonally generated from a heterogeneous mixture of primary cells derived from gastric tissues or colon cancer tissues and were subjected to microarray gene expression analysis. The resultant iPSC lines expressed all ESC-enriched genes including POU5F1, SOX2 and NANOG that are essential for self-renewal ability and pluripotency[5,12] at a level equivalent to those of the typical human iPSC line (201B7)[1]. Genome-wide gene expression patterns were used to categorize the reference iPSC line 201B7 and the iPSC lines derived from distinct cancer tissues into three different groups. The gene expression profiles of these iPSC lines demonstrated differences derived from their distinct starting tissues and similarity and heterogeneity derived from their common starting heterogeneous tissues. More recently, it was reported that reference component analysis (RCA), an algorithm that substantially improves clustering accuracy, was developed to robustly cluster single-cell transcriptomes[13]. The RCA of single-cell transcriptomes elucidated cellular heterogeneity in human colorectal cancer[13].
In this study, iPSC technology and next-generation sequencing were used to resolve genotype variation among single cells within a heterogeneous cancer tissue. The genomic DNA of ten iPSC lines that were clonally generated from human colon cancer tissue was analyzed and compared with the genomic DNA from their cancer tissue of origin and matched adjacent non-cancerous tissue.
MATERIALS AND METHODS
Tissues derived from a single colon cancer patient
This study was conducted with the approval of the Institutional Review Boards of the National Cancer Center of Japan and the Japanese Collection of Research Bioresources (JCRB), National Institutes of Biomedical Innovation, Health and Nutrition. Written informed consent from a single donor was obtained for the use of the tissues for research. The anonymous remnant non-cancerous and cancerous tissues were provided by the JCRB Tissue Bank. The tissues were derived from the surgical waste material from an operation performed on a 55-year-old Japanese male S-shaped colon cancer patient.
Primary cell culture from cancer tissues
Heterogeneous primary cell culture from the colon cancer tissues was prepared as previously described[11]. Briefly, the tissues were washed with Hank’s balanced salt solution (HBSS) and minced into pieces with scissors. The pieces were further washed with HBSS. DMEM with collagenase was added to the tissue precipitates and mixed at 37 °C for 1 h on a shaker. After washing with DMEM, cells were seeded on collagen-coated dishes and cultured in DMEM supplemented with 10% FBS.
Generation of human iPSC lines
The study was approved by the Institutional Recombinant DNA Advisory Committee. Heterogeneous primary cells from the cancer tissue were cultured for 24 h at approximately 5%-10% confluency and then incubated with a pantropic retrovirus vector solution (OCT3/4, KLF4, and SOX2) at 37 °C for an additional 24 h. The vector solution was prepared as previously described[14]. Mitomycin C-treated mouse embryonic fibroblasts (MEFs, ReproCell) were seeded and co-cultured with the primary cells following the retroviral infection. The culture medium was replaced with MEF-conditioned ESC medium every 3 d until the cell layer was fully confluent and then further refreshed with mTeSR1 medium (STEMCELL Technologies) every day. Each independent colony was isolated from the culture using forceps under a microscope. The independent iPSC lines were sub-cultured with MEF in gelatin-coated 24-well plates.
Expansion and passage culture of human iPSC lines
Human iPSC lines were cultured with the MEFs in primate ESC, ReproStem (ReproCell) or mTeSR1 medium in gelatin-coated dishes[11]. The expanded iPSC lines were treated with a dissociation solution (ReproCell) or 0.25% trypsin-EDTA (Gibco) and passaged in media supplemented with 10-20 μmol/L Y-27632 to avoid cell death[3]. Independent iPSC lines were passaged from the 24-well plates into 6-well plates, further expanded into 100-mm dishes, and minimally passaged in 100-mm dishes under similar culture conditions. Each genomic DNA sample was prepared from independent iPSC lines.
Real-time RT-PCR analysis
Total RNA was prepared using the miRNeasy Mini Kit (Qiagen). Reverse transcription of the RNA was carried out using an iScript™ Advanced cDNA Synthesis Kit for RT-qPCR (Bio-Rad). Quantitative PCR was carried out with an SsoAdvanced Universal SYBR® Green Supermix using the CFX96 Real-Time PCR Detection System (Bio-Rad). PCR primer sets for OCT3/4, SOX2, NANOG, ZFP42, and GAPDH are listed in Supplemental Table 1. PCR data were analyzed using CFX Manager Software (Bio-Rad). PCR data from the iPSC 201B7[1] RNA were used as a positive control, and PCR data from cancer tissue-derived iPSC lines are presented as quantification cycle (Cq) values.
Target sequencing was conducted for twelve DNA samples from the cancer tissues, the non-cancerous tissues, and the ten iPSC lines. Genomic DNA was extracted from each of twelve samples using the DNeasy Blood AND Tissue Kit (Qiagen), sheared into approximately 150-bp fragments, and used to make a library for multiplexed paired-end sequencing with the SureSelectXT Reagent Kit (Agilent Technologies). The constructed library was hybridized to biotinylated cRNA oligonucleotide baits from the SureSelectXT Human Kinome Kit (Agilent Technologies) for target enrichment. Targeted sequences were purified by magnetic beads, amplified, and sequenced on an Illumina HiSeq2000 platform in a paired-end 101 bp configuration.
Mapping and single-nucleotide variant calling
Adapter sequences were removed by cutadapt (v1.2.1). After quality control, reads were mapped to the human reference genome hg19 using BWA (ver.0.6.2). Mapping results were corrected using Picard (ver.1.73) for removing duplicates and GATK (ver.1.5-32) for local alignment and base quality score recalibration. Single-nucleotide variant (SNV) calls were performed with multi-sample calling using GATK (UnifiedGenotyper) and filtered to coordinates with a variant call quality score ≥ 30 and a depth ≥ 8. SNVs were further classified based on their predicted functions of missense, nonsense or read-through. For final confirmation, SNVs were manually curated using the Integrative Genomics Viewer. Annotations of SNVs were based on dbSNP135, CCDS (NCBI release 20111122), RefSeq (UCSC Genome Browser, dumped 20111122), GENCODE (UCSC Genome Browser, ver. 7), and 1000 Genomes (release 20111011) sequences.
RESULTS
Human iPSC lines derived from colon cancer tissues
The human iPSC lines CC1-1, CC1-2, CC1-7, CC1-8, CC1-9, CC1-11, CC1-12, CC1-17, CC1-18, and CC1-25 were clonally generated from heterogeneous primary cells cultured from colon cancer tissue. The iPSC lines were expanded serially with MEFs in gelatin-coated dishes. The cancer tissue-derived iPSCs were indistinguishable in morphology from typical (fibroblast-derived) human iPSCs under conventional culture with MEFs (upper panels in Figure 1 and Supplemental Figure 1). The human iPSCs formed colonies consisting of very small cells and were efficiently passaged at a high recovery ratio with the addition of 10-20 μmol/L Y-27632 to the cell culture medium. Human iPSCs were also cultured with feeder-free mTeSR1 medium in BD MatrigelTM-coated 100-mm dishes and showed a high nucleus-to-cytoplasm ratio (lower panels in Figure 1 and Supplemental Figure 1).
Figure 1 Phase contrast micrographs of colon cancer tissue-derived induced pluripotent stem cells.
The human iPSC line CC1-1 was expanded with mitomycin C-treated mouse embryonic fibroblasts in gelatin-coated dishes (upper left panel: × 10, upper right panel: × 20) and cultured with feeder-free mTeSR1 medium in BD MatrigelTM-coated dishes (lower left panel: × 10, lower right panel: × 20). iPSC: Induced pluripotent stem cell; MEF: Mouse embryonic fibroblast.
Expression of human ESC-essential genes
ESC-essential gene expression of the cancer tissue-derived iPSC lines was quantitatively analyzed by real-time RT-PCR. All ten iPSC lines expressed POU5F1, SOX2, and NANOG, which are essential for self-renewal and pluripotency, at a level equivalent to those of the reference iPSC line (Supplemental Table 2). The study results support previously published microarray data showing that cancer tissue-derived iPSCs equally express ESC-enriched genes[11].
The target region (SureSelect Human Kinome) in genomic DNA samples from the ten iPSC lines, their starting cancer tissues, and the matched adjacent non-cancerous tissues was analyzed using next-generation sequencing. The target region of approximately 3.2 Mb covers the genome of the coding region of all known protein kinase genes and selected oncogenes and tumor suppressor genes, for a total of 612 genes (Supplemental Table 3). The original reads (2.6-4.0 Gb of sequence) were obtained from each genomic DNA sample by sequencing (Table 1). The modified reads were generated from the original reads (Table 2). The results of the mapped reads, the sequencing depth, and target capture are summarized in Tables 3-5. The average depth on the target region ranged from 317 to 496. More than 99.76% of the target region was covered with at least 8 × depth for high-quality genotype calls (variant call quality score ≥ 30).
Fraction of effective bases on or near target (%) (⑧/⑤)
69.06
54.82
62.78
65.43
74.12
67.70
68.38
56.06
72.74
63.05
65.95
66.80
Average sequencing depth on target (⑥/①)
445.06
344.89
317.89
406.72
380.13
430.01
431.06
342.50
432.12
447.66
495.99
344.37
Average sequencing depth near target (⑦/②)
116.05
90.22
93.63
106.76
100.21
115.48
118.31
86.16
117.51
113.43
118.68
90.33
Average sequencing depth on or near target (⑧/③)
265.21
205.68
195.30
242.75
227.11
258.07
260.10
202.38
260.14
264.96
289.74
205.50
Base covered on target ⑨
3143221
3143152
3142784
3143540
3143280
3143263
3143277
3142887
3143338
3143035
3143296
3142818
Coverage of target region (%) (⑨/①)
99.98
99.98
99.97
99.99
99.98
99.98
99.98
99.97
99.98
99.98
99.98
99.97
Base covered near target ⑩
3775671
3773869
3774076
3771915
3768892
3773823
3776942
3760218
3776060
3766215
3762031
3762823
Coverage of near target region (%) (⑩/②)
99.60
99.56
99.56
99.51
99.43
99.56
99.64
99.20
99.62
99.36
99.25
99.27
Fraction of target covered with at least 15 × (%)
99.72
99.62
99.59
99.69
99.65
99.68
99.70
99.55
99.69
99.60
99.68
99.58
Fraction of target covered with at least 8 × (%)
99.86
99.78
99.78
99.83
99.81
99.82
99.83
99.76
99.83
99.78
99.83
99.77
Fraction of target covered with at least 4 × (%)
99.93
99.89
99.89
99.90
99.90
99.90
99.91
99.87
99.92
99.89
99.91
99.89
Fraction of flanking region covered with at least 15 × (%)
86.88
84.24
87.52
85.84
84.62
87.92
89.85
79.85
90.17
84.50
80.72
82.36
Fraction of flanking region covered with at least 8 × (%)
94.32
93.13
94.66
93.66
92.95
94.70
95.77
89.93
95.87
92.55
89.96
91.39
Fraction of flanking region covered with at least 4 × (%)
97.85
97.33
97.91
97.50
97.15
97.87
98.27
95.76
98.32
96.86
95.71
96.37
Non-synonymous SNVs compared with hg19
After sequencing, the reads underwent bioinformatics analysis (Figure 2). Through comparison with the human reference genome hg19, the non-synonymous SNVs (missense, nonsense or read-through) were called on the target region (on and near DNA target enrichment baits) of twelve samples (the ten iPSC lines, their starting cancer tissues, and the matched adjacent non-cancerous tissues). Of the resulting 378 non-synonymous SNVs (Supplemental Table 4), 50 were novel SNVs (not reported in dbSNP135 or 1000 Genomes, Supplemental Table 5).
Figure 2 Pipeline of bioinformatics analysis following next-generation sequencing.
The thirteen confirmed SNVs are shown in Tables 6-8. SNVs: Single-nucleotide variants.
Confirmed genotypes of the twelve samples
Of the 378 non-synonymous SNVs from the twelve samples, 40 were distinct heteroallelic genotypes and included known SNVs in the 612 sequenced gene target region. Supplemental Table 6 lists the forty SNVs that were distinct among the human iPSC lines, their starting cancer tissue, and the matched adjacent non-cancerous tissues. The allelic depth and genotype quality of thirteen of the forty SNVs were validated and manually curated using the Integrative Genomics Viewer (Figure 2).
Mutated genotypes of cancer tissue-derived iPSC lines
The chromosome number, genome position, novelty, gene symbol, and mutation type of the thirteen confirmed SNVs are shown in Table 6; the allelic depth is shown in Table 7; and the genotype is shown in Table 8. The respective SNVs of the ten iPSC lines were compared to those of their starting cancer tissue and the matched non-cancerous tissue. The genotypes, which showed nonsense or missense mutations in EIF2AK2, TTN, ULK4, TSSK1B, FLT4, STK19, STK31, TRRAP, WNK1, PLK1, or PIK3R5 (Table 6), of the iPSC lines were different from that of the non-cancerous tissue sample (Table 8). Nevertheless, the genotypes of the iPSC samples were also different from that of the starting cancer tissue sample. The heteroallelic read sequences of ULK4, TRRAP, and WNK1 of the starting cancer tissue sample consisted of 247|2 of A|C, 240|1 of G|C, and 246|2 of C|T, respectively (Table 7). Although the major read sequences indicated the genotypes of the non-cancerous tissues, the minor reads indicated missense mutations. The potential heteroallelic genotypes were identical to the mutated genotypes of the CC1-25, CC1-12, and CC1-8 iPSC lines. Meanwhile, the genotypes of all ten of the iPSC lines were different from the mutated genotypes in ERBB2 and MKNK2 of the cancer tissues and were identical to those of the non-cancerous tissues and human reference genome hg19 (Table 8). Thus, all analyzed iPSC lines were preferentially generated from starting cells without mutations in ERBB2 and MKNK2, except for those generated from mature cancer cells. Furthermore, the iPSC lines CC1-7 and CC1-17 did not have any confirmed mutated genotypes despite originating from the cancer tissue.
Table 6 Chromosome number, genome position, reference vs single-nucleotide variant, novelty vs dbSNP135, gene symbol, and mutation types of single-nucleotide variants.
SNV No.
Chromo-some No.
Genome position
Ref.|SNV
Novel/known
Gene symbol
Mutation types
1
chr2
37336419
C|T
Novel
EIF2AK2
Missense
2
chr2
179408086
A|G
Novel
TTN
Missense
3
chr3
41705179
A|C
Novel
ULK4
Missense
4
chr5
112769527
C|T
Novel
TSSK1B
Missense
5
chr5
180048626
C|T
Novel
FLT4
Missense
6
chr6
31947203
T|C
Novel
STK19
Missense
7
chr7
23808650
G|T
Novel
STK31
Missense
8
chr7
98490141
G|C
Novel
TRRAP
Missense
9
chr12
1009680
C|T
Novel
WNK1
Missense
10
chr16
23690401
C|T
Novel
PLK1
Missense
11
chr17
8789811
G|A
Novel
PIK3R5
Nonsense
12
chr17
37881392
A|G
Novel
ERBB2
Missense
13
chr19
2046399
G|A
Novel
MKNK2
Missense
Table 7 Allelic depth of single-nucleotide variants among the matched adjacent non-cancerous tissue, the starting cancer tissue, and the cancer tissue-derived induced pluripotent stem cell lines.
Allelic depth of SNVs
SNV No.
NCC1
CC1
CC1-1
CC1-2
CC1-7
CC1-8
CC1-9
CC1-11
CC1-12
CC1-17
CC1-18
CC1-25
1
250|0
246|0
232|0
250|0
250|0
250|0
250|0
248|0
250|0
250|0
129|121
250|0
2
249|0
240|0
240|0
248|0
248|1
250|0
129|121
248|0
242|0
250|0
250|0
244|0
3
246|0
247|2
249|0
238|1
246|0
248|0
233|0
241|0
238|1
241|0
245|0
132|106
4
250|0
239|0
243|0
248|0
245|0
120|129
250|0
236|0
250|0
250|0
250|0
249|0
5
216|0
150|0
75|79
189|0
184|0
180|0
200|1
131|0
176|0
221|0
207|0
179|0
6
249|0
238|0
250|0
132|114
250|0
250|0
242|0
248|0
248|0
250|0
250|0
249|0
7
250|0
248|0
250|0
250|0
245|0
246|0
245|0
135|111
249|0
250|0
250|0
246|1
8
233|0
240|1
243|0
250|0
245|0
242|0
247|0
248|0
132|113
240|1
247|0
241|0
9
249|0
246|2
250|0
250|0
249|0
220|30
244|0
249|0
250|0
249|1
250|0
249|0
10
247|0
177|0
188|0
119|121
198|0
244|0
241|0
176|0
221|0
224|0
249|0
174|0
11
246|1
172|0
181|0
208|0
209|0
198|0
189|0
175|0
244|0
182|0
233|0
95|87
12
249|1
195|54
241|0
249|0
249|0
249|1
249|0
250|0
249|0
250|0
249|1
250|0
13
137|0
91|10
79|0
131|0
102|0
103|0
103|0
83|0
106|0
111|0
142|0
90|0
Table 8 Genotypes of single-nucleotide variants among the matched adjacent non-cancerous tissue, the starting cancer tissue, and the cancer tissue-derived induced pluripotent stem cell lines.
Genotypes of SNVs
SNV No.
NCC1
CC1
CC1-1
CC1-2
CC1-7
CC1-8
CC1-9
CC1-11
CC1-12
CC1-17
CC1-18
CC1-25
1
C/C
C/C
C/C
C/C
C/C
C/C
C/C
C/C
C/C
C/C
C/T
C/C
2
A/A
A/A
A/A
A/A
A/A
A/A
A/G
A/A
A/A
A/A
A/A
A/A
3
A/A
A/A
A/A
A/A
A/A
A/A
A/A
A/A
A/A
A/A
A/A
A/C
4
C/C
C/C
C/C
C/C
C/C
C/T
C/C
C/C
C/C
C/C
C/C
C/C
5
C/C
C/C
C/T
C/C
C/C
C/C
C/C
C/C
C/C
C/C
C/C
C/C
6
T/T
T/T
T/T
T/C
T/T
T/T
T/T
T/T
T/T
T/T
T/T
T/T
7
G/G
G/G
G/G
G/G
G/G
G/G
G/G
G/T
G/G
G/G
G/G
G/G
8
G/G
G/G
G/G
G/G
G/G
G/G
G/G
G/G
G/C
G/G
G/G
G/G
9
C/C
C/C
C/C
C/C
C/C
C/T
C/C
C/C
C/C
C/C
C/C
C/C
10
C/C
C/C
C/C
C/T
C/C
C/C
C/C
C/C
C/C
C/C
C/C
C/C
11
G/G
G/G
G/G
G/G
G/G
G/G
G/G
G/G
G/G
G/G
G/G
G/A
12
A/A
A/G
A/A
A/A
A/A
A/A
A/A
A/A
A/A
A/A
A/A
A/A
13
G/G
G/A
G/G
G/G
G/G
G/G
G/G
G/G
G/G
G/G
G/G
G/G
DISCUSSION
The ten iPSC lines were clonally generated from a heterogeneous mixture of primary cells derived from the colon cancer tissue of a single patient. The genomes of the ten iPSC lines were analyzed using next-generation sequencing. The genomes of the starting cancer tissue and matched adjacent non-cancerous tissue from the same donor were also analyzed. The target region for analysis was the human kinome and cancer-related genes that are typically mutated in human tumors. A total of 378 non-synonymous SNVs identified from samples of the ten iPSC lines and the cancerous and non-cancerous tissues were identified by comparing the sequence reads to the human reference genome hg19. Most of the non-synonymous SNVs showed the genotype of the non-cancerous tissue, suggesting their germline origin. The SNVs of the ten iPSC lines were compared with those of the cancerous and non-cancerous tissues. Forty of the SNVs were distinct genotypes among all twelve samples. Thirteen of the forty SNVs were confirmed using allelic depth, genotype quality, and the Integrative Genomics Viewer.
In eight of the ten iPSC lines, one or two novel, non-synonymous SNVs (heteroallelic missense or nonsense mutation) in EIF2AK2, TTN, ULK4, TSSK1B, FLT4, STK19, STK31, TRRAP, WNK1, PLK1 or PIK3R5 were identified as genotypes different from those of the non-cancerous tissue. Unexpectedly, all the SNVs were not identical to the genotypes found in the cancer tissues. Because of minor read sequences, the potential genotype of ULK4, TRRAP or WNK1 in the cancer tissues was implied. The sequences indicated a missense mutation in ULK4, TRRAP or WNK1 identical to that found in the iPSC lines CC1-25, CC1-12, and CC1-8. Accordingly, there is a possibility that each iPSC line was generated from a starting cell from a minor cell population with a mutation in ULK4, TRRAP, or WNK1 that was present within the cancer tissue. The minor read sequences could be confirmed by ultra-deep sequencing to support the potential heteroallelic genotypes. Interestingly, two iPSC lines CC1-7 and CC1-17 did not have any confirmed mutated genotypes despite originating from the cancer tissues. Therefore, the two iPSC lines might be generated from non-cancerous cells such as pre-cancer (stem) cells and cancer-associated fibroblasts[15,16].
The SNVs of the ten iPSC lines could be de novo or pre-existing mutations that originated from minor cell populations, such as multifocal cancer (stem) cells and pre-metastatic cancer cells, present within the heterogeneous cancer tissue. Primary cancer tissues include multifocal pre-, mature and pre-metastatic cancer cells, so it makes sense that their genomes would be heterogeneous. The genotypes of pre-cancer (stem) cells would not be identical to those of germline or mature cancer cells, as colon cancer develops from an adenoma to carcinoma through the accumulation of a number of genetic mutations and epigenetic aberration[17]. It is likely that the genotypes of pre-metastatic cancer cells in multiple clonal evolutions would be different from those of non-metastatic cancer cells. Meanwhile, genotypes of major mature cancer cells would be identical to those of cancer tissues; therefore, it was expected that genotypes of cancer tissue-derived iPSC lines would be identical to those of their starting cancer tissues. It was reported that ERBB2 mutations were persistent in 3.6% of patients with colorectal cancer[18]. Indeed, a mutated genotype in ERBB2 of the colon cancer tissues was also identified in this study.
Nevertheless, the genotypes of the ten iPSC lines were different from the mutated ERBB2 and MKNK2 genotypes in the cancer tissues and were identical to those of the non-cancerous tissues and the human reference genome hg19. This result suggests that the starting cells for the iPSC lines did not carry the mutations in ERBB2 and MKNK2 present in the cancer tissues. It is conceivable that the non-mutated genotypes of each iPSC line were identical to those of non-cancerous cells such as pre-cancer (stem) cells, stroma cells and immune cells that existed within the tissue. Each iPSC line was clonally established by selecting an independent single colony expanded from a putative single starting cell originating from heterogeneous cancer tissue. The genome sequence of each iPSC line was derived from its starting single cell. As a result, each iPSC line conserved the non-mutated ERBB2 and MKNK2 genotypes that originated from their respective starting single cells. Interestingly, all ten iPSC lines were not generated from cell populations containing either a mutated ERBB2 and/or a mutated MKNK2. Thus, the genotypes of each iPSC line provide information on the genomic origin of the starting single cell derived from the heterogeneous cancer tissue.
Although the cause of the preference for the genomic origin of their starting cells was not clarified in this study, it seems that chemicals[19], gene sets[1,4], gene transfer[20,21], or inventive pre-culture[22,23], in which the starting cells might be preferentially specified, can affect iPSC generation. Accordingly, materials and methods can be optimized to generate normal or aberrant iPSC lines for the purposes of regenerative medicine or cancer research/therapy. Cancer tissues comprise (pre-) cancer (stem) cells, pre-metastatic cancer cells, stromal cells (such as mesenchymal stem cells, cancer-associated fibroblasts[15,16,24] and tumor endothelial cells) and immune cells (such as tumor-associated macrophages[25], dendritic cells[26] and tumor-infiltrating T cells[23]). Therefore, such a cell-derived iPSC line might be useful for immune-cell therapy[27] with cellular vaccines[28], dendritic cells[29-32] or tumor antigen-specific cytotoxic T cells[23], in addition to the development of models of carcinogenesis[33-35] and drug discovery tools[36,37]. For the purposes of regenerative medicine, human iPSCs are ideally generated from normal neonatal tissues[3,38-40] that are typically inexperienced of postnatal aberrant mutations or epigenetic changes. By contrast, aging and sun-exposed skin carries thousands of evolving clonal cells carrying cancer-causing mutations[41,42]. Indeed, genetic mutations accumulate gradually over a lifetime, even in human somatic stem cells[43]. For this reason, cell sources for iPSC generation should be selected based on the given field of research. Furthermore, iPSC lines with few or no mutations need to be established by the modification of existing methodology[39,44,45], as cell lines with de novo mutations not originating from the starting cells are not desired[46-50].
Nevertheless, cancer tissue-derived iPSCs might give rise to such de novo mutations, as their starting cells might have already suffered from an aberration (epigenetics or gene expression) associated with de novo mutations or cancer. Indeed, colon cancer tissue-derived iPSC lines exhibited unique gene expression profiles, with particular upregulation of FAM19A5 and SLC39A7[11], in comparison with those of the typical iPSC line 201B7[1]. FAM19A5 and SLC39A7 were found to be expressed at lower levels in many human iPSC and ESC lines based on a free online expression atlas (Amazonia!, http://amazonia.transcriptome.eu/search.php)[51]. FAM19A5 was reported as a novel cholangiocarcinoma biomarker[52], while SLC39A7 is an intracellular zinc transporter and a hub for tyrosine kinase activation related to diseases such as cancer[53]. The analysis of iPSC genomes might expose rare single cells, such as an authentic cancer stem cells present within cancer tissues. Thus, next-generation sequencing of heterogeneous cancer tissue-derived iPSC lines might reveal potential aberrations or changes originating from the cancer tissue.
In conclusion, the genotypes of iPSC lines can be used to trace the genotype of the original single cells derived from heterogeneous cancer tissues.
ACKNOWLEDGMENTS
The author would like to thank Dr. Takanori Washio, Mr. O Kobayashi and Mr. Wataru Kurihara of Riken Genesis for supporting the informatics analysis of the sequencing data, Dr. Momoko Kobayashi and Ms. Natsumi Suda for supporting the iPSC culture and genomic DNA preparation, and the members of the Fundamental Innovative Oncology Core Center, the Division of Molecular and Cellular Medicine, the Division of Genetics, and the Division of Carcinogenesis and Prevention for the helpful discussions. The author would also like to acknowledge the professional language editing service of Springer Nature Author Services. The author also acknowledges Dr. Toshio Kitamura, the JCRB Tissue Bank, and the RIKEN BioResource Center for providing the Plat-GP packaging cells, remnant tissues from cancer patients, and the iPSC line (201B7), respectively.
COMMENTS
Background
Starting cells for induced pluripotent stem cell (iPSC) generation should be appropriately adopted to generate normal or aberrant iPSC lines for use in regenerative medicine or cancer research/therapy. Human iPSC lines for regenerative medicine would be ideally generated from normal neonatal tissues, as they are typically free of postnatal aberrant mutations and epigenetic changes. For cancer research/therapy, it is of great interest to generate iPSCs that originate from heterogeneous cancer tissues.
Research frontiers
Microarray experiments have profiled the gene expression of human iPSC lines clonally generated from a heterogeneous mixture of primary cells derived from gastric tissue or colon cancer tissue. The gene expression profiles of such iPSC lines demonstrate differences derived from their distinct starting tissues and similarity and heterogeneity derived from their common starting heterogeneous tissue.
Innovations and breakthroughs
This is the first study to analyze human iPSC lines clonally generated from a heterogeneous mixture of primary cells derived from cancer tissues using next-generation sequencing. Eight of the ten iPSC lines had single-nucleotide variants with de novo or pre-existing mutations originating from a minor population within the cancer tissues. Meanwhile, all other genotypes of the iPSC lines were not mutated as in the original cancer tissues. Two of the ten iPSC lines did not possess any confirmed mutated genotypes despite having been derived from cancer tissue. These results suggest that the majority of iPSC lines originated from starting cells other than major cancer cells. Thus, the genotypes of iPSC lines can be used to trace the genotypes of the starting single cells.
Applications
It is conceivable that cancer tissues are made up of not only pre-cancer (stem) cells and pre-metastatic cancer cells but also stroma cells (such as mesenchymal stem cells, cancer-associated fibroblasts and tumor endothelial cells) and immune cells (such as tumor-associated macrophages, dendritic cells and tumor-infiltrating T cells). These other cell types might serve as targets for drug discovery and immune-cell therapy against cancer. Therefore, such a cell-derived iPSC line might be useful for immune-cell therapies such as cancer vaccines, dendritic cells and tumor antigen-specific cytotoxic T cells, in addition to the development of models of carcinogenesis and drug discovery tools.
Terminology
Most single-nucleotide variants are heteroallelic genotypes that are validated with allelic depth and genotype quality and manually curated using the Integrative Genomics Viewer. Deeper allelic depth of next-generation sequencing further resolves genotype variations among the starting single cells present within heterogeneous cancer tissues. In this way, the genotypes of the iPSC lines may be used to trace the genomic identity of their starting single cells derived from a heterogeneous cancer tissue.
Peer-review
The manuscript is well written and easy to follow.
Footnotes
Manuscript source: Invited manuscript
Specialty type: Cell and tissue engineering
Country of origin: Japan
Peer-review report classification
Grade A (Excellent): A
Grade B (Very good): 0
Grade C (Good): C, C
Grade D (Fair): 0
Grade E (Poor): 0
P- Reviewer: Cao T, Li SC, Mozdarani H S- Editor: Ji FF L- Editor: A E- Editor: Li D
Thomson JA, Itskovitz-Eldor J, Shapiro SS, Waknitz MA, Swiergiel JJ, Marshall VS, Jones JM. Embryonic stem cell lines derived from human blastocysts.Science. 1998;282:1145-1147.
[PubMed] [DOI][Cited in This Article: ]
Kim J, Hoffman JP, Alpaugh RK, Rhim AD, Reichert M, Stanger BZ, Furth EE, Sepulveda AR, Yuan CX, Won KJ. An iPSC line from human pancreatic ductal adenocarcinoma undergoes early to invasive stages of pancreatic cancer progression.Cell Rep. 2013;3:2088-2099.
[PubMed] [DOI][Cited in This Article: ][Cited by in Crossref: 115][Cited by in F6Publishing: 133][Article Influence: 12.1][Reference Citation Analysis (0)]
Ishikawa T, Kobayashi M, Yanagi S, Kato C, Takashima R, Kobayashi E, Hagiwara K, Ochiya T. Human induced hepatic lineage-oriented stem cells: autonomous specification of human iPS cells toward hepatocyte-like cells without any exogenous differentiation factors.PLoS One. 2015;10:e0123193.
[PubMed] [DOI][Cited in This Article: ][Cited by in Crossref: 18][Cited by in F6Publishing: 20][Article Influence: 2.2][Reference Citation Analysis (0)]
Fusaki N, Ban H, Nishiyama A, Saeki K, Hasegawa M. Efficient induction of transgene-free human pluripotent stem cells using a vector based on Sendai virus, an RNA virus that does not integrate into the host genome.Proc Jpn Acad Ser B Phys Biol Sci. 2009;85:348-362.
[PubMed] [DOI][Cited in This Article: ]
Iwamoto H, Ojima T, Hayata K, Katsuda M, Miyazawa M, Iida T, Nakamura M, Nakamori M, Iwahashi M, Yamaue H. Antitumor immune response of dendritic cells (DCs) expressing tumor-associated antigens derived from induced pluripotent stem cells: in comparison to bone marrow-derived DCs.Int J Cancer. 2014;134:332-341.
[PubMed] [DOI][Cited in This Article: ][Cited by in Crossref: 14][Cited by in F6Publishing: 16][Article Influence: 1.5][Reference Citation Analysis (0)]