| 24小時(shí)熱門(mén)版塊排行榜 |
| 3 | 1/1 | 返回列表 |
| 查看: 20682 | 回復(fù): 2 | |||
QQ894064647新蟲(chóng) (初入文壇)
|
[交流]
TCGA(癌癥和腫瘤基因圖譜)數(shù)據(jù)下載和處理(TCGA-Assembler) 已有2人參與
|
|
國(guó)政府發(fā)起的癌癥和腫瘤基因圖譜(Cancer Genome Atlas,TCGA)計(jì)劃,試圖通過(guò)應(yīng)用基因組分析技術(shù),特別是采用大規(guī)模的基因組測(cè)序,將人類(lèi)全部癌癥(近期目標(biāo)為50種包括亞型在內(nèi)的腫瘤)的基因組變異圖譜繪制出來(lái),并進(jìn)行系統(tǒng)分析,旨在找到所有致癌和抑癌基因的微小變異,了解癌細(xì)胞發(fā)生、發(fā)展的機(jī)制,在此基礎(chǔ)上取得新的診斷和治療方法,最后可以勾畫(huà)出整個(gè)新型“預(yù)防癌癥的策略”。 TCGA 使命:提高人們對(duì)癌癥發(fā)病分子基礎(chǔ)的科學(xué)認(rèn)識(shí)及提高我們?cè)\斷、治療和預(yù)防癌癥的能力 TCGA 目標(biāo):完成一套完整的與所有癌癥基因組改變相關(guān)的“圖譜”。 圖1.png TCGA數(shù)據(jù)源大部分都是公開(kāi)的,如何有效的進(jìn)行收集和預(yù)處理是一個(gè)頭疼的問(wèn)題。今天我們講解下怎么將TCGA的數(shù)據(jù)轉(zhuǎn)化成癌癥類(lèi)型的二維數(shù)據(jù)矩陣(例如基因?yàn)閞ows,樣本為columns)。得到這個(gè)矩陣之后,后面的事情就好辦了,我們可以做差異表達(dá),共表達(dá)網(wǎng)絡(luò),生存分析等。今天我們主要講解如何下載TCGA的數(shù)據(jù),大家對(duì)后續(xù)分析感興趣的話(huà),可以在加“生物信息培訓(xùn)+視頻”裙,或者大家可以在掏寶搜索“生物信息視頻”,跟我們聯(lián)系。 我們開(kāi)始吧,我們可以使用TCGA-Assembler這軟件去下載TCGA的數(shù)據(jù)http://www.compgenome.org/TCGA-Assembler/。TCGA-Assembler不但可以很方便的下載數(shù)據(jù),還能對(duì)數(shù)據(jù)進(jìn)行初始化處理,非常方便。下載完后,我們使用首先要安裝一些依賴(lài)包。通過(guò)下面的命令: install.packages(c("HGNChelper", "RCurl", "httr", "stringr", "digest", "bitops" , dependencies=T)安裝完了依賴(lài)包,我們進(jìn)入剛才下載的TCGA-Assembler的目錄,使用setwd(C:/Users/cloud/Desktop/TCGA-Assembler)設(shè)置TCGA-Assembler的目錄為工作目錄,接下來(lái),我們就可以下載數(shù)據(jù)了。我們需要下載什么數(shù)據(jù),就選擇相應(yīng)的腳本。具體腳本如下: # Load module A functions. source("Module_A.r" ;# Download level-3 miRNA-seq data of six rectum adenocarcinoma (READ) samples miRNASeqRawData = DownloadmiRNASeqData(traverseResultFile = "./DirectoryTraverseResult_Jul-08-2014.rda", saveFolderName = "./QuickStartGuide_Results/RawData/", cancerType = "READ", assayPlatform = "miRNASeq", inputPatientIDs = c("TCGA-EI-6884-01", "TCGA-DC-5869-01", "TCGA-G5-6572-01", "TCGA-F5-6812-01", "TCGA-AF-2689-11", "TCGA-AF-2691-11" ); # Download level-3 DNA copy number data of six READ samples CNARawData = DownloadCNAData(traverseResultFile = "./DirectoryTraverseResult_Jul-08-2014.rda", saveFolderName = "./QuickStartGuide_Results/RawData/", cancerType = "READ", assayPlatform = "genome_wide_snp_6", inputPatientIDs = c("TCGA-EI-6884-01", "TCGA-DC-5869-01", "TCGA-G5-6572-01", "TCGA-F5-6812-01", "TCGA-AF-2692-10", "TCGA-AG-4021-10" ); # Download level-3 RNASeqV2 gene expression and exon expression data of six READ samples RNASeqRawData = DownloadRNASeqData(traverseResultFile = "./DirectoryTraverseResult_Jul-08-2014.rda", saveFolderName = "./QuickStartGuide_Results/RawData/", cancerType = "READ", assayPlatform = "RNASeqV2", dataType = c("rsem.genes.normalized_results", "exon_quantification" , inputPatientIDs = c("TCGA-EI-6884-01", "TCGA-DC-5869-01", "TCGA-G5-6572-01", "TCGA-F5-6812-01", "TCGA-AG-3732-11", "TCGA-AG-3742-11" ); # Download level-3 HumanMethylation27 data of six READ samples Methylation27RawData = DownloadMethylationData(traverseResultFile = "./DirectoryTraverseResult_Jul-08-2014.rda", saveFolderName = "./QuickStartGuide_Results/RawData/", cancerType = "READ", assayPlatform = "humanmethylation27", inputPatientIDs = c("TCGA-AG-3583-01", "TCGA-AG-A032-01", "TCGA-AF-2692-11", "TCGA-AG-4001-01", "TCGA-AG-3608-01", "TCGA-AG-3574-01" ); # Download level-3 HumanMethylation450 data of six READ samples Methylation450RawData = DownloadMethylationData(traverseResultFile = "./DirectoryTraverseResult_Jul-08-2014.rda", saveFolderName = "./QuickStartGuide_Results/RawData", cancerType = "READ", assayPlatform = "humanmethylation450", inputPatientIDs = c("TCGA-EI-6884-01", "TCGA-DC-5869-01", "TCGA-G5-6572-01", "TCGA-F5-6812-01", "TCGA-AG-A01W-11", "TCGA-AG-3731-11" ); # Download level-3 RPPA protein expression data of six READ samples RPPARawData = DownloadRPPAData(traverseResultFile = "./DirectoryTraverseResult_Jul-08-2014.rda", saveFolderName = "./QuickStartGuide_Results/RawData", cancerType = "READ", assayPlatform = "mda_rppa_core", inputPatientIDs = c("TCGA-EI-6884-01", "TCGA-DC-5869-01", "TCGA-G5-6572-01", "TCGA-F5-6812-01", "TCGA-AG-3582-01", "TCGA-AG-4001-01" ); # Download de-identified clinical information of READ patients DownloadClinicalData(traverseResultFile = "./DirectoryTraverseResult_Jul-08-2014.rda", saveFolderName = "./QuickStartGuide_Results/RawData", cancerType = "READ", clinicalDataType = c("patient", "drug", "follow_up" );運(yùn)行上面的腳本,我們就能得到我們想要的結(jié)果了,假如我們需要下載adenocarcinoma的miRNA數(shù)據(jù),我們可以使用。下載完后,我們就得到了adenocarcinoma的矩陣了(基因?yàn)閞ows,樣本為columns)。 setwd(C:/Users/cloud/Desktop/TCGA-Assembler) source("Module_A.r" ;miRNASeqRawData = DownloadmiRNASeqData(traverseResultFile = "./DirectoryTraverseResult_Jul-08-2014.rda", saveFolderName = "./QuickStartGuide_Results/RawData/", cancerType = "READ", assayPlatform = "miRNASeq" ; |
新蟲(chóng) (初入文壇)
木蟲(chóng) (正式寫(xiě)手)
五道杠

| 3 | 1/1 | 返回列表 |
| 最具人氣熱帖推薦 [查看全部] | 作者 | 回/看 | 最后發(fā)表 | |
|---|---|---|---|---|
|
[考研] 307求調(diào)劑 +3 | wyyyqx 2026-03-17 | 3/150 |
|
|---|---|---|---|---|
|
[考研] 265求調(diào)劑 +3 | Jack?k?y 2026-03-17 | 3/150 |
|
|
[考研] 279分求調(diào)劑 一志愿211 +11 | chaojifeixia 2026-03-19 | 12/600 |
|
|
[考研] 材料專(zhuān)業(yè)求調(diào)劑 +6 | hanamiko 2026-03-18 | 6/300 |
|
|
[考研] 317求調(diào)劑 +8 | 申子申申 2026-03-19 | 13/650 |
|
|
[考研] 304求調(diào)劑 +7 | 司空. 2026-03-18 | 7/350 |
|
|
[考研] 317求調(diào)劑 +5 | 申子申申 2026-03-19 | 9/450 |
|
|
[考研] 287求調(diào)劑 +7 | 晨昏線(xiàn)與星海 2026-03-19 | 8/400 |
|
|
[考研] 329求調(diào)劑 +9 | 想上學(xué)吖吖 2026-03-19 | 9/450 |
|
|
[考研] 一志愿中南化學(xué)(0703)總分337求調(diào)劑 +8 | niko- 2026-03-19 | 9/450 |
|
|
[考研] 289求調(diào)劑 +6 | 懷瑾握瑜l 2026-03-20 | 6/300 |
|
|
[考研] 0703化學(xué)調(diào)劑 ,六級(jí)已過(guò),有科研經(jīng)歷 +13 | 曦熙兮 2026-03-15 | 13/650 |
|
|
[考研] 0703化學(xué)調(diào)劑 +5 | pupcoco 2026-03-17 | 8/400 |
|
|
[考研] 材料工程專(zhuān)碩調(diào)劑 +5 | 204818@lcx 2026-03-17 | 6/300 |
|
|
[考研] 326求調(diào)劑 +5 | 上岸的小葡 2026-03-15 | 6/300 |
|
|
[碩博家園] 湖北工業(yè)大學(xué) 生命科學(xué)與健康學(xué)院-課題組招收2026級(jí)食品/生物方向碩士 +3 | 1喜春8 2026-03-17 | 5/250 |
|
|
[考研] 333求調(diào)劑 +3 | 文思客 2026-03-16 | 7/350 |
|
|
[考研] 304求調(diào)劑 +5 | 素年祭語(yǔ) 2026-03-15 | 5/250 |
|
|
[考研] 一志愿211 0703方向310分求調(diào)劑 +3 | 努力奮斗112 2026-03-15 | 3/150 |
|
|
[考研] 304求調(diào)劑 +3 | 曼殊2266 2026-03-14 | 3/150 |
|