| 3 | 1/1 | 返回列表 |
| 查看: 20681 | 回復(fù): 2 | |||
QQ894064647新蟲 (初入文壇)
|
[交流]
TCGA(癌癥和腫瘤基因圖譜)數(shù)據(jù)下載和處理(TCGA-Assembler) 已有2人參與
|
|
國(guó)政府發(fā)起的癌癥和腫瘤基因圖譜(Cancer Genome Atlas,TCGA)計(jì)劃,試圖通過(guò)應(yīng)用基因組分析技術(shù),特別是采用大規(guī)模的基因組測(cè)序,將人類全部癌癥(近期目標(biāo)為50種包括亞型在內(nèi)的腫瘤)的基因組變異圖譜繪制出來(lái),并進(jìn)行系統(tǒng)分析,旨在找到所有致癌和抑癌基因的微小變異,了解癌細(xì)胞發(fā)生、發(fā)展的機(jī)制,在此基礎(chǔ)上取得新的診斷和治療方法,最后可以勾畫出整個(gè)新型“預(yù)防癌癥的策略”。 TCGA 使命:提高人們對(duì)癌癥發(fā)病分子基礎(chǔ)的科學(xué)認(rèn)識(shí)及提高我們?cè)\斷、治療和預(yù)防癌癥的能力 TCGA 目標(biāo):完成一套完整的與所有癌癥基因組改變相關(guān)的“圖譜”。 圖1.png TCGA數(shù)據(jù)源大部分都是公開的,如何有效的進(jìn)行收集和預(yù)處理是一個(gè)頭疼的問(wèn)題。今天我們講解下怎么將TCGA的數(shù)據(jù)轉(zhuǎn)化成癌癥類型的二維數(shù)據(jù)矩陣(例如基因?yàn)閞ows,樣本為columns)。得到這個(gè)矩陣之后,后面的事情就好辦了,我們可以做差異表達(dá),共表達(dá)網(wǎng)絡(luò),生存分析等。今天我們主要講解如何下載TCGA的數(shù)據(jù),大家對(duì)后續(xù)分析感興趣的話,可以在加“生物信息培訓(xùn)+視頻”裙,或者大家可以在掏寶搜索“生物信息視頻”,跟我們聯(lián)系。 我們開始吧,我們可以使用TCGA-Assembler這軟件去下載TCGA的數(shù)據(jù)http://www.compgenome.org/TCGA-Assembler/。TCGA-Assembler不但可以很方便的下載數(shù)據(jù),還能對(duì)數(shù)據(jù)進(jìn)行初始化處理,非常方便。下載完后,我們使用首先要安裝一些依賴包。通過(guò)下面的命令: install.packages(c("HGNChelper", "RCurl", "httr", "stringr", "digest", "bitops" , dependencies=T)安裝完了依賴包,我們進(jìn)入剛才下載的TCGA-Assembler的目錄,使用setwd(C:/Users/cloud/Desktop/TCGA-Assembler)設(shè)置TCGA-Assembler的目錄為工作目錄,接下來(lái),我們就可以下載數(shù)據(jù)了。我們需要下載什么數(shù)據(jù),就選擇相應(yīng)的腳本。具體腳本如下: # Load module A functions. source("Module_A.r" ;# Download level-3 miRNA-seq data of six rectum adenocarcinoma (READ) samples miRNASeqRawData = DownloadmiRNASeqData(traverseResultFile = "./DirectoryTraverseResult_Jul-08-2014.rda", saveFolderName = "./QuickStartGuide_Results/RawData/", cancerType = "READ", assayPlatform = "miRNASeq", inputPatientIDs = c("TCGA-EI-6884-01", "TCGA-DC-5869-01", "TCGA-G5-6572-01", "TCGA-F5-6812-01", "TCGA-AF-2689-11", "TCGA-AF-2691-11" ); # Download level-3 DNA copy number data of six READ samples CNARawData = DownloadCNAData(traverseResultFile = "./DirectoryTraverseResult_Jul-08-2014.rda", saveFolderName = "./QuickStartGuide_Results/RawData/", cancerType = "READ", assayPlatform = "genome_wide_snp_6", inputPatientIDs = c("TCGA-EI-6884-01", "TCGA-DC-5869-01", "TCGA-G5-6572-01", "TCGA-F5-6812-01", "TCGA-AF-2692-10", "TCGA-AG-4021-10" ); # Download level-3 RNASeqV2 gene expression and exon expression data of six READ samples RNASeqRawData = DownloadRNASeqData(traverseResultFile = "./DirectoryTraverseResult_Jul-08-2014.rda", saveFolderName = "./QuickStartGuide_Results/RawData/", cancerType = "READ", assayPlatform = "RNASeqV2", dataType = c("rsem.genes.normalized_results", "exon_quantification" , inputPatientIDs = c("TCGA-EI-6884-01", "TCGA-DC-5869-01", "TCGA-G5-6572-01", "TCGA-F5-6812-01", "TCGA-AG-3732-11", "TCGA-AG-3742-11" ); # Download level-3 HumanMethylation27 data of six READ samples Methylation27RawData = DownloadMethylationData(traverseResultFile = "./DirectoryTraverseResult_Jul-08-2014.rda", saveFolderName = "./QuickStartGuide_Results/RawData/", cancerType = "READ", assayPlatform = "humanmethylation27", inputPatientIDs = c("TCGA-AG-3583-01", "TCGA-AG-A032-01", "TCGA-AF-2692-11", "TCGA-AG-4001-01", "TCGA-AG-3608-01", "TCGA-AG-3574-01" ); # Download level-3 HumanMethylation450 data of six READ samples Methylation450RawData = DownloadMethylationData(traverseResultFile = "./DirectoryTraverseResult_Jul-08-2014.rda", saveFolderName = "./QuickStartGuide_Results/RawData", cancerType = "READ", assayPlatform = "humanmethylation450", inputPatientIDs = c("TCGA-EI-6884-01", "TCGA-DC-5869-01", "TCGA-G5-6572-01", "TCGA-F5-6812-01", "TCGA-AG-A01W-11", "TCGA-AG-3731-11" ); # Download level-3 RPPA protein expression data of six READ samples RPPARawData = DownloadRPPAData(traverseResultFile = "./DirectoryTraverseResult_Jul-08-2014.rda", saveFolderName = "./QuickStartGuide_Results/RawData", cancerType = "READ", assayPlatform = "mda_rppa_core", inputPatientIDs = c("TCGA-EI-6884-01", "TCGA-DC-5869-01", "TCGA-G5-6572-01", "TCGA-F5-6812-01", "TCGA-AG-3582-01", "TCGA-AG-4001-01" ); # Download de-identified clinical information of READ patients DownloadClinicalData(traverseResultFile = "./DirectoryTraverseResult_Jul-08-2014.rda", saveFolderName = "./QuickStartGuide_Results/RawData", cancerType = "READ", clinicalDataType = c("patient", "drug", "follow_up" );運(yùn)行上面的腳本,我們就能得到我們想要的結(jié)果了,假如我們需要下載adenocarcinoma的miRNA數(shù)據(jù),我們可以使用。下載完后,我們就得到了adenocarcinoma的矩陣了(基因?yàn)閞ows,樣本為columns)。 setwd(C:/Users/cloud/Desktop/TCGA-Assembler) source("Module_A.r" ;miRNASeqRawData = DownloadmiRNASeqData(traverseResultFile = "./DirectoryTraverseResult_Jul-08-2014.rda", saveFolderName = "./QuickStartGuide_Results/RawData/", cancerType = "READ", assayPlatform = "miRNASeq" ; |
木蟲 (正式寫手)
五道杠

| 3 | 1/1 | 返回列表 |
| 最具人氣熱帖推薦 [查看全部] | 作者 | 回/看 | 最后發(fā)表 | |
|---|---|---|---|---|
|
[考研] 261求B區(qū)調(diào)劑,科研經(jīng)歷豐富 +3 | 牛奶很忙 2026-03-20 | 4/200 |
|
|---|---|---|---|---|
|
[考研] 材料與化工專碩調(diào)劑 +7 | heming3743 2026-03-16 | 7/350 |
|
|
[考研] 086500 325 求調(diào)劑 +3 | 領(lǐng)帶小熊 2026-03-19 | 3/150 |
|
|
[考研] 材料學(xué)碩297已過(guò)四六級(jí)求調(diào)劑推薦 +6 | adaie 2026-03-19 | 6/300 |
|
|
[考研] 286分人工智能專業(yè)請(qǐng)求調(diào)劑愿意跨考! +3 | lemonzzn 2026-03-17 | 4/200 |
|
|
[考研] 材料學(xué)碩318求調(diào)劑 +5 | February_Feb 2026-03-19 | 5/250 |
|
|
[考研] 一志愿中海洋材料工程專碩330分求調(diào)劑 +7 | 小材化本科 2026-03-18 | 7/350 |
|
|
[考研] 材料工程專碩調(diào)劑 +5 | 204818@lcx 2026-03-17 | 6/300 |
|
|
[考研] 328求調(diào)劑,英語(yǔ)六級(jí)551,有科研經(jīng)歷 +3 | 生物工程調(diào)劑 2026-03-17 | 7/350 |
|
|
[考研] 【同濟(jì)軟件】軟件(085405)考研求調(diào)劑 +3 | 2026eternal 2026-03-18 | 3/150 |
|
|
[考研] 311求調(diào)劑 +11 | 冬十三 2026-03-15 | 12/600 |
|
|
[考研] 070300化學(xué)319求調(diào)劑 +6 | 錦鯉0909 2026-03-17 | 6/300 |
|
|
[考研] 299求調(diào)劑 +5 | △小透明* 2026-03-17 | 5/250 |
|
|
[考研] 280求調(diào)劑 +6 | 咕嚕曉曉 2026-03-18 | 7/350 |
|
|
[考研] 303求調(diào)劑 +4 | 睿08 2026-03-17 | 6/300 |
|
|
[考研] 材料專碩326求調(diào)劑 +6 | 墨煜姒莘 2026-03-15 | 7/350 |
|
|
[考研] 一志愿蘇州大學(xué)材料工程(085601)專碩有科研經(jīng)歷三項(xiàng)國(guó)獎(jiǎng)兩個(gè)實(shí)用型專利一項(xiàng)省級(jí)立項(xiàng) +6 | 大火山小火山 2026-03-16 | 8/400 |
|
|
[論文投稿] 有沒有大佬發(fā)小論文能帶我個(gè)二作 +3 | 增銳漏人 2026-03-17 | 4/200 |
|
|
[考研] 318求調(diào)劑 +3 | Yanyali 2026-03-15 | 3/150 |
|
|
[考研] 070303 總分349求調(diào)劑 +3 | LJY9966 2026-03-15 | 5/250 |
|