| Views: 5685 | Replies: 12 |
zhangjunpeng, Supreme Woodworm (well-known author)
|
[Discussion]
Sharing experience with the TCGA database (10 participants so far)
|
|||
|
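In bioinformatics, data sources matter a great deal; after all, nobody wants to be the clever cook who has no rice to cook with. With the arrival of the big-data era, large public biological databases keep improving, among them The Cancer Genome Atlas (TCGA, https://tcga-data.nci.nih.gov/tcga/tcgaHome2.jsp). Most TCGA data are publicly available, but collecting and preprocessing them efficiently is a headache. The tools currently able to extract data from TCGA include cBioPortal (http://www.cbioportal.org/public-portal/cgds_r.jsp), ICGC (http://dcc.icgc.org/download/current) and GenePattern (http://www.broadinstitute.org/ca ... tern/download/index). Each of these tools still has limitations: none of them makes it easy to obtain a two-dimensional data matrix for each cancer type (for example, genes as rows and samples as columns). I am therefore opening this thread and invite fellow forum members to share their experience of obtaining data from TCGA, as well as tips and methods for tools that access TCGA data indirectly. |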
Accumulated experience in molecular biochemistry experiments |

|
To download data from the TCGA data portal:
1. Connect to https://tcga-data.nci.nih.gov/tcga/
2. Select the cancer type you are interested in (e.g. breast invasive carcinoma).
3. Select mRNA.
4. You will now see a table in which each row represents a different patient.
5. If the cancer type has an RNASeq or RNASeqV2 column, select it by clicking on its header, then click BUILD archive.
6. Keep in mind that the number just below the header indicates the data level, from 1 to 4 (https://wiki.nci.nih.gov/display/TCGA/Data+level). If you need raw data such as FASTQ files, you have to find level 1 data, but this kind of data is often not publicly available on TCGA and you might need to ask for permission to download it. |
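Once an archive has been downloaded, the gene-by-sample matrix asked for in the opening post still has to be assembled by hand. The following is only a minimal sketch of that merge step, under assumptions that are not confirmed anywhere in this thread: each sample is taken to have its own tab-separated table with a header row, a gene identifier in the first column and a numeric value in the second. The function name `build_matrix` and the example sample IDs are hypothetical, and real archives may be laid out differently.

```python
import csv
import io


def build_matrix(sample_tables):
    """Merge per-sample tables into a genes x samples matrix.

    sample_tables maps a sample ID to the text of a tab-separated table
    (ASSUMED layout: header row, then 'gene<TAB>value' lines).
    Returns (genes, samples, matrix) where matrix[i][j] is the value of
    genes[i] in samples[j], or None if that sample lacks the gene.
    """
    genes = []       # row labels, in order of first appearance
    seen = set()
    columns = {}     # sample ID -> {gene: value}

    for sample_id, text in sample_tables.items():
        reader = csv.reader(io.StringIO(text), delimiter="\t")
        next(reader, None)  # skip the assumed header row
        values = {}
        for row in reader:
            if len(row) < 2:
                continue  # ignore blank or malformed lines
            gene, value = row[0], float(row[1])
            values[gene] = value
            if gene not in seen:
                seen.add(gene)
                genes.append(gene)
        columns[sample_id] = values

    samples = sorted(columns)  # column labels
    matrix = [[columns[s].get(g) for s in samples] for g in genes]
    return genes, samples, matrix


if __name__ == "__main__":
    # Made-up values, only to show the shape of the result.
    demo = {
        "SAMPLE_B": "gene_id\tvalue\nTP53\t1.5\nBRCA1\t2.0\n",
        "SAMPLE_A": "gene_id\tvalue\nTP53\t3.0\n",
    }
    genes, samples, matrix = build_matrix(demo)
    print("\t".join(["gene"] + samples))
    for gene, row in zip(genes, matrix):
        print("\t".join([gene] + ["NA" if v is None else str(v) for v in row]))
```

In practice the dictionary would be filled by reading each file in the unpacked archive, with the sample ID taken from the file name or from the archive's own sample mapping. Genes missing from a sample are left as `None` here rather than set to zero, so that absent measurements are not confused with zero expression.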
Woodworm (regular writer)
五道杠
|
http://wenku.baidu.com/link?url= ... O7J9NHzBL_xnc1QCBRC This link is a basic TCGA training course that is worth studying. |

New Worm (newcomer)
|
Below is part of some TCGA data that a senior labmate sent me. What does each group of data mean?
# Mutation matrix made from SNV data (/data/compbio/datasets/MutationMatrices/BREAST/2012-10-31/brca_mutation_fromPanCancer.snv) and CNA data (brca_cna_gistic_wide.cna).
TCGA-A1-A0SD ANK3 C12ORF51 C19ORF51 CASK CDHR3 CNTFR COL14A1 CPAMD8 CPEB2 CXORF58 FAM182B FNDC1 GDF5 GRIN2C IGSF3 KIRREL KLK15 L1CAM LOC653125 LRBA LRP2 NCOA3 PAK1(A) PCDHA6 PGC PNLIPRP2 PTEN(D) RP1 SFRS17A SIDT2 SLC44A3 SLFN14 SNX5 TLR5 WDR72 ZFP91 ZFR2 ZNF544 ZNF740
TCGA-A1-A0SE ARRDC4 B3GNT1 C10ORF71 C3ORF38 CCND1(A) CDH1 ENSG00000234924 ENSG00000245041 ENSG00000245055 ENSG00000245922 ENSG00000246925 ENSG00000247772 LOC646096 MAP2K4(D) MED23 MGA MRPS18B PAK1(A) RBM26 SDR16C5 SYDE2 TBC1D12 UNC13C WDR91 ZFHX4
TCGA-A1-A0SH 12p13.33(A) ACSL4 AHCTF1 ALPK3 ANK3 ANKRD7 APOB48R ARHGAP28 ASL ATPIF1 BCL7B BDP1 BLOC1S1 BRCA1 C14ORF37 CAP2 CCT8 CD97 CDCA2 CHCHD1 CNTN4 COL14A1 CUBN DAPK2 DHRS13 DMD DNAH8 DRGX ENSG00000210082 ENSG00000245997 ENSG00000246667 ENSG00000247966 ESCO1 EXPH5 FAM111A FAM149B1 FAM150B FAM83B FBXO4 GDF9 GPR32 H2BFWT HCFC2 HOMER3 HYDIN IRS4 ITIH5 KCNT2 KCNU1 KDELR3 KLHL25 KRT28 LOC100130982 LOC100288406 LOC201651 LOC440292 LOC645954 LPP LRRC8A MAGEA12 MARCH7 MED13L METT5D1 MICAL1 MTM1 NOS3 PALLD PCTK3 PHF17 PLCE1 PNCK PPARA PREX1 PTPRD(D) PZP RHCG SCAPER SLC17A4 SPTBN1 TAS2R46 TIFAB TTC39A UPRT WDR7 WDR87 WWOX(D) ZFHX4 ZNF606
TCGA-A1-A0SJ 20p12.1(D) ADK-MYST4(A) ADORA3 ALG1 AMZ2 ASCL3 C14ORF104 CCND1(A) CHML CILP CNR1 COL20A1 ENSG00000240720 ENSG00000245434 ENSG00000245549 ENSG00000245900 ENSG00000246515 ENSG00000247089 FAF2 FLJ40292 GJB2 GNPTAB HAGHL HNRPDL HOOK2 IKZF1 LATS2 LOC100287308 LOC100290640 LOC729866 MAP2K4(D) MCTS1 MDM2(A) NOTUM NUP62 OFD1 PAK1(A) PSMD11 RANBP6 RASSF7 SCN4A SNAI1 SPEN TH1L TNRC6A ZBTB11 ZNF217(A) ZNF543
TCGA-A1-A0SK 8p11(A) 8p11.23(A) ACBD5 AHNAK ANKRD42 ARL11 ASB10 ATG2A C19ORF29OS CAMTA2 CCL23 CHRNB4 COPE CYP21A2 DMBT1 DTX1 ENSG00000005206 ENSG00000223274 FAT3 FLJ32810 GDPD5 GMEB1 GTF3C1 HNRNPA1 IDS LAMA3 LARGE LOC645954 LRP2 LSR NPAS2 NSMAF OBFC2B OR5AU1 PJA1 POU4F1 RB1(D) RXFP4 SCD5 SERTAD3 SHPK SLITRK4 SSR4 TECTA TEX11 TG TP53 TRAF3IP1 UGT2B15 UNC5D USH2A VIT YIPF7 |
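One plausible reading of the data above, offered as an interpretation and not as documentation of the file: each record starts with a TCGA sample barcode and is followed by the genes or chromosomal regions altered in that sample. Since the header line says the matrix was built from SNV and CNA data, names without a suffix are presumably SNV mutations, while `(A)` and `(D)` presumably mark copy-number amplifications and deletions. The sketch below parses the format under exactly those assumptions, and additionally assumes one sample per line in the original file. The function names and the `"M"` label for unsuffixed entries are hypothetical.

```python
import re

# A trailing "(A)" or "(D)" is ASSUMED to mean amplification / deletion.
EVENT_SUFFIX = re.compile(r"^(?P<name>.+)\((?P<flag>[AD])\)$")


def parse_mutation_matrix(text):
    """Parse 'SAMPLE event event ...' lines into {sample: {name: {flags}}}.

    Flags: "A" / "D" for suffixed entries, "M" (hypothetical label for
    an assumed SNV mutation) for entries with no suffix.
    """
    records = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and the comment header
        sample, *events = line.split()
        parsed = {}
        for event in events:
            match = EVENT_SUFFIX.match(event)
            if match:
                name, flag = match.group("name"), match.group("flag")
            else:
                name, flag = event, "M"
            # A name may carry more than one kind of alteration.
            parsed.setdefault(name, set()).add(flag)
        records[sample] = parsed
    return records


def to_binary_matrix(records):
    """Return (names, samples, matrix); matrix[i][j] is 1 if names[i]
    is altered in samples[j] in any way, else 0."""
    samples = sorted(records)
    names = sorted({n for events in records.values() for n in events})
    matrix = [[1 if n in records[s] else 0 for s in samples] for n in names]
    return names, samples, matrix


if __name__ == "__main__":
    # Shortened excerpt of the records posted above.
    demo = (
        "# Mutation matrix made from SNV data and CNA data\n"
        "TCGA-A1-A0SD ANK3 PAK1(A) PTEN(D)\n"
        "TCGA-A1-A0SE CDH1 PAK1(A)\n"
    )
    names, samples, matrix = to_binary_matrix(parse_mutation_matrix(demo))
    print("\t".join(["name"] + samples))
    for name, row in zip(names, matrix):
        print("\t".join([name] + [str(v) for v in row]))
```

The binary matrix discards the distinction between mutation, amplification and deletion; if that distinction matters, the per-sample flag sets returned by `parse_mutation_matrix` can be used directly. Whoever produced the file is the only reliable source for what the suffixes actually encode.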
