版塊導(dǎo)航: 正在加載中...

登錄注冊

應(yīng)《網(wǎng)絡(luò)安全法》要求，自2017年10月1日起，未進行實名認(rèn)證將不得使用互聯(lián)網(wǎng)跟帖服務(wù)。為保障您的帳號能夠正常使用，請盡快對帳號進行手機號驗證，感謝您的理解與支持！

24小時熱門版塊排行榜

北京石油化工學(xué)院2026年研究生招生接收調(diào)劑公告

返回列表

當(dāng)前只顯示滿足指定條件的回帖，點擊這里查看本話題的所有回帖

cnlics

木蟲 (小有名氣)

應(yīng)助: 2 (幼兒園)
金幣: 3014.2
紅花: 4
帖子: 270
在線: 422.4小時
蟲號: 795158
注冊: 2009-06-16
性別: GG
專業(yè): 當(dāng)代宗教

[交流] 【分享】蛋白質(zhì)結(jié)構(gòu)預(yù)測流程已有23人參與

我慢慢翻譯慢慢貼

這里貼的內(nèi)容是以前收集的，應(yīng)該是來自EMBL，我粗略瀏覽了下內(nèi)容，還沒有過時。

WORD文檔可以在這里下載：
http://ifile.it/dwzy278

蛋白質(zhì)結(jié)構(gòu)預(yù)測一般流程見下圖：

內(nèi)容目錄：

•相關(guān)實驗數(shù)據(jù)
•序列數(shù)據(jù)和初步分析
•搜索序列數(shù)據(jù)庫
•識別結(jié)構(gòu)域
•多序列比對
•比較或同源建模
•二級結(jié)構(gòu)預(yù)測
•折疊的識別
•折疊分析與二級結(jié)構(gòu)比對
•序列與結(jié)構(gòu)的比對

[ Last edited by cnlics on 2010-9-16 at 08:24 ]

回復(fù)此樓

» 收錄本帖的淘帖專輯推薦

蛋白質(zhì)生物學(xué)實驗經(jīng)驗	分子生物實驗及蛋白純化結(jié)晶相關(guān)鏈接	生物信息學(xué)	生物化學(xué)和分子生物學(xué)
精品收藏	待下載	蛋白質(zhì)	交叉知識
比偶長大	蛋白分析軟件	生物信息學(xué)

» 本帖已獲得的紅花（最新10朵）

beimi

» 猜你喜歡

材料考研調(diào)劑已經(jīng)有3人回復(fù)
材料調(diào)劑已經(jīng)有12人回復(fù)
英一數(shù)一408，總分284，二戰(zhàn)真誠求調(diào)劑已經(jīng)有14人回復(fù)
085410 一志愿211 22408分?jǐn)?shù)359求調(diào)劑已經(jīng)有4人回復(fù)
271求調(diào)劑已經(jīng)有19人回復(fù)
385分生物學(xué)（071000）求調(diào)劑已經(jīng)有3人回復(fù)
一志愿安徽大學(xué)計算機科學(xué)與技術(shù)學(xué)碩，331分求調(diào)劑已經(jīng)有3人回復(fù)
318求調(diào)劑，計算材料方向已經(jīng)有8人回復(fù)
291求調(diào)劑已經(jīng)有25人回復(fù)
一志愿北京科技大學(xué)085601材料工程英一數(shù)二初試總分335求調(diào)劑已經(jīng)有6人回復(fù)

» 本主題相關(guān)價值貼推薦，對您同樣有幫助:

用SwissModel預(yù)測只能得到蛋白質(zhì)的一大部分的三維結(jié)構(gòu)，why？已經(jīng)有14人回復(fù)
蛋白分子建模小分子化合物畫圖酶與配體的分子模擬已經(jīng)有11人回復(fù)
蛋白質(zhì)二級結(jié)構(gòu)預(yù)測已經(jīng)有9人回復(fù)
蛋白質(zhì)高級結(jié)構(gòu)預(yù)測已經(jīng)有8人回復(fù)
蛋白質(zhì)3-d 結(jié)構(gòu)預(yù)測已經(jīng)有3人回復(fù)
關(guān)于兩個蛋白質(zhì)結(jié)構(gòu)疊合的原理（或者相關(guān)的程序）已經(jīng)有12人回復(fù)
求一個認(rèn)可度較高蛋白質(zhì)二級結(jié)構(gòu)預(yù)測軟件已經(jīng)有1人回復(fù)

1樓 2010-09-14 01:36:32

已閱回復(fù)此樓關(guān)注TA 給TA發(fā)消息送TA紅花 TA的回帖

cnlics

木蟲 (小有名氣)

應(yīng)助: 2 (幼兒園)
金幣: 3014.2
紅花: 4
帖子: 270
在線: 422.4小時
蟲號: 795158
注冊: 2009-06-16
性別: GG
專業(yè): 當(dāng)代宗教

蛋白序列數(shù)據(jù)

對蛋白序列的初步分析有一定價值。例如，如果蛋白是直接來自基因預(yù)測，就可能包含多個結(jié)構(gòu)域。更嚴(yán)重的是，可能會包含不太可能是球形或可溶性的區(qū)域。此流程圖假設(shè)你的蛋白是可溶的，可能是一個結(jié)構(gòu)域并不包含非球形結(jié)構(gòu)域。

需要考慮以下方面：
•是跨膜蛋白或者包含跨膜片段嗎？有許多方法預(yù)測這些片段，包括：

o TMAP (EMBL)
o PredictProtein (EMBL/Columbia)
o TMHMM (CBS, Denmark)
o TMpred (Baylor College)
o DAS (Stockholm)

•如果包含卷曲(coiled-coils)可以在COILS server 預(yù)測coiled coils 或者下載 COILS 程序（最近已經(jīng)重寫，注意GCG程序包里包含了COILS的一個版本）

•蛋白包含低復(fù)雜性區(qū)域？蛋白經(jīng)常含有數(shù)個聚谷氨酸或聚絲氨酸區(qū)，這些地方不容易預(yù)測。可以用SEG（GCG程序包里包含了一個版本的SEG程序）檢查。

如果出現(xiàn)以上一種情況，就應(yīng)該將序列打成碎片，或忽略序列中的特定區(qū)段，等等。這個問題與細(xì)胞定位結(jié)構(gòu)域相關(guān)。

[ Last edited by cnlics on 2010-9-16 at 08:25 ]

贊一下

回復(fù)此樓

3樓2010-09-14 01:41:58

已閱回復(fù)此樓關(guān)注TA 給TA發(fā)消息送TA紅花 TA的回帖

查看全部 33 個回答

cnlics

木蟲 (小有名氣)

應(yīng)助: 2 (幼兒園)
金幣: 3014.2
紅花: 4
帖子: 270
在線: 422.4小時
蟲號: 795158
注冊: 2009-06-16
性別: GG
專業(yè): 當(dāng)代宗教

實驗數(shù)據(jù)

許多實驗數(shù)據(jù)可以輔助結(jié)構(gòu)預(yù)測過程，包括：
•二硫鍵，固定了半胱氨酸的空間位置
•光譜數(shù)據(jù)，可以提供蛋白的二級結(jié)構(gòu)內(nèi)容
•定位突變研究，可以發(fā)現(xiàn)活性或結(jié)合位點的殘基
•蛋白酶切割位點，翻譯后修飾如磷酸化或糖基化提示了殘基必須是暴露的
•其他
預(yù)測時，必須清楚所有的數(shù)據(jù)。必須時刻考慮：預(yù)測與實驗結(jié)果是否一致？如果不是，就有必要修改做法。

[ Last edited by cnlics on 2010-9-14 at 19:31 ]

贊一下

回復(fù)此樓

2樓2010-09-14 01:41:00

已閱回復(fù)此樓關(guān)注TA 給TA發(fā)消息送TA紅花 TA的回帖

cnlics

木蟲 (小有名氣)

應(yīng)助: 2 (幼兒園)
金幣: 3014.2
紅花: 4
帖子: 270
在線: 422.4小時
蟲號: 795158
注冊: 2009-06-16
性別: GG
專業(yè): 當(dāng)代宗教

搜索序列數(shù)據(jù)庫

分析任何新序列的第一步顯然是搜索序列數(shù)據(jù)庫以發(fā)現(xiàn)同源序列。這樣的搜索可以在任何地方或者在任何計算機上完成。而且，有許多WEB服務(wù)器可以進行此類搜索，可以輸入或粘貼序列到服務(wù)器上并交互式地接收結(jié)果。

序列搜索也有許多方法，目前最有名的是BLAST程序�？梢匀菀椎玫皆诒镜剡\行的版本（從 NCBI 或者 Washington University），也有許多的WEB頁面允許對多基因或蛋白質(zhì)序列的數(shù)據(jù)庫比較蛋白質(zhì)或DNA序列，僅舉幾個例子：
•National Center for Biotechnology Information (USA) Searches
•European Bioinformatics Institute (UK) Searches
•BLAST search through SBASE (domain database; ICGEB, Trieste)
•還有更多的站點

最近序列比較的重要進展是發(fā)展了gapped BLAST 和PSI-BLAST (position specific interated BLAST)，二者均使BLAST更敏感，后者通過選取一條搜索結(jié)果，建立模式（profile），然后用再它搜索數(shù)據(jù)庫尋找其他同源序列（這個過程可以一直重復(fù)到發(fā)現(xiàn)不了新的序列為止），可以探測進化距離非常遠(yuǎn)的同源序列。很重要的一點是，在利用下面章節(jié)方法之前，通過PSI-BLAST把蛋白質(zhì)序列和數(shù)據(jù)庫比較，找尋是否有已知結(jié)構(gòu)。
將一條序列和數(shù)據(jù)庫比較的其他方法有：
•FASTA軟件包 (William Pearson, University of Virginia, USA)
•SCANPS (Geoff Barton, European Bioinformatics Institute, UK)
•BLITZ (Compugen's fast Smith Waterman search)
•其他方法.

It is also possible to use multiple sequence information to perform more sensitive searches. Essentially this involves building a profile from some kind of multiple sequence alignment. A profile essentially gives a score for each type of amino acid at each position in the sequence, and generally makes searches more sentive. Tools for doing this include:
•PSI-BLAST (NCBI, Washington)
•ProfileScan Server (ISREC, Geneva)
•HMMER 隱馬氏模型（Sean Eddy， Washington University）
•Wise package （Ewan Birney， Sanger Centre；用于蛋白質(zhì)對DNA的比較）
•其他方法.

A different approach for incorporating multiple sequence information into a database search is to use a MOTIF. Instead of giving every amino acid some kind of score at every position in an alignment, a motif ignores all but the most invariant positions in an alignment, and just describes the key residues that are conserved and define the family. Sometimes this is called a "signature". For example, "H-[FW]-x-[LIVM]-x-G-x(5)-[LV]-H-x(3)-[DE]" describes a family of DNA binding proteins. It can be translated as "histidine, followed by either a phenylalanine or tryptophan, followed by an amino acid (x), followed by leucine, isoleucine, valine or methionine, followed by any amino acid (x), followed by glycine,... [etc.]".

PROSITE (ExPASy Geneva) contains a huge number of such patterns, and several sites allow you to search these data:
•ExPASy
•EBI

It is best to search a few different databases in order to find as many homologues as possible. A very important thing to do, and one which is sometimes overlooked, is to compare any new sequence to a database of sequences for which 3D structure information is available. Whether or not your sequence is homologous to a protein of known 3D structure is not obvious in the output from many searches of large sequence databases. Moreover, if the homology is weak, the similarity may not be apparent at all during the search through a larger database.

One last thing to remember is that one can save a lot of time by making use of pre-prepared protein alignments. Many of these alignments are hand edited by experts on the particular protein families, and thus represent probably the best alignment one can get given the data they contain (i.e. they are not always as up to date as the most recent sequence databases). These databases include:
•SMART (Oxford/EMBL)
•PFAM (Sanger Centre/Wash-U/Karolinska Intitutet)
•COGS (NCBI)
•PRINTS (UCL/Manchester)
•BLOCKS (Fred Hutchinson Cancer Research Centre, Seatle)
•SBASE (ICGEB, Trieste)

通常把蛋白質(zhì)序列和數(shù)據(jù)比較都有很多的方法，這些對于識別結(jié)構(gòu)域非常有用。

[ Last edited by cnlics on 2010-9-14 at 19:54 ]

贊一下

回復(fù)此樓

4樓2010-09-14 01:42:52

已閱回復(fù)此樓關(guān)注TA 給TA發(fā)消息送TA紅花 TA的回帖

cnlics

木蟲 (小有名氣)

應(yīng)助: 2 (幼兒園)
金幣: 3014.2
紅花: 4
帖子: 270
在線: 422.4小時
蟲號: 795158
注冊: 2009-06-16
性別: GG
專業(yè): 當(dāng)代宗教

確定結(jié)構(gòu)域

If you have a sequence of more than about 500 amino acids, you can be nearly certain that it will be divided into discrete functional domains. If possible, it is preferable to split such large proteins up and consider each domain separately. You can predict the locatation of domains in a few different ways. The methods below are given (approximately) from most to least confident.
• If homology to other sequences occurs only over a portion of the probe sequence and the other sequences are whole (i.e. not partial sequences), then this provides the strongest evidence for domain structure. You can either do database searches yourself or make use of well-curated, pre-defined databases of protein domains. Searches of these databases (see links below) will often assign domains easily.
o SMART (Oxford/EMBL)
o PFAM (Sanger Centre/Wash-U/Karolinska Intitutet)
o COGS (NCBI)
o PRINTS (UCL/Manchester)
o BLOCKS (Fred Hutchinson Cancer Research Centre, Seatle)
o SBASE (ICGEB, Trieste)
You can also find domain descriptions in the annotations in SWISSPROT.
• Regions of low-complexity often separate domains in multidomain proteins. Long stretches of repeated residues, particularly Proline, Glutamine, Serine or Threonine often indicate linker sequences and are usually a good place to split proteins into domains.
Low complexity regions can be defined using the program SEG which is generally available in most BLAST distributions or web servers (a version of SEG is also contained within the GCG suite of programs).
• Transmembrane segments are also very good dividing points, since they can easily separate extracellular from intracellular domains. There are many methods for predicting these segments, including:
o TMAP (EMBL)
o PredictProtein (EMBL/Columbia)
o TMHMM (CBS, Denmark)
o TMpred (Baylor College)
o DAS (Stockholm)
• Something else to consider are the presence of coiled-coils. These unusual structural features sometimes (but not always) indicate where proteins can be divided into domains. You can predict coiled coils at the COILS server or you can download the COILS program (recently re-written by me of all people; a version of SEG is also contained within the GCG suite of programs).
• Secondary structure prediction methods (see below) will often predict regions of proteins to have different protein structural classes. For example one region of sequence may be predicted to contain only lpha helices and another to contain only beta sheets. These can often, though not always, suggest likely domain structure (e.g. an all alpha domain and an all beta domain)
If you have separated a sequence into domains, then it is very important to repeat all the database searches and alignments using the domains separately. Searches with sequences containing several domains may not find all sub-homologies, particularly if the domains are abundent in the database (e.g. kinases, SH2 domains, etc.). There may also be "hidden" domains. For example if there is a stretch of 80 amino acids with few homologues nested in between a kinase and an SH2 domain, then you may miss matches found when searching the whole sequence against a database.
Anyway, here is my slide from the talk related to this subject:

贊一下

回復(fù)此樓

5樓2010-09-14 01:44:10

已閱回復(fù)此樓關(guān)注TA 給TA發(fā)消息送TA紅花 TA的回帖

查看全部 33 個回答

普通表情龍兔虎貓高級回復(fù) (可上傳附件)

最具人氣熱帖推薦 [查看全部]		作者	回/看	最后發(fā)表

[考研] 一志愿北京科技大學(xué)085601材料工程英一數(shù)二初試總分335求調(diào)劑 +6	雙馬尾痞老板2 2026-04-01	6/300	2026-04-01 23:33 by chaolymer
[考研] 339求調(diào)劑，想調(diào)回江蘇 +7	烤麥芽 2026-03-27	10/500	2026-04-01 21:35 by 495374996
[考研] 261求調(diào)劑 +3	明仔· 2026-04-01	3/150	2026-04-01 20:52 by cq2548
[考研] 085600，320分求調(diào)劑 +5	大饞小子 2026-04-01	6/300	2026-04-01 19:40 by 唐沐兒
[考研] 353求調(diào)劑 +4	拉鉤不許變 2026-04-01	4/200	2026-04-01 18:10 by 記事本2026
[考研] 349求調(diào)劑 +6	吃的不少 2026-04-01	6/300	2026-04-01 17:55 by JYD2011
[考研] 考研調(diào)劑 +11	Amber00 2026-03-31	11/550	2026-04-01 11:32 by wangjy2002
[考研] 土木304求調(diào)劑 +5	兔突突突， 2026-03-31	6/300	2026-04-01 09:37 by JourneyLucky
[考研] 求調(diào)劑，一志愿南京航空航天大學(xué) ，080500材料科學(xué)與工程學(xué)碩，總分289分 +10	@taotao 2026-03-29	10/500	2026-04-01 09:30 by oooqiao
[考研] 349求調(diào)劑 +6	zwjjjjjj 2026-03-31	6/300	2026-04-01 09:16 by JourneyLucky
[考研] 初試301，代碼085701環(huán)境工程，本碩一致，四六級已過，有二區(qū)一作，共發(fā)表5篇論文 +3	axibli 2026-04-01	3/150	2026-04-01 08:43 by i_cooler
[考研] 282求調(diào)劑不挑專業(yè) 求收留 +4	Yam. 2026-03-30	5/250	2026-03-31 14:41 by 王亮_大連醫(yī)科大
[考研] 286求調(diào)劑 +6	Faune 2026-03-30	6/300	2026-03-31 14:37 by jp9609
[考研] 求調(diào)劑 +8	11ggg 2026-03-30	8/400	2026-03-31 13:56 by nanaliuyun
[考研] 吉大生物學(xué)326分求調(diào)劑 +3	sunnyupup 2026-03-31	3/150	2026-03-31 09:28 by longlotian
[考研] 105500藥學(xué)求調(diào)劑，一志愿山東大學(xué)藥學(xué)，348分 +3	gr哈哈哈 2026-03-28	3/150	2026-03-30 18:56 by 源_2020
[考研] 0703化學(xué)求調(diào)劑 +6	丹青奶蓋 2026-03-26	8/400	2026-03-30 18:33 by 探123
[考研] 085600，材料與化工321分求調(diào)劑 +10	大饞小子 2026-03-28	10/500	2026-03-29 23:35 by 飛行日記西
[考研] 356求調(diào)劑 +3	gysy?s?a 2026-03-28	3/150	2026-03-29 00:33 by 544594351
[考研] 285求調(diào)劑 +4	AZMK 2026-03-27	7/350	2026-03-27 20:59 by AZMK

亭亭五月天在线观看,亭亭五月天在线观看,国产最新av一区二区,国产 高清 中文字幕,99re热久久亚洲综合精品成人,熟妇 一区二区三区,一级做a爰片性色毛片武则天,美女的骚穴视频播放,国产美女午夜免费视频

24小時熱門版塊排行榜

cnlics

[交流] 【分享】蛋白質(zhì)結(jié)構(gòu)預(yù)測流程 已有23人參與

» 收錄本帖的淘帖專輯推薦

» 本帖已獲得的紅花（最新10朵）

» 猜你喜歡

» 本主題相關(guān)價值貼推薦，對您同樣有幫助:

cnlics

cnlics

cnlics

cnlics

亭亭五月天在线观看,亭亭五月天在线观看,国产最新av一区二区,国产高清中文字幕,99re热久久亚洲综合精品成人,熟妇一区二区三区,一级做a爰片性色毛片武则天,美女的骚穴视频播放,国产美女午夜免费视频

[交流] 【分享】蛋白質(zhì)結(jié)構(gòu)預(yù)測流程已有23人參與