Current Opinion in Biotechnology 2000,
Biological function made crystal clear — annotation of hypothetical proteins via structural genomics
该课题组通过解析脑膜炎菌(Haemophilus influenzae)的50个假想蛋白的结构来推断其功能。在基因组序列中发现一些ORF在不同物种中保守存在,然而找不到与其相关的已知结构和功能的蛋白,因此视为假想蛋白(hypothetical protein),对应的基因被称为‘orphan’ genes。
从1743个ORF中选出65个靶目标。选择的标准是:1. 功能未知。2. 可溶(通过TopPhred预测少于3个跨膜区);然后排除可能预测错误的ORF,标准是在至少3种微生物中都存在;最后一步是用BLAST/BEAUTY、FASTA搜索PROSITE、Swiss Prot、GeneQuiz。每个月要对PDB搜索一次,看看有没有新进展。
从克隆到结构解析的过程:
克隆与表达是常规的,在大肠杆菌中表达,每个ORF都用3种载体,分别用Native和His-Tag的形式,发现25%不能或少量表达。
结晶方面是用机器人,先快速筛选,如不能找到合适条件再用更多条件试。pH2-10,温度6-35。找到合适的条件再优化。得到晶体后先进行初步衍射试验,加防冻剂等,然后用同步辐射加速器进行X-ray晶体衍射。采用多波长反常衍射方法(MAD)。
NMR的优点是在蛋白在可溶状态下确定结构,然而它只能观察小蛋白(<30kD),并且要高度可溶(毫摩尔浓度)。
下面是如何从结构推断功能:蛋白质的结构提供了直接和间接的线索来推断其功能,采用以下四个步骤:
Case 1: the protein has a fold that has been seen before 该蛋白具有已知的折叠(fold)如该折叠与一种或几种生物学功能相联系,则可以对该蛋白的功能做一评估。
Case 2: the protein is an enzyme 该蛋白是一种酶许多酶有着相似的催化机制,如Ser–His–Asp这样的three-dimensional catalytic motifs, 可以搜索新结构中是否有这样的motif.
Case 3: the protein binds one or more small-molecule ligands 该蛋白与一个或多个小分子配体结合。结合位点几乎总是蛋白表面最大的凹陷处,可以通过计算机自动识别。这也是以结构为基础的药物设计的靶标。
Case 4: the protein interacts with other macromolecules 该蛋白与其他大分子相互作用根据相互作用的性质,有5种方法来确定该蛋白是否与其他大分子相互作用。 First, in cases where electrostatics plays a major role in binding, such as when a protein associates with RNA or DNA, mapping of the surface potential can be an effective technique . 表面势能。 Second, sites of tight association with other proteins may be identified by analyses of surface composition. Third, where a large family of sequences are available, mapping the extent of conservation of surface residues provides a means of identifying interaction sites. Fourth, three new genome-scale non-structure-based methods hold promise for providing clues to identifying interacting proteins . Hypotheses generated by these methods may be tested structurally by protein–protein docking methods that search for specific binding sites . Fifth, it has also been suggested that interaction sites can also be identified by analyses of the correlation of sequence changes between pairs of proteins across many species. These cases can also be clarified using protein–protein docking methods.
通过各种方法预测其功能后再用相应的实验来证实。
有时配体会与蛋白共纯化,共结晶,这会提供某种线索。如某蛋白结合有ATP,则该蛋白可能是ATP酶或是由ATP调节的分子开关。