Bio.PDB.Polypeptide 模块

多肽相关类（构建和表示）。

包含多个链的简单示例，

>>> from Bio.PDB.PDBParser import PDBParser
>>> from Bio.PDB.Polypeptide import PPBuilder
>>> structure = PDBParser().get_structure('2BEG', 'PDB/2BEG.pdb')
>>> ppb=PPBuilder()
>>> for pp in ppb.build_peptides(structure):
...     print(pp.get_sequence())
LVFFAEDVGSNKGAIIGLMVGGVVIA
LVFFAEDVGSNKGAIIGLMVGGVVIA
LVFFAEDVGSNKGAIIGLMVGGVVIA
LVFFAEDVGSNKGAIIGLMVGGVVIA
LVFFAEDVGSNKGAIIGLMVGGVVIA

使用 PDB 文件中的 HETATM 行的非标准氨基酸示例，在这种情况下为硒代蛋氨酸 (MSE)

>>> from Bio.PDB.PDBParser import PDBParser
>>> from Bio.PDB.Polypeptide import PPBuilder
>>> structure = PDBParser().get_structure('1A8O', 'PDB/1A8O.pdb')
>>> ppb=PPBuilder()
>>> for pp in ppb.build_peptides(structure):
...     print(pp.get_sequence())
DIRQGPKEPFRDYVDRFYKTLRAEQASQEVKNW
TETLLVQNANPDCKTILKALGPGATLEE
TACQG

如果您愿意，可以在肽中包含非标准氨基酸

>>> for pp in ppb.build_peptides(structure, aa_only=False):
...     print(pp.get_sequence())
...     print("%s %s" % (pp.get_sequence()[0], pp[0].get_resname()))
...     print("%s %s" % (pp.get_sequence()[-7], pp[-7].get_resname()))
...     print("%s %s" % (pp.get_sequence()[-6], pp[-6].get_resname()))
MDIRQGPKEPFRDYVDRFYKTLRAEQASQEVKNWMTETLLVQNANPDCKTILKALGPGATLEEMMTACQG
M MSE
M MSE
M MSE

在这种情况下，硒代蛋氨酸（第一个以及最后第七和第六个残基）已通过 get_sequence 方法显示为 M（蛋氨酸）。

Bio.PDB.Polypeptide.index_to_one(index)

索引到相应的单字母氨基酸名称。

>>> index_to_one(0)
'A'
>>> index_to_one(19)
'Y'

Bio.PDB.Polypeptide.one_to_index(s)

单字母代码到索引。

>>> one_to_index('A')
0
>>> one_to_index('Y')
19

Bio.PDB.Polypeptide.index_to_three(i)

索引到相应的三个字母氨基酸名称。

>>> index_to_three(0)
'ALA'
>>> index_to_three(19)
'TYR'

Bio.PDB.Polypeptide.three_to_index(s)

三个字母代码到索引。

>>> three_to_index('ALA')
0
>>> three_to_index('TYR')
19

Bio.PDB.Polypeptide.is_aa(residue, standard=False)

如果残基对象/字符串是氨基酸，则返回 True。

参数：:

residue (L{Residue} 或 string) – L{Residue} 对象或三个字母的氨基酸代码
standard (boolean) – 检查 20 个氨基酸的标志（默认值为 false）

>>> is_aa('ALA')
True

已知修饰氨基酸的三字母代码受支持，

>>> is_aa('FME')
True
>>> is_aa('FME', standard=True)
False

Bio.PDB.Polypeptide.is_nucleic(residue, standard=False)

如果残基对象/字符串是核酸，则返回 True。

参数：:

residue (L{Residue} 或 string) – L{Residue} 对象或三个字母代码
standard (boolean) – 用于检查 8 个（DNA + RNA）规范碱基的标志。默认值为 False。

>>> is_nucleic('DA ')
True

>>> is_nucleic('A  ')
True

已知修饰核苷酸的三字母代码受支持，

>>> is_nucleic('A2L')
True
>>> is_nucleic('A2L', standard=True)
False

class Bio.PDB.Polypeptide.Polypeptide(iterable=(), /)

基类：list

多肽只是一个 L{Residue} 对象列表。

get_ca_list()

获取多肽中 C-alpha 原子的列表。

返回：:: C-alpha 原子列表
返回类型：:: [L{Atom}, L{Atom}, …]

get_phi_psi_list(): 返回 phi/psi 二面角列表。

get_tau_list(): 所有 4 个连续 Calpha 原子的 tau 扭转角列表。

get_theta_list(): 所有 3 个连续 Calpha 原子的 theta 角列表。

get_sequence()

将 AA 序列作为 Seq 对象返回。

返回：:: 多肽序列
返回类型：:: L{Seq}

__repr__()

返回多肽的字符串表示形式。

返回 <Polypeptide start=START end=END>，其中 START 和 END 是外部残基的序列标识符。

class Bio.PDB.Polypeptide.CaPPBuilder(radius=4.3)

基类：_PPBuilder

使用 CA–CA 距离查找多肽。

__init__(radius=4.3): 初始化类。

class Bio.PDB.Polypeptide.PPBuilder(radius=1.8)

基类：_PPBuilder

使用 C–N 距离查找多肽。

__init__(radius=1.8): 初始化类。