Inspecting number of PTMs with annotation information available

Inspecting number of PTMs with annotation information available#

As described in Running PTM-POSE section, PTM-POSE provides various options for annotating functional information for PTMs, coming from various databases. However, PTM functional information is inherently sparse, and so most annotations will only provide information on a handful of PTMs. For this reason, it can be useful to probe how many PTMsTo better understand the types of annotations that are available, as well as the number of PTMs that have an annotation of that type. This can be done using the analyze function in PTM-POSE.

Note: This examples assumes that you have already run the PTM-POSE pipeline and have at annotated PTMs with at least one layer of information.

from ptm_pose import analyze
from ptm_pose import plots as pose_plots
import pandas as pd

# Load spliced ptm and altered flank data
spliced_ptms = pd.read_csv('spliced_ptms.csv')
altered_flanks = pd.read_csv('altered_flanks.csv')

pose_plots.show_available_annotations(spliced_ptms)
plot num annotations

As you can, see there are only a few PTMs from each annotation that have available information, with the most being 9 PTMs out of the 184 differentially included sites having been associated with a biological process. While this this should be taken into consideration when analyzing these annotations, we can glean some useful information and identify potentially interesting proteins/sites to dig deeper into. Let’s look at the PTMs that have been associated with a biological process:

ptms_with_annotation, annotation_counts = analyze.get_ptm_annotations(spliced_ptms, database = "PhosphoSitePlus", annotation_type = 'Process')
print('Specific PTMs with annotation:')
ptms_with_annotation
Specific PTMs with annotation:
Gene UniProtKB Accession Residue PTM Position in Canonical Isoform Modification Class PSP:ON_PROCESS dPSI Significance
0 BCAR1 P56945 Y 267.0 Phosphorylation cell growth, induced -0.07 0.0458775672499
1 BCAR1 P56945 Y 287.0 Phosphorylation cell growth, induced -0.07 0.0458775672499
2 BIN1 O00499 T 348.0 Phosphorylation signaling pathway regulation -0.112 0.0233903490744
3 CEACAM1 P13688 S 461.0 Phosphorylation apoptosis, altered 0.525 1.73943268451e-09
4 CTTN Q14247 K 272.0 Acetylation cell motility, inhibited 0.09 0.0355211287599
5 CTTN Q14247 S 298.0 Phosphorylation cell motility, altered; cytoskeletal reorganiz... 0.09 0.0355211287599
6 SPHK2 Q9NRA0 S 387.0 Phosphorylation cell motility, altered 0.253 0.0129400018182
7 SPHK2 Q9NRA0 T 614.0 Phosphorylation cell motility, altered 0.253 0.0129400018182
8 TSC2 P49815 S 981.0 Phosphorylation carcinogenesis, inhibited; cell growth, inhibi... -0.219 4.18472157275e-05
9 YAP1 P46937 K 342.0 Ubiquitination carcinogenesis, altered -0.161;-0.188 0.000211254197372;4.17884655686e-07


We can also look at the number of PTMs associated with each annotation:

print('Number of PTMs associated with each annotation:')
annotation_counts
Number of PTMs associated with each annotation:

PSP:ON_PROCESS
cell motility, altered          3
signaling pathway regulation    2
cell growth, induced            2
apoptosis, altered              1
cell motility, inhibited        1
cytoskeletal reorganization     1
cell adhesion, inhibited        1
carcinogenesis, inhibited       1
cell growth, inhibited          1
autophagy, inhibited            1
carcinogenesis, altered         1
Name: count, dtype: int64

Total running time of the script: (0 minutes 0.159 seconds)

Gallery generated by Sphinx-Gallery