Support Vector Machine GPCR Subfamily Classification Results for NP_005963

>NP_005963
MNTSHLLALLLPKSPQGENRSKPLGTPYNFSEHCQDSVDVMVFIVTSYSIETVVGVLGNLCLMCVTVRQK
EKANVTNLLIANLAFSDFLMCLLCQPLTAVYTIMDYWIFGETLCKMSAFIQCMSVTVSILSLVLVALERH
QLIINPTGWKPSISQAYLGIVLIWVIACVLSLPFLANSILENVFHKNHSKALEFLADKVVCTESWPLAHH
RTIYTTFLLLFQYCLPLGFILVCYARIYRRLQRQGRVFHKGTYSLRAGHMKQVNVVLVVMVVAFAVLWLP
LHVFNSLEDWHHEAIPICHGNLIFLVCHLLAMASTCVNPFIYGFLNTNFKKEIKALVLTCQQSAPLEESE
HLPLSTVHTEVSKGSLRLSGRSNPI



The Class A Rhodopsin like subfamilies that have support vector machine models are listed below, each with the score this sequence received from that model. Indentation reflects the subfamily hierarchy.

Score        Subfamily
 1.1184375   Peptide
 1.029154      Neuropeptide Y
 1.0564485       Neuropeptide Y type 4
-0.97961855      Neuropeptide Y / peptide YY
-1.0058744       Neuropeptide Y other
-1.0122416       Neuropeptide Y type 5
-1.0522074       Neuropeptide Y type 2
-1.0831356       Neuropeptide Y type 1
-1.1769111       Neuropeptide Y type 6
-0.9367496     Tachykinin
-0.98866045    Galanin
-1.0677159     CCK
-1.0719771     Endothelin
-1.0898963     Bombesin
-1.0960164     Orexin & neuropeptide FF
-1.1006536     Neuromedin U
-1.111029      Chemokine
-1.11762       GPR37 / endothelin B-like
-1.1480564     Thrombin
-1.154386      Urotensin II
-1.1785784     Fmet-leu-phe
-1.2192647     Proteinase activated
-1.2196621     Melanocortin
-1.224143      Interleukin-8
-1.2449771     Adrenomedullin (G10D)
-1.2631245     Angiotensin
-1.2847996     C5a anaphylatoxin
-1.301489      Vasopressin-like
-1.301893      Chemokine receptor-like
-1.3095837     APJ like
-1.3107557     Somatostatin
-1.3117136     Neurotensin
-1.3399863     Bradykinin
-1.3532531     Opioid
-0.95062363  Gonadotropin-releasing hormone
-1.0089756   Class A Orphan/other
-1.0704085   Thyrotropin-releasing hormone & Secretagogue
-1.0742856   Hormone protein
-1.0774887   Prostanoid
-1.089304    Melatonin
-1.1134927   Cannabis
-1.1980278   (Rhod)opsin
-1.1997341   Platelet activating factor
-1.2109182   Viral
-1.2172604   Amine
-1.3195257   Lysosphingolipid & LPA (EDG)
-1.3292975   Olfactory
-1.4761937   Nucleotide-like
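
For readers who want to process this listing with a script, the following is a minimal sketch of how one score line could be split into a number and a subfamily name. It is not part of the classification server. The function name `parse_score_line` and the regular expression are illustrative assumptions, and the pattern assumes that no subfamily name begins with a digit.

```python
import re

# Assumed line shape: optional indentation, a signed decimal score, then the
# subfamily name, with or without whitespace between the two.
# Caveat: a name that starts with a digit would be absorbed into the score.
SCORE_LINE = re.compile(r"^\s*(-?\d+\.\d+)\s*(\S.*)$")


def parse_score_line(line):
    """Return (score, name) for one line of the listing, or None if no match."""
    match = SCORE_LINE.match(line)
    if match is None:
        return None
    return float(match.group(1)), match.group(2).strip()


# Example lines in the shape used by the listing above.
lines = [
    " -0.9367496Tachykinin",
    " -1.2847996C5a anaphylatoxin",
    "-1.2172604 Amine",
]
parsed = [parse_score_line(line) for line in lines]
```

Lines that do not begin with a score, such as headings or blank lines, yield `None` and can be filtered out.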

HOW TO INTERPRET THE SCORES:

Subfamily Scoring:

The support vector machine for each subfamily is trained to score positive examples (members of the subfamily) as 1.0 and negative examples (non-members) as -1.0.

If this sequence receives a negative score with respect to a subfamily, it is probably not a member of the subfamily.

If this sequence receives a positive score with respect to a subfamily, it is probably in the subfamily.

In general, the score's distance from zero gives you the confidence level of the prediction. Positive scores greater than +1.0 mean a classifier has strongly accepted your sequence. Negative scores less than -1.0 mean a classifier has strongly rejected your sequence. For a more detailed evaluation of scores and confidence levels, take a look at Classification Statistics.
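
The reading rules above can be sketched as a small function. This is only an illustration of the thresholds stated in the text (0 and plus or minus 1.0). The function name `interpret_score` and the label strings are assumptions, and the text does not say how a score of exactly 0 should be read, so the sketch reports it as undecided.

```python
def interpret_score(score):
    """Map a subfamily SVM score to the verbal reading described in the text."""
    if score > 1.0:
        return "strongly accepted"
    if score > 0.0:
        return "probably a member"
    if score < -1.0:
        return "strongly rejected"
    if score < 0.0:
        return "probably not a member"
    # Exactly zero: not covered by the text, treated here as undecided.
    return "undecided"


# Scores taken from the listing for this sequence.
readings = {
    "Neuropeptide Y type 4": interpret_score(1.0564485),
    "Tachykinin": interpret_score(-0.9367496),
    "Amine": interpret_score(-1.2172604),
}
```

These labels are coarse; the Classification Statistics mentioned above remain the reference for actual confidence levels.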

SOME ADDITIONAL INFORMATION:

The classifiers for the following subfamilies were not run, because the families or subfamilies that contain them were rejected by classifiers higher in the hierarchy: 19 Class B Secretin like, 11 Class C Metabotropic glutamate / pheromone, 2 Class D Fungal pheromone, 5 Frizzled/Smoothened family, 4 Nematode chemoreceptors, 4 Vomeronasal receptors (V1R & V3R), and 81 Class A Rhodopsin like subfamilies. For example, if the Amine receptor classifier scores this sequence negatively, the sequence will not be checked against the subfamilies of Amine receptors (Histamine, Serotonin, Dopamine, Octopamine and Acetylcholine (muscarinic) receptors).
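
The gating described above, where child classifiers run only under a parent that scored positively, can be sketched as follows. This is a simplified illustration and not the server's code. The `HIERARCHY` fragment, the lookup table standing in for the SVMs, the unscored `"Class A"` root, and the function names are all assumptions made for the example. The scores are copied from the listing for this sequence.

```python
# Hypothetical fragment of the hierarchy: parent -> child subfamilies.
HIERARCHY = {
    "Class A": ["Peptide", "Amine"],
    "Peptide": ["Neuropeptide Y", "Tachykinin"],
    "Neuropeptide Y": ["Neuropeptide Y type 4", "Neuropeptide Y type 1"],
    "Amine": ["Histamine", "Serotonin", "Dopamine"],
}

# Stand-in for running each SVM: scores reported for this sequence.
SCORES = {
    "Peptide": 1.1184375,
    "Amine": -1.2172604,
    "Neuropeptide Y": 1.029154,
    "Tachykinin": -0.9367496,
    "Neuropeptide Y type 4": 1.0564485,
    "Neuropeptide Y type 1": -1.0831356,
}


def descendants(name):
    """List every subfamily below `name` in the hierarchy."""
    found = []
    for child in HIERARCHY.get(name, []):
        found.append(child)
        found.extend(descendants(child))
    return found


def run_hierarchy(parent, score_fn, evaluated=None, skipped=None):
    """Score the children of `parent`; descend only where the score is positive."""
    evaluated = {} if evaluated is None else evaluated
    skipped = [] if skipped is None else skipped
    for child in HIERARCHY.get(parent, []):
        score = score_fn(child)
        evaluated[child] = score
        if score > 0:
            run_hierarchy(child, score_fn, evaluated, skipped)
        else:
            # Rejected here, so nothing below this node is run.
            skipped.extend(descendants(child))
    return evaluated, skipped


evaluated, skipped = run_hierarchy("Class A", SCORES.__getitem__)
```

In this fragment the Amine classifier rejects the sequence, so its three listed children end up in `skipped` without ever being scored, while the Peptide branch is followed down to the Neuropeptide Y types.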

Not all subfamilies have models yet. This sequence may be in a subfamily that has not yet been modeled.


Please cite: R. Karchin, K. Karplus and D. Haussler, "Classifying G-Protein Coupled Receptors with Support Vector Machines", Bioinformatics, 2002 (in press).