Support Vector Machine GPCR Subfamily Classification Results for gi|7305025|ref|NP_038475.1|

>gi|7305025|ref|NP_038475.1| egf-like module containing, mucin-like, hormone receptor-like sequence 2 [Homo sapiens] MGGRVFLVFLAFCVWLTLPGAETQDSRGCARWCPQDSSCVNATACRCNPGFSSFSEIITTPMETCDDINE CATLSKVSCGKFSDCWNTEGSYDCVCSPGYEPVSGAKTFKNESENTCQDVDECQQNPRLCKSYGTCVNTL GSYTCQCLPGFKLKPEDPKLCTDVNECTSGQNPCHSSTHCLNNVGSYQCRCRPGWQPIPGSPNGPNNTVC EDVDECSSGQHQCDSSTVCFNTVGSYSCRCRPGWKPRHGIPNNQKDTVCEDMTFSTWTPPPGVHSQTLSR FFDKVQDLGRDYKPGLANNTIQSILQALDELLEAPGDLETLPRLQQHCVASHLLDGLEDVLRGLSKNLSN GLLNFSYPAGTELSLEVQKQVDRSVTLRQNQAVMQLDWNQAQKSGDPGPSVVGLVSIPGMGKLLAEAPLV LEPEKQMLLHETHQGLLQDGSPILLSDVISAFLSNNDTQNLSSPVTFTFSHRSVIPRQKVLCVFWEHGQN GCGHWATTGCSTIGTRDTSTICRCTHLSSFAVLMAHYDVQEEDPVLTVITYMGLSVSLLCLLLAALTFLL CKAIQNTSTSLHLQLSLCLFLAHLLFLVAIDQTGHKVLCSIIAGTLHYLYLATFTWMLLEALYLFLTARN LTVVNYSSINRFMKKLMFPVGYGVPAVTVAISAASRPHLYGTPSRCWLQPEKGFIWGFLGPVCAIFSVNL VLFLVTLWILKNRLSSLNSEVSTLRNTRMLAFKATAQLFILGCTWCLGILQVGPAARVMAYLFTIINSLQ GVFIFLVYCLLSQQVREQYGKWSKGIRKLKTESEMHTLSSSAKADTSKPSTVN


Here is a list of Class B Secretin like subfamilies with support vector machine models and the score this sequence received with respect to each model.

1.0213706 EMR1 GPCRDB EMR1
-0.9978248Methuselah-like proteins (MTH)
-1.0055819Latrophilin
-1.03209Vasoactive intestinal polypeptide
-1.0334659Brain-specific angiogenesis inhibitor (BAI)
-1.0437756Gastric inhibitory peptide
-1.072373Growth hormone-releasing hormone
-1.0755323PACAP
-1.0822017Class B orphan/other
-1.129915Glucagon
-1.2025776Corticotropin releasing factor
-1.2441665Parathyroid hormone
-1.2946649Calcitonin
-1.3207667Diuretic hormone
-1.3247771Secretin






HOW TO INTERPRET THE SCORES:

Subfamily Scoring:

The support vector machine for each subfamily is trained to score positive examples (members of the subfamily) as 1.0 and negative examples (non-members) as -1.0

If this sequence receives a negative score with respect to a subfamily, it is probably not a member of the subfamily.

If this sequence receives a positive score with respect to a subfamily, it is probably in the subfamily.

In general, the score's distance from zero gives you the confidence level of the prediction. Positive scores greater than +1.0 mean a classifier has strongly accepted your sequence. Negative scores less than -1.0 mean a classifier has strongly rejected your sequence. For a more detailed evaluation of scores and confidence levels, take a look at Classification Statistics.
SOME ADDITIONAL INFORMATION:

There were 199 Class A Rhodopsin like and 11 Class C Metabotropic glutamate / pheromone and 2 Class D Fungal pheromone and 5 Frizzled/Smoothened family and 4 Nematode chemoreceptors and 4 Vomeronasal receptors (V1R & V3R) and 4 Class B Secretin like subfamilies whose classifiers were not run, because the families or subfamilies that contain them were rejected by classifiers higher in the hierarchy. For example, if the Amine receptor classifier scores this sequence negatively, it will not be checked with respect to the subfamilies of Amine receptors (Histamine, Serotonin, Dopamine, Octopamine and Acetylcholine (muscarinic) receptors).

Not all subfamilies have models yet. This sequence may be in a subfamily that has not yet been modeled.


Please cite: R. Karchin, K. Karplus and D. Haussler "Classifying G-Protein Coupled Receptors with Support Vector Machines" Bioinformatics 2002 in press [postscript, pdf]