Support Vector Machine GPCR Subfamily Classification Results for gi|4502415|ref|NP_001707.1|

>gi|4502415|ref|NP_001707.1| Burkitt lymphoma receptor 1, isoform 1; C-X-C chemokine receptor type 5; monocyte-derived receptor 15 [Homo sapiens] MNYPLTLEMDLENLEDLFWELDRLDNYNDTSLVENHLCPATEGPLMASFKAVFVPVAYSLIFLLGVIGNV LVLVILERHRQTRSSTETFLFHLAVADLLLVFILPFAVAEGSVGWVLGTFLCKTVIALHKVNFYCSSLLL ACIAVDRYLAIVHAVHAYRHRRLLSIHITCGTIWLVGFLLALPEILFAKVSQGHHNNSLPRCTFSQENQA ETHAWFTSRFLYHVAGFLLPMLVMGWCYVGVVHRLRQAQRRPQRQKAVRVAILVTSIFFLCWSPYHIVIF LDTLARLKAVDNTCKLNGSLPVAITMCEFLGLAHCCLNPMLYTFAGVKFRSDLSRLLTKLGCTGPASLCQ LFPSWRRSSLSESENATSLTTF


Here is a list of Class A Rhodopsin like subfamilies with support vector machine models and the score this sequence received with respect to each model.

1.0342636 Peptide GPCRDB Peptide
  0.91875833 Chemokine GPCRDB Chemokine
   0.8824525 C-X-C Chemokine GPCRDB  C-X-C Chemokine
    0.76595616 C-X-C Chemokine type 5 GPCRDB    C-X-C Chemokine type 5
   -0.7944133 C-X-C Chemokine type 4
   -0.9087162 C-X-C Chemokine type 3
   -1.0424411 BONZO receptors (CXC6R)
  -0.6939169 C-C Chemokine
  -1.0959551 XC Chemokine
  -1.1562527 C-X3-C Chemokine
 -0.7805642Interleukin-8
 -0.9943843Galanin
 -1.0165837Angiotensin
 -1.0529681Thrombin
 -1.0550568Fmet-leu-phe
 -1.0551845C5a anaphylatoxin
 -1.0738173Tachykinin
 -1.0821599Endothelin
 -1.1072987Neuromedin U
 -1.1412753CCK
 -1.142992GPR37 / endothelin B-like
 -1.1465014Urotensin II
 -1.1508722Adrenomedullin (G10D)
 -1.1627636APJ like
 -1.1757075Melanocortin
 -1.1808827Bradykinin
 -1.1895512Chemokine receptor-like
 -1.2174425Bombesin
 -1.2534008Vasopressin-like
 -1.2796946Proteinase activated
 -1.2990526Neurotensin
 -1.3333042Orexin & neuropeptide FF
 -1.3514677Opioid
 -1.351838Somatostatin
 -1.4002843Neuropeptide Y
-0.8293954Viral
-0.9213619Nucleotide-like
-0.9786803Gonadotropin-releasing hormone
-1.0026479Class A Orphan/other
-1.0119247Prostanoid
-1.0239766Hormone protein
-1.0613081Thyrotropin-releasing hormone & Secretagogue
-1.0692434Platelet activating factor
-1.1289392Cannabis
-1.1424971Melatonin
-1.1993377Olfactory
-1.2437484(Rhod)opsin
-1.3433423Lysosphingolipid & LPA (EDG)
-1.63884Amine






HOW TO INTERPRET THE SCORES:

Subfamily Scoring:

The support vector machine for each subfamily is trained to score positive examples (members of the subfamily) as 1.0 and negative examples (non-members) as -1.0

If this sequence receives a negative score with respect to a subfamily, it is probably not a member of the subfamily.

If this sequence receives a positive score with respect to a subfamily, it is probably in the subfamily.

In general, the score's distance from zero gives you the confidence level of the prediction. Positive scores greater than +1.0 mean a classifier has strongly accepted your sequence. Negative scores less than -1.0 mean a classifier has strongly rejected your sequence. For a more detailed evaluation of scores and confidence levels, take a look at Classification Statistics.
SOME ADDITIONAL INFORMATION:

There were 19 Class B Secretin like and 11 Class C Metabotropic glutamate / pheromone and 2 Class D Fungal pheromone and 5 Frizzled/Smoothened family and 4 Nematode chemoreceptors and 4 Vomeronasal receptors (V1R & V3R) and 97 Class A Rhodopsin like subfamilies whose classifiers were not run, because the families or subfamilies that contain them were rejected by classifiers higher in the hierarchy. For example, if the Amine receptor classifier scores this sequence negatively, it will not be checked with respect to the subfamilies of Amine receptors (Histamine, Serotonin, Dopamine, Octopamine and Acetylcholine (muscarinic) receptors).

Not all subfamilies have models yet. This sequence may be in a subfamily that has not yet been modeled.


Please cite: R. Karchin, K. Karplus and D. Haussler "Classifying G-Protein Coupled Receptors with Support Vector Machines" Bioinformatics 2002 in press [postscript, pdf]