• PubMed² - Experimenting with biomedical literature for tablets and smart phones


    We're still playing around with data visualisation, and this week's experiment focuses on the scientific literature. It is designed with tablet devices (such as the iPad or the Nexus 7) and smartphones in mind. The application is a rethinking of PubMed's search interface, and you can play with it at http://pubmed-square.org/

    Let us know in the comments what you think.

  • Masters Project - Ion-channel structural pharmacology


    We have a position in the group in the area of ion-channel structural pharmacology - mapping known ion-channel modulators to sequences and binding sites. This will be in partnership with Pfizer, and the role will involve time spent both at the EBI and at Pfizer's labs in the Cambridge UK area - so a great opportunity to pick up some industrial experience.

    If you are interested, please get in touch by December 15th 2012, when we will shortlist candidates for interview.

  • Random Notes on Open Drug Discovery/Data Sharing: Part 1


    There are some fantastic initiatives in Open Drug Discovery going on at the moment. I, for one, am convinced that we are on the cusp of a large structural change in drug discovery and, as at the beginning of all revolutions, the future is not clear, and we are all a little bit excited and nervous at the same time. One of the commonly quoted benefits of an Open strategy is that it avoids duplication: with no repetition, other researchers are free to explore alternative approaches, so you get to the goal faster and cheaper. There, you've just read it, and it's quite seductive, isn't it?


    I've never quite bought this "avoid duplication" argument, for the following three reasons. (I should declare my political/philosophical hand here: I have a very deep-rooted empathy with the concept of The Free Market. Not the goofy, fudged form that we've had in Western economies for some time, but that really is a different story, for another time.)

    1) Scientists are not perfect and they mess things up now and then. The "no duplication" strategy places a lot of weight on the capabilities of a single group, who may not make the best decisions or have the best approach to data analysis, design, etc. There is a lot of discussion in the literature at the moment about the non-repeatability of key pharmacology data; not having several parallel attempts at a problem seems a little rash given the probably high likelihood of individual failure. If an individual group has a probability of 0.6 of getting something done within a given time and with given funding, then two independent groups working in parallel (each with the same probability of success) have a probability of 1 - 0.4 × 0.4 = 0.84 that at least one of them gets it done. Simples.

    2) Competition is well established to be one of the major drivers of rapid completion in almost all endeavours of life; if you have someone breathing down your neck, potentially scooping you on a paper, you think in a different way, and tend to stay focussed on the task in hand. Given a finite time to complete a piece of work with preplanned and coordinated deliverables, the work miraculously fills the time and funding available.

    3) Who will take the decisions over non-duplication, effectively saying that you will not work on this compound series and another group will? And will people abide by those decisions? We all know that grant committees are useless (unless we are on them of course), and without a lot of process transparency, things could rapidly descend into slow chaos and confusion.
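
    The arithmetic in point 1 can be sketched in a few lines of Python. This is only an illustration: it assumes the groups succeed or fail independently and all share the same chance of success, and the function name is made up for this example.

```python
def p_at_least_one_success(p_single: float, n_groups: int) -> float:
    """Probability that at least one of n_groups independent groups succeeds.

    Assumes every group has the same success probability p_single and that
    the outcomes of different groups are independent of each other.
    """
    if not 0.0 <= p_single <= 1.0:
        raise ValueError("p_single must be between 0 and 1")
    if n_groups < 0:
        raise ValueError("n_groups must not be negative")
    # All groups fail with probability (1 - p)^n; take the complement.
    return 1.0 - (1.0 - p_single) ** n_groups


if __name__ == "__main__":
    for n in (1, 2, 3):
        print(n, round(p_at_least_one_success(0.6, n), 3))
```

    With a single-group probability of 0.6, two parallel groups give 0.84, as quoted in point 1. If the groups share methods or data, their failures are probably correlated and the real gain would be smaller.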


    However, I think the arguments for rapid data sharing are very very strong, primarily because they increase liquidity and transparency in the market, and allow market participants to take more rational decisions on the allocation of their resources (individual labs and funders). For me this is the biggest single reason for data sharing (i.e. it actually increases competition, not decreases it). The Free Market of Knowledge in Drug Discovery will drive participants to their best composite roles, based on their abilities.

  • Paper: Cheminformatics - Communications of the ACM


    Here is a review article on cheminformatics, written as an orientation piece for people from a computational sciences background.

    %T Cheminformatics
    %A J.K. Wegner
    %A A. Sterling
    %A R. Guha
    %A A. Bender
    %A J.-L. Faulon
    %A J. Hastings
    %A N. O'Boyle
    %A J. Overington
    %A H. Van Vlijmen
    %A E. Willighagen 
    %J Communications of the ACM
    %V 55
    %I 11
    %P 65-75
    %O DOI:10.1145/2366316.2366334
    

  • GPCR Structure: Rat neurotensin receptor


    Yet another GPCR structure - PDBe code 4grv, the rat neurotensin receptor. The link to the paper is here.

    Update - on the first revision of the post, I had word blindness and listed the species as human - it is in fact rat.

    %T Structure of the agonist-bound neurotensin receptor
    %A J.F. White
    %A N. Noinaj
    %A Y. Shibata
    %A J. Love
    %A B. Kloss
    %A F. Xu
    %A J. Gvozdenovic-Jeremic
    %A P. Shah
    %A J. Shiloach
    %A C.G. Tate
    %A R. Grisshammer
    %J Nature 
    %V 490
    %P 508–513
    %D 2012 
    %O doi:10.1038/nature11558
    
    1. 3uon - human muscarinic M2 receptor 
    2. 4daj - rat muscarinic M3 receptor 
    3. 3rze - human histamine H1 receptor
    4. 2rh1 - human beta-2 adrenergic receptor 
    5. 2vt4 - turkey beta-1 adrenergic receptor 
    6. 3pbl - human dopamine D3 receptor
    7. 2ydv - human adenosine A2a receptor 
    8. 3v2w - human sphingosine-1-phosphate receptor 
    9. 4djh - human kappa opioid receptor 
    10. 4dkl - mouse mu opioid receptor 
    11. 4ej4 - mouse delta opioid receptor
    12. 4ea3 - human nociceptin receptor
    13. 4grv - rat neurotensin receptor
    14. 3odu - human CXCR4 receptor 
    15. 2lnl - human CXCR1 receptor (NMR)
    16. 2i35 - bovine rhodopsin 
    17. 2z73 - squid rhodopsin
                               10        20        30        40        50        60        70    
    3uon   (  20 )                                             tfevvfivlvagslSlvTiigNilVmvSIkvnrh
    4dajA  (  64 )                                             iwqvvfiafltgflAlvTiigNilVivAFkvnkq
    3rze   (  28 )                                                 mplvvvlsticlvTvglNllVlyAvrserk
    2rh1   (  29 )                                            devwvvgmgivmslivlaIvfgNvlVitAIakfer
    2vt4A  (  40 )                                               weagmsllmalVvllIvagNvlViaAigstqr
    3pblA  (  32 )                                                   yalsYcalilaIvfgNglVcmAVlkera
    2ydv   (   3 )                                             imgssvYitvElaiavlAilgNvlVcwAvwlnsn
    3v2w   (  17 )           sdyvnydIIvrHYnyTgklnisa                ltsvvfiliCcfIileNifvlltiwktkk
    4djhA  (  55 )                                            spaipviitavysvvfvvGlvgNslVmfVIirytk
    4dkl   (  65 )                                             mvtaitimalYsiVcvvGlfgNflvmyvIvrytk
    4ej4   (  41 )                                        rsasslalaiaitalYsavcavGllgNvlvmfgIvrytk
    4ea3A  (  47 )                                            plglkvtIvglYlavcvgGllgNclvmyVIlrhtk
    4grvA  (  52 )                                    nsdldVnTdiyskvlvtaiYlalfvvGtvgNsvtlftlark s
    3oduA  (  27 )            pçfre-------------------------enanfnkiflptiYsiIfltGivgNglvilvMgyqkk
    2lnl   (  29 )            pÇmle--------------------------tetLnkYvviiayalvFllsllgNslvMlvilysrv
    1u19A  (   1 )            mnGtegpnfyVPfsnktgvVrsPFeapQyyLaepwqFsmlAayMflLimlGfpiNflTlyVTvqHkk
    2z73A  (   9 )         etwwyNpsIvVhpHWref--------------dqvpdavYyslGifIgiCgiiGcggNgiViyLFtktks
                                                                  aaaaaaaaaaaaaaaaaaaaaaaaaaaa   
    
                          80        90        100       110       120       130       140       150 
    3uon   (  54 )    LqtvnnyflfSLAcADliiGvfSMnlytlytvi--gyWplgpvvÇdlWlalDYvVSNAsVmNLliiSfdryfcvt
    4dajA  (  98 )    LktvnnyFllSLAcADliIGviSMnlFttyiim--nrWalgnlaÇdlwLSiDYvASNAsVmNLlvISfDryfsit
    3rze   (  58 )    LhtvGnlYIvsLSvADliVGavVMpmnilyllm--skwsLgrplÇlfWLSmDYVASTASIfSVfiLCiDryrsvq
    2rh1   (  64 )    LqtvtnyFItsLAcADlvMGlaVVpfgaahilm--kmWtfgnfwçefWTSiDVlCVTASIeTLcvIAvdryfAIt
    2vt4A  (  72 )    LqtltnlFItsLAcADlvvGllVVpfgatlvvr--gtWlwgsflçelWTSlDVlCVTAsIeTLcvIAiDrylait
    3pblA  (  60 )    LqtttnyLVvsLAvADllvAtlVMpwvvylevt-ggvWnfsricÇdvFVTlDVmMcTAsIwNLCaISidRytAVv
    2ydv   (  37 )    LqnvtnyFVvsAAaADilVGvlAIpfaiaIst----GfçaaçhgÇLfiACfVLVLTASSIfSLlaIAiDryiair
    3v2w   (  76 )    FhrpMYyFIgnLAlSDllaGvaYtaNlllsga---tTykLtPaqWFlREGsMFvALSASVfSLlaIAieryitml
    4djhA  (  90 )    mktaTniYIfNLAlADalVTtTMpfqstvylmn---sWpfgdvlÇkiVlsiDyyNMfTSIfTLtmMSvdRyiaVc
    4dkl   (  99 )    MktAtniYIfNLAlADalATsTLpfqsvnylmg---tWpfgnilÇkiviSidYyNMFTSIfTLctMSvdRyiAVC
    4ej4   (  80 )    LktATniYIfNLAlADalATstLpfqsakylme---tWpfgellÇkaVlSidYyNMFTSIfTLtmMSvDRyiavc
    4ea3A  (  82 )    mktatNiYIfNLAlADtlVLlTLpfQGtdillg---fWpfgnalÇktVIaiDyyNMFTSTfTLtaMSvdryvaic
    4grvA  (  98 )    lqstvhyHlgsLalSDllILllAMpvElyNFIWvhhpWafgdagÇrgyYflRDactYATAlNVasLSvaRylAic
    3oduA  (  69 )    lrsmtdkYRlhLSvADllFVitLpfWavDAva----nWyfgnflÇkaVHviYTVNlYSSVwILAfISlDRylAiV
    2lnl   (  70 )    GrsvTdvyLlnLalaDllfaltlpiwaaSkvn----gwifgtfLÇkvVslLkEvnfYsgilLlacIsvdrylaiv
    1u19A  (  68 )    LrtplNyILlnLAvADlfMVfg-GFtTTlyTSl-hGyFvfgptGÇnlEGffATLGGEIaLWSLvvLaieRyvvVc
    2z73A  (  65 )    LqtpanmFiinLAfSDftFSlvNGfplMtiSCf-lkkWifgfaaÇkvYGfiGGiFGFMsIMTMAMiSiDrynViG
                         aaaaaaaaaaaaaaaaaa aaaaaaaaaa         aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 
    
                               160       170       180       190       200       210       220   
    3uon   ( 127 )    kpltypvk---rttkmAgmmiaaAwvlSfilwapaIlfwqfivg-----------vrtVedgeÇyIqff------
    4dajA  ( 171 )    rpltyrak---rttkrAgvmiglAwviSfvlWApaIlfwqyfvg-----------krtVppgeÇfIqfl------
    3rze   ( 131 )    qplrylky---rtktrAsatilgawflSfl-WvipIlgwnh                 rredkÇeTdfy------
    2rh1   ( 137 )    spfkyqSl---ltknkArviilmvwivSgltSflpIqmhwyr-----athqeAinÇyae-etçÇdff--------
    2vt4A  ( 145 )    spfryqsl---mtrarAkviictvwaiSalvSflpImmhwWr-----dedpqAlkçyqd-pgçÇdfv--------
    3pblA  ( 134 )    mpvhyqhgtgqsscrrValmitavwvlAfaVSc-pLlfgfNtTg---------------dptvÇsIs--------
    2ydv   ( 108 )    iplryngl---vtgtrAkgiiaicwvlSfaIGltPmlgwnnÇgqp--kegkahsqgÇgegqvAÇlFedVV-----
    3v2w   ( 148 )    k           nnfrlfllisacwviSlilGglPimgwn---------------ÇisalssÇSTVLP-------
    4djhA  ( 162 )    hpvkaldf---rtplkAkiinicIwllSssvGisAivlGGtkvred------------vdvieÇslqFpdddysw
    4dkl   ( 171 )    hpvkaldf---rtprnAkivnvcNwilSsaiGlpVmfmAttkyrqg--------------sidçtltfsh-ptwy
    4ej4   ( 152 )    hpvkaldf---rtpakAklinicIwvlAsgvGvpimvmAvtqprdg--------------avvÇmlqfps-pswy
    4ea3A  ( 154 )    hp          tsskAqavnvaIwalAsvvGvpvaimGsAqvede--------------eieÇlveipt-pqdy
    4grvA  ( 173 )    hpfkaktl---msrsrtkkfisaIwlaSallAipMlftMGlqnrSadg--------thpgGlVÇTPiv----dta
    3oduA  ( 140 )    hatn---sqrprkllAekvVyvgVwipAlllT-ipDfif--Anvsead-----------dryiÇdrfyp---ndl
    2lnl   ( 141 )    haTr----tltqkrhlvkfvclgcwglsmnlS-lpFflf--RQayhpN----------NsSPvÇyEVlg-ndtak
    1u19A  ( 141 )    kpmsn----frfgenhaimgvafTwvmAlaCAapPlvgwSrYIPE-------------GMQCSÇGIDYYTpheet
    2z73A  ( 139 )    rpmaas---kkMshrrAfimiifVwlwSvlwAigPifgwGaYtLE-------------GVLCNÇSFdYIsr--ds
                                   aaaaaaaaaaaaaaaaaaa  aaa                                      
    
                          230       240       250       260       270       280       290       300 
    3uon   ( 182 )    snaavtfgtAiaaFylpviiMtvlywhisrasksri                   pppsrekkvtrtilaIllaF
    4dajA  ( 226 )    septitfgtAiaaFymPvtiMtilywrIyketek                       like   aqTlsaIllaF
    3rze   ( 186 )    dvtwfkvmtaiinFylPtllMlwfyakIykaVrqhc                   lhmnrerkaakQLgfIMaaF
    2rh1   ( 195 )    TnqayaiasSivSFyvplviMvfvYsrVfqeakrql                   kfclkeHkaLktlgiIMgtF
    2vt4A  ( 203 )    TnrayaiasSiiSFyipLliMifvalrvyreakeq                       irehkalktlgiImgvF
    3pblA  ( 185 )    -npdFViySSvvSFylPfgvTvlvyarIyvvlkqrrrk-----------------gvplrekkatqMVaiVlgaF
    2ydv   ( 173 )    pmnYMVyfNffaCVlvPlllMlgvylrIflaarrqlkqmesq             stlqkevhaakSLaiIvglF
    3v2w   ( 197 )    LYhkhYIlfCTtvFtllllsIvilYcriyslvrtr                   asrssenvaLlkTViiVLsvF
    4djhA  ( 222 )    wdlfmkicVfifAfviPvliIivcytlMilrlksvrllsg              rekdrnlrritrLVlvVVavF
    4dkl   ( 228 )    wenllKicVfifAfimPvliItvcyglmilrlksvr                   ekdrnlrritrMVlvVvavF
    4ej4   ( 209 )    wdtvtkicvflfAfvvPiliitvcyglMllrlrsvr                   ekdrslrriTrMVlvVvgaF
    4ea3A  ( 211 )    wgpvfaiciflfSFivPvlvIsvcyslMirrlrgvrlls-------------gsrekdrnlrritrLVlvVvavF
    4grvA  ( 233 )    tvkvvIqvNtfmSFlfPmlvIsilNtvIAnkLtvmv                     vqalrhGVlvAraVviaf
    3oduA  ( 195 )    wvvvfqfqhimvglilPgivIlsCyciIisklshs                     kghqkrkalktTviLilaF
    2lnl   ( 198 )    wrmvLrilPHtfGfivplfvmlfcygftlrtlf---------------------kahmgqkhrAmrvIfaVvlif
    1u19A  ( 199 )    nNesFViyMfvvHfiiPlivIffcygqLvftvkeaaaq------------qqesattqkaekevTrMviiMviaF
    2z73A  ( 196 )    ttrsNIlcMFilGffgPiliiffCyfnIvmsvsnhekemaamakrlnakelrkaqaganaemrlAkIsivIVsqF
                        aaaaaaaaaaa aaaaaaaaaaaaaaaaa                            aaaaaaaaaaaaaaaa
    
                               310       320       330       340       350       360       370   
    3uon   ( 397 )    iitWapYNvmVlintfçap--------ç--ipntvwtiGywlCYinstiNpacYalcnatFkktfkhllm     
    4dajA  ( 500 )    iitWtpyNimVlvntfçds--------ç--ipktywnlgywlCYiNStvNPvcYalcnktFrttfkt        
    3rze   ( 425 )    ilCWipYFiffmviafçkn--------ç--cnehlhmftiWlGYiNStlNPliYplCnenFkktfkrilhi    
    2rh1   ( 283 )    tlcWlpFFiVNivhviqdn----------lirkevyillNwiGYvNSgfNpliYc-rspdfriAfqellcl    
    2vt4A  ( 300 )    tlCWlpFFlvnivnvfnrd----------lvpdwlfvafnwlGYAnSAmnpiiYc-rspdfrkAfkrlla     
    3pblA  ( 339 )    ivCWlpFFltHvlnthçqt--------ç-hvspelysattwlGYvNsalNPviYttfnieFrkAflkilsc    
    2ydv   ( 243 )    alCWlpLHiiNcftffçpd--------çshaplwlMylAivlSHtNSvvNPfiyAyrireFrqTFrkiirshvlr
    3v2w   ( 266 )    iacwapLFiLLllDvgçkvk------tç--diLfrAeyfLvlAvlNSgtNPiiytltNkemrrafiri       
    4djhA  ( 284 )    vvcWtpIHifilvealgs            aalssyyfcIalGytNSslNPilYafldenFkrcfrdfcfp    
    4dkl   ( 290 )    ivcWtpIHiyViikaliti-------pettfqtvswhfcialGYtNSclNpvlYafldenFkrCfrefci     
    4ej4   ( 271 )    vvCWapIHifVivwtlvdi------nrrdplvvaalhlcialGYaNSslNpvlYaflDenfkrc           
    4ea3A  ( 273 )    vgcWtpVQvfvlaqglgvq-------pssetavailrfctAlGYvNSclNpilYafldenFkacfr         
    4grvA  ( 318 )    vvcWlpYHvRRlmFCyisdeq--WttflFdfYHyfYmlTNalAYasSAinpilYnlvsanFrqv           
    3oduA  ( 249 )    facWlpyyigisidsfilleiikqgçefentvhkwisitEAlAFfHCclNpilyaflgakfktsaqhalts    
    2lnl   ( 252 )    llcwlpynlvlLadTlmrtqviqeeRrNnIGraLdatEilGflhsclnpiiyafigqnfrhgflkilamhg  
    1u19A  ( 262 )    liCWlpYAgvAfyIfthqgsd---------fgpifMTipAFfAKtSAvyNPviYimmnkqFrnCmvttlccgknp
    2z73A  ( 271 )    llSWspYAvvAllAQfgplew---------VtpyaAQlpVMfAKaSaihNPmiYsvsHpkFreAIsqtfpwvLtc
                      aaaaaaaaaaaaaaaa                aaaaaaaaaaaaa   aaaaaaaaa aaaaaaaaaa       
    
                          380       390       400       410 
    3uon                                                   
    4dajA                                                  
    3rze                                                   
    2rh1                                                   
    2vt4A                                                  
    3pblA                                                  
    2ydv   ( 310 )    qqepfkaa                             
    3v2w                                                   
    4djhA                                                  
    4dkl                                                   
    4ej4                                                   
    4ea3A                                                  
    4grvA                                                  
    3oduA                                                  
    2lnl                                                   
    1u19A  ( 328 )    lgddeasttVsktetsqvapa                
    2z73A  ( 337 )    cqfddketeddkdaeteipage               
                                                           
    
    

  • ChEMBL - now with added DOIness



    In order to provide ChEMBL users with a persistent and citable link to datasets that have been deposited in ChEMBL we have started registering DOIs (Digital Object Identifiers) for these datasets. Many of you will be familiar with the use of DOIs as identifiers for journal articles but they can be used for any document that you want to permanently identify and share with others. By doing this we are providing people with a way of citing a deposited dataset in exactly the same way as you would a scientific publication.

    We are also hoping that by issuing DOIs for deposited data we will encourage people to contribute additional data to the ChEMBL database as the DOI will provide them with a permanent way to reference their contribution, for example by using the DOI in a subsequent publication.

    At the moment we have DOIs for four of the deposited datasets in the ChEMBL database. Two are results from screens on the GSK PKIS set and two are datasets measured as part of DNDi, and we expect the number to increase. These datasets and their DOIs are shown below.

    CHEMBL_ID      DOI                    Description
    CHEMBL1961873  10.6019/CHEMBL1961873  Compounds: GSK PKIS; Assays: Nanosyn kinase panel
    CHEMBL2007661  10.6019/CHEMBL2007661  Compounds: GSK PKIS; Assays: UNC Frye lab
    CHEMBL1857833  10.6019/CHEMBL1857833  Screening and optimization of specific chemical series against human African Trypanosomiasis (HAT)
    CHEMBL1862790  10.6019/CHEMBL1862790  Optimisation of fenarimol series for the treatment of Chagas disease

    The DOIs can be resolved to the ChEMBL Document Report Card via the DOI.org website, for example http://dx.doi.org/10.6019/CHEMBL1961873
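
    As a rough sketch, the resolver link for a deposited dataset can be assembled in Python as shown here. The pattern "10.6019/<CHEMBL_ID>" is inferred only from the four DOIs listed above, so treat it as an assumption and not a documented rule; the function names are made up for this example.

```python
# Resolver address as quoted in the post.
DOI_RESOLVER = "http://dx.doi.org/"
# Prefix shared by the four dataset DOIs listed in the post (assumed pattern).
CHEMBL_DOI_PREFIX = "10.6019"


def chembl_dataset_doi(chembl_id: str) -> str:
    """Build a dataset DOI from a ChEMBL document ID (assumed pattern)."""
    chembl_id = chembl_id.strip().upper()
    if not chembl_id.startswith("CHEMBL"):
        raise ValueError("expected an ID such as CHEMBL1961873")
    return f"{CHEMBL_DOI_PREFIX}/{chembl_id}"


def doi_url(doi: str) -> str:
    """Turn a DOI into a link that the DOI resolver can follow."""
    return DOI_RESOLVER + doi


if __name__ == "__main__":
    for chembl_id in ("CHEMBL1961873", "CHEMBL2007661",
                      "CHEMBL1857833", "CHEMBL1862790"):
        print(doi_url(chembl_dataset_doi(chembl_id)))
```

    The script only builds the links; it makes no network request, so whether a given link resolves still has to be checked in a browser.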

  • Open data for drug discovery: learning from the biological community


    Together with a collaborator from GSK, we've just co-authored an editorial on Open Data, available here.

    %T Open data for drug discovery: learning from the biological community.
    %A A. Hersey
    %A S. Senger
    %A J.P. Overington
    %J Future Medicinal Chemistry 
    %D 2012
    %I 10
    %V 4
    %P 1865-1867
    %O DOI:10.4155/fmc.12.159
    

    The picture of the Fifty Shades of Grey dog I found on the internet somewhere...

  • GPCR Structure: Human CXCR1 receptor


    Hot on the heels of this year's Nobel Prize for Chemistry, awarded for structural studies on GPCRs, there's a brand new GPCR structure published in Nature: human CXCR1, this time an NMR one. The PDBe code is 2lnl. This brings the total to 16 sequence-distinct rhodopsin-like GPCR structures that we now have free access to.

    %T Structure of the chemokine receptor CXCR1 in phospholipid bilayers
    %A S.H. Park
    %A B.B. Das
    %A F. Casagrande
    %A Y. Tian
    %A H.J. Nothnagel
    %A M. Chu
    %A H. Kiefer
    %A K. Maier
    %A A.A. De Angelis
    %A F.M. Marassi
    %A S.J. Opella
    %J Nature
    %D 2012
    %V 491
    %P 779-783
    %O doi:10.1038/nature11580
    

    1. 3uon - human muscarinic M2 receptor 
    2. 4daj - rat muscarinic M3 receptor 
    3. 3rze - human histamine H1 receptor
    4. 2rh1 - human beta-2 adrenergic receptor 
    5. 2vt4 - turkey beta-1 adrenergic receptor 
    6. 3pbl - human dopamine D3 receptor
    7. 2ydv - human adenosine A2a receptor 
    8. 3v2w - human sphingosine-1-phosphate receptor 
    9. 4djh - human kappa opioid receptor 
    10. 4dkl - mouse mu opioid receptor 
    11. 4ej4 - mouse delta opioid receptor
    12. 4ea3 - human nociceptin receptor
    13. 3odu - human CXCR4 receptor 
    14. 2lnl - human CXCR1 receptor (NMR)
    15. 2i35 - bovine rhodopsin 
    16. 2z73 - squid rhodopsin

                               10        20        30        40        50        60        70  
    3uon   (  20 )                                             tfevvfivlvagslSlvTiigNilVmvSI
    4dajA  (  64 )                                             iwqvvfiafltgflAlvTiigNilVivAF
    3rze   (  28 )                                                 mplvvvlsticlvTvglNllVlyAv
    2rh1   (  29 )                                            devwvvgmgivmslivlaIvfgNvlVitAI
    2vt4A  (  40 )                                               weagmsllmalVvllIvagNvlViaAi
    3pblA  (  32 )                                                   yalsYcalilaIvfgNglVcmAV
    2ydv   (   3 )                                             imgssvYitvElaiavlAilgNvlVcwAv
    3v2w   (  17 )           sdyvnydIIvrHYnyTgklnisa                ltsvvfiliCcfIileNifvllti
    4djhA  (  55 )                                            spaipviitavysvvfvvGlvgNslVmfVI
    4dkl   (  65 )                                             mvtaitimalYsiVcvvGlfgNflvmyvI
    4ej4   (  41 )                                        rsasslalaiaitalYsavcavGllgNvlvmfgI
    4ea3A  (  47 )                                            plglkvtIvglYlavcvgGllgNclvmyVI
    3oduA  (  27 )            pçfre-------------------------enanfnkiflptiYsiIfltGivgNglvilvM
    2lnl   (  29 )            pÇmle--------------------------tetLnkYvviiayalvFllsllgNslvMlvi
    1u19A  (   1 )            mnGtegpnfyVPfsnktgvVrsPFeapQyyLaepwqFsmlAayMflLimlGfpiNflTlyVT
    2z73A  (   9 )         etwwyNpsIvVhpHWref--------------dqvpdavYyslGifIgiCgiiGcggNgiViyLF
                                                                  aaaaaaaaaaaaaaaaaaaaaaaaaa
    
                               80        90        100       110       120       130       140 
    3uon   (  49 )    kvnrhLqtvnnyflfSLAcADliiGvfSMnlytlytvi-gyWplgpvvÇdlWlalDYvVSNAsVmNLlii
    4dajA  (  93 )    kvnkqLktvnnyFllSLAcADliIGviSMnlFttyiim-nrWalgnlaÇdlwLSiDYvASNAsVmNLlvI
    3rze   (  53 )    rserkLhtvGnlYIvsLSvADliVGavVMpmnilyllm-skwsLgrplÇlfWLSmDYVASTASIfSVfiL
    2rh1   (  59 )    akferLqtvtnyFItsLAcADlvMGlaVVpfgaahilm-kmWtfgnfwçefWTSiDVlCVTASIeTLcvI
    2vt4A  (  67 )    gstqrLqtltnlFItsLAcADlvvGllVVpfgatlvvr-gtWlwgsflçelWTSlDVlCVTAsIeTLcvI
    3pblA  (  55 )    lkeraLqtttnyLVvsLAvADllvAtlVMpwvvylevtggvWnfsricÇdvFVTlDVmMcTAsIwNLCaI
    2ydv   (  32 )    wlnsnLqnvtnyFVvsAAaADilVGvlAIpfaiaIst---GfçaaçhgÇLfiACfVLVLTASSIfSLlaI
    3v2w   (  71 )    wktkkFhrpMYyFIgnLAlSDllaGvaYtaNlllsga--tTykLtPaqWFlREGsMFvALSASVfSLlaI
    4djhA  (  85 )    irytkmktaTniYIfNLAlADalVTtTMpfqstvylmn--sWpfgdvlÇkiVlsiDyyNMfTSIfTLtmM
    4dkl   (  94 )    vrytkMktAtniYIfNLAlADalATsTLpfqsvnylmg--tWpfgnilÇkiviSidYyNMFTSIfTLctM
    4ej4   (  75 )    vrytkLktATniYIfNLAlADalATstLpfqsakylme--tWpfgellÇkaVlSidYyNMFTSIfTLtmM
    4ea3A  (  77 )    lrhtkmktatNiYIfNLAlADtlVLlTLpfQGtdillg--fWpfgnalÇktVIaiDyyNMFTSTfTLtaM
    3oduA  (  64 )    gyqkklrsmtdkYRlhLSvADllFVitLpfWavDAva---nWyfgnflÇkaVHviYTVNlYSSVwILAfI
    2lnl   (  65 )    lysrvGrsvTdvyLlnLalaDllfaltlpiwaaSkvn---gwifgtfLÇkvVslLkEvnfYsgilLlacI
    1u19A  (  63 )    vqHkkLrtplNyILlnLAvADlfMVfg-GFtTTlyTSlhGyFvfgptGÇnlEGffATLGGEIaLWSLvvL
    2z73A  (  60 )    tktksLqtpanmFiinLAfSDftFSlvNGfplMtiSCflkkWifgfaaÇkvYGfiGGiFGFMsIMTMAMi
                      aa      aaaaaaaaaaaaaaaaaa aaaaaaaaaa        aaaaaaaaaaaaaaaaaaaaaaaaa
    
                               150       160       170       180       190       200       210 
    3uon   ( 118 )    Sfdryfcvtkpltypvk---rttkmAgmmiaaAwvlSfilwapaIlfwqfivg-----------vrtVed
    4dajA  ( 162 )    SfDryfsitrpltyrak---rttkrAgvmiglAwviSfvlWApaIlfwqyfvg-----------krtVpp
    3rze   ( 122 )    CiDryrsvqqplrylky---rtktrAsatilgawflSfl-WvipIlgwnh                 rre
    2rh1   ( 128 )    AvdryfAItspfkyqSl---ltknkArviilmvwivSgltSflpIqmhwyr-----athqeAinÇyae-e
    2vt4A  ( 136 )    AiDrylaitspfryqsl---mtrarAkviictvwaiSalvSflpImmhwWr-----dedpqAlkçyqd-p
    3pblA  ( 125 )    SidRytAVvmpvhyqhgtgqsscrrValmitavwvlAfaVSc-pLlfgfNtTg---------------dp
    2ydv   (  99 )    AiDryiairiplryngl---vtgtrAkgiiaicwvlSfaIGltPmlgwnnÇgqp--kegkahsqgÇgegq
    3v2w   ( 139 )    Aieryitmlk           nnfrlfllisacwviSlilGglPimgwn---------------Çisals
    4djhA  ( 153 )    SvdRyiaVchpvkaldf---rtplkAkiinicIwllSssvGisAivlGGtkvred------------vdv
    4dkl   ( 162 )    SvdRyiAVChpvkaldf---rtprnAkivnvcNwilSsaiGlpVmfmAttkyrqg--------------s
    4ej4   ( 143 )    SvDRyiavchpvkaldf---rtpakAklinicIwvlAsgvGvpimvmAvtqprdg--------------a
    4ea3A  ( 145 )    Svdryvaichp          tsskAqavnvaIwalAsvvGvpvaimGsAqvede--------------e
    3oduA  ( 131 )    SlDRylAiVhatn---sqrprkllAekvVyvgVwipAlllT-ipDfif--Anvsead-----------dr
    2lnl   ( 132 )    svdrylaivhaTr----tltqkrhlvkfvclgcwglsmnlS-lpFflf--RQayhpN----------NsS
    1u19A  ( 132 )    aieRyvvVckpmsn----frfgenhaimgvafTwvmAlaCAapPlvgwSrYIPE-------------GMQ
    2z73A  ( 130 )    SiDrynViGrpmaas---kkMshrrAfimiifVwlwSvlwAigPifgwGaYtLE-------------GVL
                      aaaaaaaa              aaaaaaaaaaaaaaaaaaa  aaa                        
    
                               220       230       240       250       260       270       280 
    3uon   ( 174 )    geÇyIqff------snaavtfgtAiaaFylpviiMtvlywhisrasksri                   p
    4dajA  ( 218 )    geÇfIqfl------septitfgtAiaaFymPvtiMtilywrIyketek                      
    3rze   ( 178 )    dkÇeTdfy------dvtwfkvmtaiinFylPtllMlwfyakIykaVrqhc                   l
    2rh1   ( 189 )    tçÇdff--------TnqayaiasSivSFyvplviMvfvYsrVfqeakrql                   k
    2vt4A  ( 197 )    gçÇdfv--------TnrayaiasSiiSFyipLliMifvalrvyreakeq                     
    3pblA  ( 179 )    tvÇsIs---------npdFViySSvvSFylPfgvTvlvyarIyvvlkqrrrk-----------------g
    2ydv   ( 164 )    vAÇlFedVV-----pmnYMVyfNffaCVlvPlllMlgvylrIflaarrqlkqmesq             s
    3v2w   ( 190 )    sÇSTVLP-------LYhkhYIlfCTtvFtllllsIvilYcriyslvrtr                   as
    4djhA  ( 208 )    ieÇslqFpdddyswwdlfmkicVfifAfviPvliIivcytlMilrlksvrllsg              re
    4dkl   ( 215 )    idçtltfsh-ptwywenllKicVfifAfimPvliItvcyglmilrlksvr                   e
    4ej4   ( 196 )    vvÇmlqfps-pswywdtvtkicvflfAfvvPiliitvcyglMllrlrsvr                   e
    4ea3A  ( 198 )    ieÇlveipt-pqdywgpvfaiciflfSFivPvlvIsvcyslMirrlrgvrlls-------------gsre
    3oduA  ( 184 )    yiÇdrfyp---ndlwvvvfqfqhimvglilPgivIlsCyciIisklshs                     
    2lnl   ( 185 )    PvÇyEVlg-ndtakwrmvLrilPHtfGfivplfvmlfcygftlrtlf---------------------ka
    1u19A  ( 185 )    CSÇGIDYYTpheetnNesFViyMfvvHfiiPlivIffcygqLvftvkeaaaq------------qqesat
    2z73A  ( 184 )    CNÇSFdYIsr--dsttrsNIlcMFilGffgPiliiffCyfnIvmsvsnhekemaamakrlnakelrkaqa
                                      aaaaaaaaaa  aaaaaaaaaaaaaaaaa                         
    
                               290       300       310       320       330       340       350 
    3uon   ( 378 )    ppsrekkvtrtilaIllaFiitWapYNvmVlintfçap--------ç--ipntvwtiGywlCYinstiNp
    4dajA  ( 482 )     like   aqTlsaIllaFiitWtpyNimVlvntfçds--------ç--ipktywnlgywlCYiNStvNP
    3rze   ( 406 )    hmnrerkaakQLgfIMaaFilCWipYFiffmviafçkn--------ç--cnehlhmftiWlGYiNStlNP
    2rh1   ( 264 )    fclkeHkaLktlgiIMgtFtlcWlpFFiVNivhviqdn----------lirkevyillNwiGYvNSgfNp
    2vt4A  ( 238 )      irehkalktlgiImgvFtlCWlpFFlvnivnvfnrd----------lvpdwlfvafnwlGYAnSAmnp
    3pblA  ( 320 )    vplrekkatqMVaiVlgaFivCWlpFFltHvlnthçqt--------ç-hvspelysattwlGYvNsalNP
    2ydv   ( 224 )    tlqkevhaakSLaiIvglFalCWlpLHiiNcftffçpd--------çshaplwlMylAivlSHtNSvvNP
    3v2w   ( 247 )    rssenvaLlkTViiVLsvFiacwapLFiLLllDvgçkvk------tç--diLfrAeyfLvlAvlNSgtNP
    4djhA  ( 265 )    kdrnlrritrLVlvVVavFvvcWtpIHifilvealgs            aalssyyfcIalGytNSslNP
    4dkl   ( 271 )    kdrnlrritrMVlvVvavFivcWtpIHiyViikaliti-------pettfqtvswhfcialGYtNSclNp
    4ej4   ( 252 )    kdrslrriTrMVlvVvgaFvvCWapIHifVivwtlvdi------nrrdplvvaalhlcialGYaNSslNp
    4ea3A  ( 254 )    kdrnlrritrLVlvVvavFvgcWtpVQvfvlaqglgvq-------pssetavailrfctAlGYvNSclNp
    3oduA  ( 230 )    kghqkrkalktTviLilaFfacWlpyyigisidsfilleiikqgçefentvhkwisitEAlAFfHCclNp
    2lnl   ( 233 )    hmgqkhrAmrvIfaVvlifllcwlpynlvlLadTlmrtqviqeeRrNnIGraLdatEilGflhsclnp
    1u19A  ( 243 )    tqkaekevTrMviiMviaFliCWlpYAgvAfyIfthqgsd---------fgpifMTipAFfAKtSAvyNP
    2z73A  ( 252 )    ganaemrlAkIsivIVsqFllSWspYAvvAllAQfgplew---------VtpyaAQlpVMfAKaSaihNP
                         aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa                aaaaaaaaaaaaa   aaa
    
                               360       370       380       390       400       410 
    3uon   ( 438 )    acYalcnatFkktfkhllm                                          
    4dajA  ( 541 )    vcYalcnktFrttfkt                                             
    3rze   ( 466 )    liYplCnenFkktfkrilhi                                         
    2rh1   ( 324 )    liYc-rspdfriAfqellcl                                         
    2vt4A  ( 341 )    iiYc-rspdfrkAfkrlla                                          
    3pblA  ( 381 )    viYttfnieFrkAflkilsc                                         
    2ydv   ( 286 )    fiyAyrireFrqTFrkiirshvlrqqepfkaa                             
    3v2w   ( 309 )    iiytltNkemrrafiri                                            
    4djhA  ( 328 )    ilYafldenFkrcfrdfcfp                                         
    4dkl   ( 334 )    vlYafldenFkrCfrefci                                          
    4ej4   ( 316 )    vlYaflDenfkrc                                                
    4ea3A  ( 317 )    ilYafldenFkacfr                                              
    3oduA  ( 300 )    ilyaflgakfktsaqhalts                                         
    2lnl   ( 303 )    iiyafigqnfrhgflkilamhg                                       
    1u19A  ( 304 )    viYimmnkqFrnCmvttlccgknplgddeasttVsktetsqvapa                
    2z73A  ( 313 )    miYsvsHpkFreAIsqtfpwvLtccqfddketeddkdaeteipage               
                      aaaaa  aaaaaaaaaa