#NEXUS [!This data set was downloaded from TreeBASE, a relational database of phylogenetic knowledge. TreeBASE has been supported by the NSF, Harvard University, Yale University, SDSC and UC Davis. Please do not remove this acknowledgment from the Nexus file. Generated on September 19, 2025; 21:50 GMT TreeBASE (cc) 1994-2008 Study reference: Yan H., Lou Z., Li L., Brindley P.J., Zheng Y., Luo X., Hou J., Guo A., Jia W., & Cai X. 2014. Genome-wide analysis of regulatory proteases sequences identified through bioinformatics data mining in Taenia solium. BMC Genomics, . TreeBASE Study URI: http://purl.org/phylo/treebase/phylows/study/TB2:S15682] [ The following blocks are input data for analysis step 13557 ] BEGIN TAXA; TITLE TaxaForAnalysisStep13557; DIMENSIONS NTAX=68; TAXLABELS Aedes_aegypti_EAT45919 Brugia_malayi_AAT07059 Caenorhabditis_elegans_AAB65956 Caenorhabditis_elegans_CPR3_P43507 Caenorhabditis_elegans_CPR5_P43509 Caenorhabditis_elegans_CPR6_P43510 Caenorhabditis_elegans_CPR6_Q8MQC6 Caenorhabditis_elegans_CPZ1_G5EGP8 Clonorchis_sinensis_AAP33049 Clonorchis_sinensis_AAP33050 'Danio rerio NP_001071036' 'Drosophila melanogaster CATB NP_001259536' Drosophila_melanogaster_CATL_Q95029 Echinococcus_multilocularis_CATL_E9RH13 Echinococcus_multilocularis_CATL_Q0WYD8 Fasciola_gigantica_AAF44675 Fasciola_gigantica_AAF44676 Fasciola_hepatica_CATLL_Q24940 Homo_sapiens_AAD26616 Homo_sapiens_CATB_P07858 Homo_sapiens_CATFF_Q9UBX1 Homo_sapiens_CATH_P09668 Homo_sapiens_CATK_P43235 Homo_sapiens_CATL_P07711 Homo_sapiens_CATO_P43234 Homo_sapiens_CATS_P25774 Homo_sapiens_CATV_O60911 Homo_sapiens_CATW_P56202 Homo_sapiens_CATZ_Q9UBR2 Mus_musculus_CATB_P10605 Mus_musculus_CATF_Q9R013 Mus_musculus_CATH_P49935 Mus_musculus_CATK_P55097 Mus_musculus_CATL_P06797 Mus_musculus_CATM_Q9JL96 Mus_musculus_CATO_Q8BM88 Mus_musculus_CATP_Q9R014 Mus_musculus_CATR_Q9JIA9 'Mus musculus CATS NP_001254624' Mus_musculus_CATW_P56203 Mus_musculus_CATZ_Q9WUU7 Mus_musculus_CTS1_Q9JI84 Mus_musculus_CTS2_Q9JI81 Mus_musculus_CTS3_Q9DAZ8 Mus_musculus_CTS6_Q9ET52 Paragonimus_westermani_AAF21461 Paragonimus_westermani_AAW28151 Paragonimus_westermani_AAY81946 Schistosoma_japonicum_AAW25775 Schistosoma_japonicum_CATB_P43157 Schistosoma_japonicum_CB_Q7Z1I6 Schistosoma_mansoni_CAA83538 Schistosoma_mansoni_CB2_Q95PM1 Schistosoma_mansoni_CYSP_P25792 Taenia_asiatica_CATL_B7XBA1 Taenia_pisiformis_CESTL_F6MEN8 Taenia_saginata_CATL_B7XBA0 Taenia_solium_AAS00027 Taenia_solium_LongOrf.asmbl_1043 Taenia_solium_LongOrf.asmbl_24242 Taenia_solium_LongOrf.asmbl_24428 Taenia_solium_LongOrf.asmbl_6319 Taenia_solium_Scaffold00002.gene342 Taenia_solium_Scaffold00009.gene1353 Taenia_solium_Scaffold00115.gene6434 Taenia_solium_Scaffold00212.gene8293 Trichobilharzia_regenti_CB2 Trypanosoma_brucei_AAX80359 ; END; BEGIN CHARACTERS; [! TreeBASE Matrix URI: http://purl.org/phylo/treebase/phylows/matrix/TB2:M21803] TITLE C1_proteases_family; LINK TAXA = TaxaForAnalysisStep13557; DIMENSIONS NCHAR=455; FORMAT DATATYPE=Protein SYMBOLS= "A C D E F G H I K L M N P Q R S T V W Y" MISSING=? GAP= -; MATRIX [ 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 230 240 250 260 270 280 290 300 310 320 330 340 350 360 370 380 390 400 410 420 430 440 450 ] [ . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ] Aedes_aegypti_EAT45919 -----------VSLYELVKEE---WNAFKLQHRKNYDSETEERIRLK-IYVQNKHKIAKHNQRFDLGQEKYRLRVNKYADLLHEEFVQTVNGFNRTD--SKKSLKGVRIE-----------EPVTFIEP-ANVEVPTTVDWRKKG-------AVTPVKDQGHCGSCWSFSATGALEGQHFRKTGKL------VSLSEQNLVDCSG-------KYGNN--GCNGGMMDYAFQYI-----------KDNG---GIDTEKSYPYEA---------------IDDT---CHFN--------------PKAVGATDKGYVDIPQGDEEALKKALATVGPVSIAIDASHESFQFYSEGVYYEPQCDS--ENLDHGVLAVGYG-------------------------TSEEGED-------YWLVKNSWGTTWGDQGYVKMARNRDNHCGVATCASYPLV- Brugia_malayi_AAT07059 MKSLDLAMNSQEWQNEEKKTLWSDFMTFIKKFKREYSSIEEQLDRFR-IYLQN---MNFAKKLQFEEKGTAIYGATKFSDMTAEEFQKIMLPSIWWDRVESNG----------------IT-FNLNDFNLSIYNLPSKFDWRT-E------GVVTPVKDQGSCGSCWAFSVTGNIESLWAIKTGKL------ISLSEQELIDCD-----------VIDKGCNGGLPINAFREIKR--------------MGGLEPEDQYPYEAKNGT------------------CHLVRA------------QIA--VSIDDAVEIPR-NETVMKAWIAQRGPLSVGIDAEL--LSYYKSGILHPSKSRCPPSKINHGVLITGYG--------------------------IEN-------NLPYWTIKNSWGEQWGENGYFQLMRGK-NICGVSDLVSSAIIY Caenorhabditis_elegans_AAB65956 DSITVQELRKAKIIRPRDYVIWNSFLDFVDRHEKKYTNKREVLKRFR-VFKKN---AKVIRELQKNEQGTAVYGFTKFSDMTTMEFKKIMLPYQWEQPVYPMEQ---------------AN-FEKHDVTINEEDLPESFDWRE-K------GAVTQVKNQGNCGSCWAFSTTGNVEGAWFIAKNKL------VSLSEQELVDCD-----------SMDQGCNGGLPSNAYKEIIR--------------MGGLEPEDAYPYDGRGET------------------CHLVRK------------DIA--VYINGSVELPH-DEVEMQKWLVTKGPISIGLNANT--LQFYRHGVVHPFKIFCEPFMLNHGVLIVGYG--------------------------KDG-------RKPYWIVKNSWGPNWGEAGYFKLYRGK-NVCGVQEMATSALVN Caenorhabditis_elegans_CPR3_P43507 ----------------------------------IGQSPQKVLVDHVN-TVQTSWVAEHNEISEFEMKFK------VMDVKFAEPLEKDSDVASEL----------------------------FVRGEIVPEPLPDTFDAREK--WPDCN-TIKLIRNQATCGSCWAFGAAEVISDRVCIQSNGT----QQPVISVEDILSCCG---------TTCGYGCKGGYSIEALRFWASS------GAVTGGDYG-GHGCMPYSFAPCTK--------NCPESTTPS--CKTTCQS----SYKTEEYKKDKHYGASAYKVTTTKSVTEIQTEIYHYGPVEASYKVYED-FYHYKSGVYHYTSGKLV---GGHAVKIIGWGV--------------------------ENGVD-------YWLIANSWGTSFGEKGFFKIRRG-TNECQIEGNVVA---- Caenorhabditis_elegans_CPR5_P43509 -----------------------------------PALTGQALIDYVN-SAQKLWTAGHQVIPKEKITKK------LMDVKYLVPHKDEDIVA-----------------------------------TEVSDAIPDHFDARDQ--WPNCM-SINNIRDQSDCGSCWAFAAAEAISDRTCIASNGA----VNTLLSSEDLLSCCTGM-------FSCGNGCEGGYPIQAWKWWVKH------GLVTGGSYETQFGCKPYSIAPCGETVNGVKWPACPEDTEPTPKCVDSCTSKN--NYAT-PYLQDKHFGSTAYAVG--KKVEQIQTEILTNGPIEVAFTVYED-FYQYTTGVYVHTAGASL---GGHAVKILGWGV--------------------------DNGTP-------YWLVANSWNVAWGEKGYFRIIRG-LNECGIEHSAVA---- Caenorhabditis_elegans_CPR6_P43510 --------------------------------LEAAELDGDDLIDYVNENQNL-WTAKKQRRFSSVYGENDKAKWGLMGVNHVRLSVKGKQ-HL-------------------------------SKTKDLDLDIPESFDSRDN--WPKCD-SIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGE----LQVTLSADDLLSCCK----------SCGFGCNGGDPLAAWRYWVKD------GIVTGSNYTANNGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKCEKKCVS----DYTDKTYSEDKFFGASAYGVK--DDVEAIQKELMTHGPLEIAFEVYED-FLNYDGGVYVHTGGKLG---GGHAVKLIGWGI--------------------------DDGIP-------YWTVANSWNTDWGEDGFFRILRG-VDECGIESGVVG---- Caenorhabditis_elegans_CPR6_Q8MQC6 ----------------------------------VETLTGQALVDYVN-SAQSLFKTEHVEITEEEMKFK------LMDGKYAAAHSDEIRAT---------------------------------EQEVVLASVPATFDSRTQ--WSECK-SIKLIRDQATCGSCWAFGAAEMISDRTCIETKGA----QQPIISPDDLLSCCG---------SSCGNGCEGGYPIQALRWWDSK------GVVTGGDYH-GAGCKPYPIAPCTSG-------NCPESKTPS--CSMSCQS----GYST-AYAKDKHFGVSAYAVP--KNAASIQAEIYANGPVEAAFSVYED-FYKYKSGVYKHTAGKYL---GGHAIKIIGWGT--------------------------ESGSP-------YWLVANSWGVNWGESGFFKIYRG-DDQCGIESAVVA---- Caenorhabditis_elegans_CPZ1_G5EGP8 --------------------------------VRKYSNRNRYNLKGCYKQTGRVFEHKRYDRIYETEDFD-------------------------------------------------------------SEDLPKTWDWRDANGINYAS-ADRNQHIPQYCGSCWAFGATSALADRINIKRKNA---WPQAYLSVQEVIDCSG-----------AGTCVMGGEPGGVYKYAHEH------GIP-------HETCNNYQAR--------------DGKCDPYNRCGSCWP-GE--CFSIKNYTLYKVSEYGTVHGY-----EKMKAEIYHKGPIACGIAATKA-FETYAGGIYKEVTDED----IDHIISVHGWGVDH------------------------ESGVE-------YWIGRNSWGEPWGEHGWFKIVTS-QYKN-AGSKYNL---- Clonorchis_sinensis_AAP33049 VLVTTIWSALARTTQVEPDNARALYEEFTLKYKKTYSND-DDELRFE-IFKDN---LLRAKRLQEMEQGTAQYGVTQFSDLTSEEFKTRYLRMRFDGPIVSED------------------LTPEEDVTMD----NEKFDWRE-H------GAVGPVLDQGKCGSCWAFSVIGNVVGQWFRKTGHL------LALSEQQLVDCD-----------YLDDGCDGGYPPQTYTAIQK--------------MGGLELASDYPYTGVGGI------------------CHMDKS------------KFV--AYVNGSTILPL-SEKVQAQKLRAIGPLSSALNADT--LQLYKGGIMRPK--WCDPAGVNHAVLTVGYG--------------------------VQN-------GKPYWIVKNSWGEDFGEKGYFRIYRGD-GTCGINSIVTTAIIK Clonorchis_sinensis_AAP33050 LVALGFFGVLG-SNIPESENARQLYEEFKLKYKKSYSND-DDEYRFR-VFKDN---LLRIKQFQNMERGTAKYGVTQFSDLTAQEFKVRYLRSKFGGVPVDRE------------------PVPFIRMDVD----DDNFDWRN-H------GAVGPVLDQGDCGSCWAFSAVGNIEGQWFRKTDNL------LQLSEQQLLDCD-----------EVDEGCNGGTPQQAFKQILG--------------MGGLQLDSDYPYEGREGQ------------------CRMVPS------------KVK--VYINGSKILPE-DEQIQAQMLKETGPLSSALNALF--LQFYTEGILHPLPALCDAQSLNHAVLTVGYG--------------------------KEG-------RLPYWTVKNSWSTMFGENGYFRIYRGD-GTCGINTLVSTSIIL 'Danio rerio NP_001071036' KVAAVPLTHSKPMKES--VELLTMFKNFMITYNRTYSSQEEAEKRLR-IFQQN---MKTAQTLQSLEQGSAEYGITKFSDLTEDEFRMMYLNPMLSQWSLKKE------------------MKPAIP--ASA-PAPDTWDWRD-------HGAVSPVKNQGMCGSCWAFSVTGNIEGQWFKKTGQL------LSLSEQELVDCD-----------KLDQACGGGLPSNAYEAIEN--------------LGGLETETDYSYTGHKQS------------------CDFSTG------------KVA--AYINSSVELPK-DEKEIAAFLAENGPVSAALNAFA--MQFYRKGVSHPLKIFCNPWMIDHAVLLVGFG--------------------------QRN-------GVPFWAIKNSWGEDYGEQGYYYLYRGS-GLCGIHKMCSSAIVN 'Drosophila melanogaster CATB NP_001259536' --------------------------------SGEPSLLSDEFIEVGR-NFDASVTEGHIRRLMGVHPDA-----HKF-ALPD----KREVLG--------------------------------DLYVNSVDELPEEFDSRKQ--WPNCP-TIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSGGK----VNFHFSADDLVSCCH----------TCGFGCNGGFPGAAWSYWTRK------GIVSGGPYGSNQGCRPYEISPCEHHVNG-TRPPCAHGG-RTPKCSHVCQS----GYTVD-YAKDKHFGSKSYSVR--RNVREIQEEIMTNGPVEGAFTVYED-LILYKDGVYQHEHGKEL---GGHAIRILGWGVWG------------------------EEKIP-------YWLIGNSWNTDWGDHGFFRILRG-QDHCGIESSISA---- Drosophila_melanogaster_CATL_Q95029 ----------------VVMEE---WHTFKLEHRKNYQDETEERFRLK-IFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLHHEFRQLMNGFNYT---LHKQLRAADES----------FKGVTFISP-AHVTLPKSVDWRTKG-------AVTAVKDQGHCGSCWAFSSTGALEGQHFRKSGVL------VSLSEQNLVDCS-------TKYGNN--GCNGGLMDNAFRYI-----------KDNG---GIDTEKSYPYEA---------------IDDS---CHFN--------------KGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASHESFQFYSEGVYNEPQCDA--QNLDHGVLVVGFG-------------------------TDESGED-------YWLVKNSWGTTWGDKGFIKMLRNKENQCGIASASSY---- Echinococcus_multilocularis_CATL_E9RH13 --------------------------------VALHEPLSSAIIDYVN-RINTTWKAEPSRRFTSPSQVR-----QQLGALPDPM--GRRLPVLY------------------------------SLS-ENYKSLPASFDPRKK--WPNCK-TLFEIRDQGSCGSCWAFGAAEAMSDRLCIQQQTVSGRAVMVRLSADDLLSCCR----------DCGMGCNGGFPSQAWNFWKHE------GLVSGGLYGTKGVCRAYEIPPCEHHVNG-TRPPCEGDA-PTPKCKNVCQE----EYKVP-YKKDKHYAVKVYSVH--SNEDAIKHELITHGPVEADFEVYAD-FPTYKSGVYQHVSGALL---GGHAIKLMGWGE--------------------------EDGVP-------YWLCANSWNTDWGEGGFFKILRG-KNHCGIESDIVA---- Echinococcus_multilocularis_CATL_Q0WYD8 ----------------FLQSI---WRGWKVANNKTYATLREEHLRMR-IFINNYLFVRWHNERYYLGLETYSTALNAFADLTLEEFAEKYLTLKQTP--MEGIWQDMS---------------TQYVERPTRMLVPDSIDWRKKG-------LVTPIKDQGDCGSCWAFSATGALEGQLKRKTGKL------ISLSEQQLVDCS-------TYTGNE--GCNGGDMNDAFRYW-----------MRN----GAESESDYPYTA---------------MDGK---CKFN--------------SSKVVTKVSKFVKVPKKREDQLKLSVAQVGPVSVAIDATSSGFMLYKKGIYQDNTCSQ--QYLDHAVLVVGYD--------------------------ADKTRQK------YWIVKNSWGEDWGQRGYIWMARDKGNMCGIATMASY---- Fasciola_gigantica_AAF44675 ----------------SNDDL---WHQWKRMYNKEYNGA-DDEHRRN-IWEENVKHIQEHNLRHDLGLVTYTLGLNQFTDMTFEEFKAKYLT-EMPR--ASDILS-----HG-----------IPYEANNRA--VPDKIDWRESG-------YVTEVKDQGNCGSCWAFSTTGTMEGQYMKNERTS------ISFSEQQLVDCSG-------PWGNY--GCMGGLMENAYEYL-----------KQF----GLETESSYPYTA---------------VEGQ---CRYN--------------RQLGVAKVTDYYTVHSGSEVELKNLVGAEGPAAVAVDVESD-FTMYSGGIYQSRTCSS--LRVNHAVLAVGYG--------------------------TQGG-TD------YWIVKNSWGSSWGERGYIRMVRNRGNMCGIASLASLPMVA Fasciola_gigantica_AAF44676 ----------------SNDDL---WHQWKRIYNKEYNGA-DDDHRRN-IWEKNVKHIQEHNLRHDLGLVTYTLGLNQFTDMTFEEFKAKYLT-EMPH--RSDILS-----HG-----------IPYEANKRA--VPASIDWRESG-------YVTEVKDQGQCGSCWAFSTTGAMEGQYMKNQRTS------ISFSEQQLVDCSD-------DFGNF--GCNGGLMENACEYL-----------KRF----GLETESSYPYRA---------------VEGP---CRYN--------------KQLGVAKVTGYYMVHSGDEVELQNLVGIEGPAAVALDVDSD-FMMYRSGIYQSQTCSP--EFLNHGVLAVGYG--------------------------TQSG-TD------YWIVKNSWGPWWGENGYIRMVRNRGNMCGIASLASVPMVA Fasciola_hepatica_CATLL_Q24940 ------------------DDL---WHQWKRMYNKEYNGA-DDQHRRN-IWEKNVKHIQEHNLRHDLGLVTYTLGLNQFTDMTFEEFKAKYLT-EMSR--ASDILS-----HG-----------VPYEANNRA--VPDKIDWRESG-------YVTEVKDQGNCGSCWAFSTTGTMEGQYMKNERTS------ISFSEQQLVDCS-------GPWGNN--GCSGGLMENAYQYL-----------KQF----GLETESSYPYTA---------------VEGQ---CRYN--------------KQLGVAKVTGYYTVHSGSEVELKNLVGARRPAAVAVDVES-DFMMYRSGIYQSQTCSP--LRVNHAVLAVGYG--------------------------TQGG-TD------YWIVKNSWGTYWGERGYIRMARNRGNMCGIASLASL---- Homo_sapiens_AAD26616 FSSVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLS-VFVNN---MVRAQKIQALDRGTAQYGVTKFSDLTEEEFRTIYLNTLLRK-EPGNK------------------MKQAKS--VGD-LAPPEWDWRS-K------GAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTL------LSLSEQELLDCD-----------KMDKACMGGLPSNAYSAIKN--------------LGGLETEDDYSYQGHMQS------------------CNFSAE------------KAK--VYINDSVELSQ-NEQKLAAWLAKRGPISVAINAFG--MQFYRHGISRPLRPLCSPWLIDHAVLLVGYG--------------------------NRS-------DVPFWAIKNSWGTDWGEKGYYYLHRGS-GACGVNTMASSAVVD Homo_sapiens_CATB_P07858 --------------------------------RPSFHPLSDELVNYVN-KRNTTWQAGHNFYNVDMSYLK-----RLCGTFLG----GPKPPQR--------------------------------VMFTEDLKLPASFDAREQ--WPQCP-TIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAH----VSVEVSAEDLLTCCGS---------MCGDGCNGGYPAEAWNFWTRK------GLVSGGLYESHVGCRPYSIPPCEHHVNG-SRPPCTGEG-DTPKCSKICEP----GYSPT-YKQDKHYGYNSYSVS--NSEKDIMAEIYKNGPVEGAFSVYSD-FLLYKSGVYQHVTGEMM---GGHAIRILGWGV--------------------------ENGTP-------YWLVANSWNTDWGDNGFFKILRG-QDHCGIESEVVA---- Homo_sapiens_CATFF_Q9UBX1 -------------------KMASIFKNFVITYNRTYESKEEARWRLS-VFVNN---MVRAQKIQALDRGTAQYGVTKFSDLTEEEFRTIYLNTLLRK-EPGNKMK------------------QAKS--VGD-LAPPEWDWRS-K------GAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTL------LSLSEQELLDCD-----------KMDKACMGGLPSNAYSAI--------------KNLGGLETEDDYSYQGHM------------------QSCNFSAE------------KAK--VYINDSVELSQ-NEQKLAAWLAKRGPISVAINAFG--MQFYRHGISRPLRPLCSPWLIDHAVLLVGYG--------------------------NRS-------DVPFWAIKNSWGTDWGEKGYYYLHRG-SGACGVNTMASS---- Homo_sapiens_CATH_P09668 -----------------LEKFH--FKSWMSKHRKTYSTE-EYHHRLQ-TFASNWRKINAHNNGN----HTFKMALNQFSDMSFAEIKHKYLWSEP----QNCSATK-----------------SNY--LRGTGPYPPSVDWRKKG------NFVSPVKNQGACGSCWTFSTTGALESAIAIATGKM------LSLAEQQLVDCA-------QDFNNH--GCQGGLPSQAFEYI-----------LYNK---GIMGEDTYPYQG---------------KDGY---CKFQ--------------PGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEV-TQDFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYG--------------------------EKNGIP-------YWIVKNSWGPQWGMNGYFLIERG-KNMCGLAACASY---- Homo_sapiens_CATK_P43235 ----------------ILDTH---WELWKKTHRKQYNNKVDEISRRL-IWEKNLKYISIHNLEASLGVHTYELAMNHLGDMTSEEVVQKMTGLKVPL--SHSRSND------------------TLYIPEWEGRAPDSVDYRKKG-------YVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKL------LNLSPQNLVDCV------SENDG-----CGGGYMTNAFQYV-----------QKNR---GIDSEDAYPYVG---------------QEES---CMYN--------------PTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSD--NLNHAVLAVGYG--------------------------IQKGNK-------HWIIKNSWGENWGNKGYILMARNKNNACGIANLASF---- Homo_sapiens_CATL_P07711 ----------------SLEAQ---WTKWKAMHNRLYGMN-EEGWRRA-VWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNR----KPRKGK------------------VFQE-PLFYEAPRSVDWREKG-------YVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRL------ISLSEQNLVDCS-------GPQGN--EGCNGGLMDYAFQYV-----------QDNG---GLDSEESYPYEA---------------TEES---CKYN--------------PKYSVANDTGF-VDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSS--EDMDHGVLVVGYGF----------------------ESTESDNNK-------YWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASY---- Homo_sapiens_CATO_P43234 --------------------RAPFTPTWPRSREREAAAFRESLNRHR-----------YLNSLFPSENSTAFYGINQFSYLFPEEFKAIYLRSKPSKFPRYSAE---------------------VHMSIPNVSLPLRFDWRDKQ-------VVTQVRNQQMCGGCWAFSVVGAVESAYAIKGKPL------EDLSVQQVIDCS-----------YNNYGCNGGSTLNALNWLN-------------KMQVKLVKDSEYPFKA------------------QNGLCHYFSG------------SHSGFSIKGYSAYDFSDQEDEMAKALLTFGPLVVIVDAVS--WQDYLGGIIQHHCSSG---EANHAVLITGFDK---------------------TGSTP------------YWIVRNSWGSSWGVDGYAHVKMG-SNVCGIADSVSS---- Homo_sapiens_CATS_P25774 ----------------TLDHH---WHLWKKTYGKQYKEKNEEAVRRL-IWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVP----SQWQRN------------------ITYKSNPNRILPDSVDWREKG-------CVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKL------VSLSAQNLVDCS------TEKYGN--KGCNGGFMTTAFQYI-----------IDNK---GIDSDASYPYKA---------------MDQK---CQYD--------------SKYRAATCSKYTELPYGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQ---NVNHGVLVVGYG--------------------------DLNGKE-------YWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSY---- Homo_sapiens_CATV_O60911 ----------------NLDTK---WYQWKATHRRLYGAN-EEGWRRA-VWEKNMKMIELHNGEYSQGKHGFTMAMNAFGDMTNEEFRQMMGCFRNQ----KFRKGK------------------VFRE-PLFLDLPKSVDWRKKG-------YVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKL------VSLSEQNLVDCS-------RPQGN--QGCNGGFMARAFQYV-----------KENG---GLDSEESYPYVA---------------VDEI---CKYR--------------PENSVANDTGFTVVAPGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSS--KNLDHGVLVVGYGF----------------------EGANSNNSK-------YWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASY---- Homo_sapiens_CATW_P56202 -------------------ELKEAFKLFQIQFNRSYLSPEEHAHRLD-IFAHN---LAQAQRLQEEDLGTAEFGVTPFSDLTEEEFGQLYGYRRAAGGVPSMGR--------------------EIRSEEPEESVPFSCDWRKVA------GAISPIKDQKNCNCCWAMAAAGNIETLWRISFWDF------VDVSVHELLDCG-----------RCGDGCHGGFVWDAFITV--------------LNNSGLASEKDYPFQGKV------------------RAHRCHPK------------KYQKVAWIQDFIMLQN-NEHRIAQYLATYGPITVTINMKP--LQLYRKGVIKATPTTCDPQLVDHSVLLVGFGSV------KSEEGIWA------ETVSSQSQ-PQPPHPTPYWILKNSWGAQWGEKGYFRLHRG-SNTCGITKFPLT---- Homo_sapiens_CATZ_Q9UBR2 ----------------------------------LYFRRGQTCYRPLRGDGLAPLGRSTYPRP--HEYLS-------------------------------------------------------------PADLPKSWDWRNVDGVNYAS-ITRNQHIPQYCGSCWAHASTSAMADRINIKRKGA---WPSTLLSVQNVIDCGN-----------AGSCEGGNDLS-VWDYAHQH------GIP-------DETCNNYQAK--------------DQECDKFNQCGTCNEFKE--CHAIRNYTLWRVGDYGSLSGR-----EKMMAEIYANGPISCGIMATER-LANYTGGIYAEYQDTTY---INHVVSVAGWGIS--------------------------DGTE-------YWIVRNSWGEPWGERGWLRIVTS-TYKDGKGARYNL---- Mus_musculus_CATB_P10605 --------------------------------KPSFHPLSDDLINYIN-KQNTTWQAGRNFYNVDISYLK-----KLCGTVLG----GPKLPGR--------------------------------VAFGEDIDLPETFDAREQ--WSNCP-TIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGR----VNVEVSAEDLLTCCGI---------QCGDGCNGGYPSGAWSFWTKK------GLVSGGVYNSHVGCLPYTIPPCEHHVNG-SRPPCTGEG-DTPRCNKSCEA----GYSPS-YKEDKHFGYTSYSVS--NSVKEIMAEIYKNGPVEGAFTVFSD-FLTYKSGVYKHEAGDMM---GGHAIRILGWGV--------------------------ENGVP-------YWLAANSWNLDWGDNGFFKILRG-ENHCGIESEIVA---- Mus_musculus_CATF_Q9R013 -------------------KMAPLFKDFMTTYNRTYESREEAQWRLT-VFARN---MIRAQKIQALDRGTAQYGITKFSDLTEEEFHTIYLNPLLQK-ESGRKMS------------------PAKS--IND-LAPPEWDWRK-K------GAVTEVKNQGMCGSCWAFSVTGNVEGQWFLNRGTL------LSLSEQELLDCD-----------KVDKACLGGLPSNAYAAI--------------KNLGGLETEDDYGYQGHV------------------QTCNFSAQ------------MAK--VYINDSVELSR-NENKIAAWLAQKGPISVAINAFG--MQFYRHGIAHPFRPLCSPWFIDHAVLLVGYG--------------------------NRS-------NIPYWAIKNSWGSDWGEEGYYYLYRG-SGACGVNTMASS---- Mus_musculus_CATH_P49935 -----------------IEKFH--FKSWMKQHQKTYSSV-EYNHRLQ-MFANNWRKIQAHNQRN----HTFKMALNQFSDMSFAEIKHKFLWSEP----QNCSATK-----------------SNY--LRGTGPYPSSMDWRKKG------NVVSPVKNQGACASCWTFSTTGALESAVAIASGKM------LSLAEQQLVDCA-------QAFNNH--GCKGGLPSQAFEYI-----------LYNK---GIMEEDSYPYIG---------------KDSS---CRFN--------------PQKAVAFVKNVVNITLNDEAAMVEAVALYNPVSFAFEV-TEDFLMYKSGVYSSKSCHKTPDKVNHAVLAVGYG--------------------------EQNGLL-------YWIVKNSWGSQWGENGYFLIERG-KNMCGLAACASY---- Mus_musculus_CATK_P55097 ----------------MLDTQ---WELWKKTHQKQYNSKVDEISRRL-IWEKNLKQISAHNLEASLGVHTYELAMNHLGDMTSEEVVQKMTGLRIPP--SRSYSND------------------TLYTPEWEGRVPDSIDYRKKG-------YVTPVKNQGQCGSCWAFSSAGALEGQLKKKTGKL------LALSPQNLVDCV------TENYG-----CGGGYMTTAFQYV-----------QQNG---GIDSEDAYPYVG---------------QDES---CMYN--------------ATAKAAKCRGYREIPVGNEKALKRAVARVGPISVSIDASLASFQFYSRGVYYDENCDRD--NVNHAVLVVGYG--------------------------TQKGSK-------HWIIKNSWGESWGNKGYALLARNKNNACGITNMASF---- Mus_musculus_CATL_P06797 ----------------TFSAE---WHQWKSTHRRLYGTN-EEEWRRA-IWEKNMRMIQLHNGEYSNGQHGFSMEMNAFGDMTNEEFRQVVNGYRHQ----KHKKGR------------------LFQE-PLMLKIPKSVDWREKG-------CVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKL------ISLSEQNLVDCS-------HAQGN--QGCNGGLMDFAFQYI-----------KENG---GLDSEESYPYEA---------------KDGS---CKYR--------------AEFAVANDTGF-VDIPQQEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSS--KNLDHGVLLVGYGY----------------------EGTDSNKNK-------YWLVKNSWGSEWGMEGYIKIAKDRDNHCGLATAASY---- Mus_musculus_CATM_Q9JL96 ----------------ILDVE---WQKWKIKYGKAYSLE-EEGQKRA-VWEDNMKKIKLHNGENGLGKHGFTMEMNAFGDMTLEEFRKVMIEIPVP----TVKKGK------------------SVQK-RLSVNLPKFINWKKRG-------YVTPVQTQGRCNSCWAFSVTGAIEGQMFRKTGQL------IPLSVQNLVDCS-------RPQGN--WGCYLGNTYLALHYV-----------MENG---GLESEATYPYEE---------------KDGS---CRYS--------------PENSTANITGF-EFVPKNEDALMNAVASIGPISVAIDARHASFLFYKRGIYYEPNCSS--CVVTHSMLLVGYGF----------------------TGRESDGRK-------YWLVKNSMGTQWGNKGYMKISRDKGNHCGIATYALY---- Mus_musculus_CATO_Q8BM88 --------------------RHGVAGTWSWSHQREAAALRESLHRHR-----------YLNS-FPHENSTAFYGVNQFSYLFPEEFKALYLGSKYAWAPRYPAE---------------------GQRPIPNVSLPLRFDWRDKH-------VVNPVRNQEMCGGCWAFSVVSAIESARAIQGKSL------DYLSVQQVIDCS-----------FNNSGCLGGSPLCALRWLN-------------ETQLKLVADSQYPFKA------------------VNGQCRHFPQ------------SQAGVSVKDFSAYNFRGQEDEMARALLSFGPLVVIVDAMS--WQDYLGGIIQHHCSSG---EANHAVLITGFDR---------------------TGNTP------------YWMVRNSWGSSWGVEGYAHVKMG-GNVCGIADSVAA---- Mus_musculus_CATP_Q9R014 ----------------KLDAE---WKDWKTKYAKSYSPK-EEALRRA-VWEENMRMIKLHNKENSLGKNNFTMKMNKFGDQTSEEFRKSIDNIPIP----AAMTDP------------------HAQN-HVSIGLPDYKDWREEG-------YVTPVRNQGKCGSCWAFAAAGAIEGQMFWKTGNL------TPLSVQNLLDCS-------KTVGN--KGCQSGTAHQAFEYV-----------LKNK---GLEAEATYPYEG---------------KDGP---CRYR--------------SENASANITDY-VNLPPNELYLWVAVASIGPVSAAIDASHDSFRFYNGGIYYEPNCSS--YFVNHAVLVVGYGS----------------------EGDVKDGNN-------YWLIKNSWGEEWGMNGYMQIAKDHNNHCGIASLASY---- Mus_musculus_CATR_Q9JIA9 ----------------SLDAE---WQDWKIKYNKSYSLK-EEKLKRV-VWEEKLKMIKLHNRENSLGKNGFTMKMNEFGDQTDEEFRKMMIEISVW----THREGK------------------SIMKREAGSILPKFVDWRKKG-------YVTPVRRQGDCDACWAFAVTGAIEAQAIWQTGKL------TPLSVQNLVDCS-------KPQGN--NGCLGGDTYNAFQYV-----------LHNG---GLESEATYPYEG---------------KDGP---CRYN--------------PKNSKAEITGF-VSLPQSEDILMAAVATIGPITAGIDASHESFKNYKGGIYHEPNCSS--DTVTHGVLVVGYGF----------------------KGIETDGNH-------YWLIKNSWGKRWGIRGYMKLAKDKNNHCGIASYAHY---- 'Mus musculus CATS NP_001254624' ----------------TLDYH---WDLWKKTHEKEYKDKNEEEVRRL-IWEKNLKFIMIHNLEYSMGMHTYQVGMNDMGDMTNEEILCRMGALRIP----RQSPKT------------------VTFRSYSNRTLPDTVDWREKG-------CVTEVKYQGSCGACWAFSAVGALEGQLKLKTGKL------ISLSAQNLVDCSN-----EEKYGN--KGCGGGYMTEAFQYI-----------IDNG---GIEADASYPYKA---------------TDEK---CHYN--------------SKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGIDASHSSFFFYKSGVYDDPSCTG---NVNHGVLVVGYG--------------------------TLDGKD-------YWLVKNSWGLNFGDQGYIRMARNNKNHCGIASYCSY---- Mus_musculus_CATW_P56203 -------------------ELKEVFKLFQIRFNRSYWNPAEYTRRLS-IFAHN---LAQAQRLQQEDLGTAEFGETPFSDLTEEEFGQLYGQERSPERTPNMTK--------------------KVESNTWGESVPRTCDWRKAK------NIISSVKNQGSCKCCWAMAAADNIQALWRIKHQQF------VDVSVQELLDCE-----------RCGNGCNGGFVWDAYLTV--------------LNNSGLASEKDYPFQGDR------------------KPHRCLAK------------KYKKVAWIQDFTMLSN-NEQAIAHYLAVHGPITVTINMKL--LQHYQKGVIKATPSSCDPRQVDHSVLLVGFG--------KKKEGMQT------GTVLSHSR-KRR-HSSPYWILKNSWGAHWGEKGYFRLYRG-NNTCGVTKYPFT---- Mus_musculus_CATZ_Q9WUU7 ----------------------------------LYFRSGQTCYHPIRGDQLALLGRRTYPRP--HEYLS-------------------------------------------------------------PADLPKNWDWRNVNGVNYAS-VTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGA---WPSILLSVQNVIDCGN-----------AGSCEGGNDLP-VWEYAHKH------GIP-------DETCNNYQAK--------------DQDCDKFNQCGTCTEFKE--CHTIQNYTLWRVGDYGSLSGR-----EKMMAEIYANGPISCGIMATEM-MSNYTGGIYAEHQDQAV---INHIISVAGWGVSN-------------------------DGIE-------YWIVRNSWGEPWGEKGWMRIVTS-TYKGGTGDSYNL---- Mus_musculus_CTS1_Q9JI84 ----------------NLDAE---WEEWKRSNDRTYSPE-EEKQRRA-VWEGNVKWIKQHIMENGLWMNNFTIEMNEFGDMTGEEM-KMLTESSSY----PLRNGK------------------HIQK--RNPKIPPTLDWRKEG-------YVTPVRRQGSCGACWAFSVTACIEGQLFKKTGKL------IPLSVQNLMDCS-------VSYGT--KGCDGGRPYDAFQYV-----------KNNG---GLEAEATYPYEA---------------KAKH---CRYR--------------PERSVVKVNRF-FVVPRNEEALLQALVTHGPIAVAIDGSHASFHSYRGGIYHEPKCRK--DTLDHGLLLVGYGY----------------------EGHESENRK-------YWLLKNSHGERWGENGYMKLPRGQNNYCGIASYAMY---- Mus_musculus_CTS2_Q9JI81 ----------------SLDSE---WQEWKRKFNKNYSME-EEGQKRA-VWEENMKLVKQHNIEYDQGKKNFTMDVNAFGDMTGEEYRKMLTDIPVP----NFRKKK------------------SIHQ-PIAGYLPKFVDWRKRG-------CVTPVKNQGTCNSCWAFSAAGAIEGQMFRKTGKL------VPLSTQNLVDCS-------RLEGN--FGCFKGSTFLALKYV-----------WKNR---GLEAESTYPYKG---------------TDGH---CRYH--------------PERSAARITSF-SFVSNSEKDLMRAVATIGPISVGIDARHKSFRLYREGIYYEPKCSS--NIINHSVLVVGYGY----------------------EGKESDGNK-------YWLIKNSHGEQWGMNGYMKLARGRNNHCGIASYAVY---- Mus_musculus_CTS3_Q9DAZ8 ----------------ILDAE---WQKWKIKYGKTYSLE-EEGQKRA-VWEENMKKIKLHNGENGLGKHGFTMEMNAFGDMTLEEFRKEMIEIPVP----TVKKGK------------------SVQK-RLSVNLPKFINWKKRG-------YVTPARTQIACNSCWAISVTGAIEGQMFRKTGQL------IPLSVQNLVDCV---------DGS---GCHAGSVLDSFKYL-----------MEKG---GLESEATYPYED---------------KQGS---CRYN--------------PENSTASITGF-EFIPNNEVDLMSAVASLGPISVVIDAWHESFLFYKRGIYYEPNCNNSLFALRHAVLLVGYGF----------------------IGRESEGRK-------YWIIKNSLGTKWGYKGYMKIAKDQGNHCGIASLPVF---- Mus_musculus_CTS6_Q9ET52 ----------------NLNAE---WHDWKKQYEKSYTME-EEGLRRA-IWEENMRMIKLHNWENSLGKNNFTLKMNEFGDLTPEELRKMMNNFPIW----SHKKRK------------------IIRKRAVGDVLPKFVDWRKKG-------YVTRVRRQKFCNSCWAFAVNGAIEGQMFKKTGKL------TPLSVQNLVDCT-------KTQGN--DGCQWGDPYIAYEYV-----------LNNG---GLEAEATYPYEG---------------KEGP---CRYN--------------PKNSKAEITGF-VSLPESEDILMEAVATIGPISAAVDASFNRFSFYDGGIYHQPNCSN--NTVNHAVLVVGYGT----------------------EGNETDGNK-------YWLIKNSWGRRWGIGGYMKIIRDQNNHCGIATYAHY---- Paragonimus_westermani_AAF21461 IELVSLPSNIELLGFRLPQNTSRLFEEFQRKFRKSYSS--DTAKRYA-LFKYN---LLKMQLIQRLEKGTANYGITKFSDLSAEEFRHSLANMKRRKSKGSQM---------------ETAIFP---TTIQS--LPPSFDWRA-N------GAVTEVKDQGMCGSCWAFATTGNIEGQWFRKTNKL------ISLSEQQLLDCD-----------TKDEACNGGLPEWAYDEIVK--------------MGGLMSEKDYPYEAMKEQS-----------------CHLRRP------------NIS--AYINGSATLPS-DEAKLAAWLVQNGPISVGVNANF--LQFYLGGISHPPHMLCSEAGLDHAVLLVGYG--------------------------VSTF-----LRRPYWIVKNSWGGGWGEKGYFRMYRGD-GTCGINADPTTSIIQ Paragonimus_westermani_AAW28151 CFVLIVSCAVAV-----PDSARELYEQFKRDYGKVYANE-DDQKRFA-IFKDN---LMRAQKLQLKDQGTARYGVTQFSDLTPEEFAAKYLSAPVNNDQVKR--------------------VRPTGLKAA----PERIDWRA-K------GAVTAVENQGSCGSCWAFSTAGNVEGQWFIKTGQL------VSLSKQQLVDCD-----------RAAQGCNGGWPASSYLEIMY--------------MGGLESESDYPYVGVEQT------------------CALNKE------------KLV--AKIDDSIVLGP-EEEDHAAYLAEHGPLSTLLNAVA--LQYYQSGVLKPTFEECPDTELNHAVLTVGYD--------------------------KEG-------DMPYWIIKNSWGTDWGEKGYFRLFRGD-CTCGINRMATSAIIK Paragonimus_westermani_AAY81946 CLVVVVGCSFAVNTVRVPDNARELYEQFKRDYGKAYANE-DDQKRFA-IFKDN---LVRAQQYQMQEQGTAKYGVTQFSDLTPEEFEAKYLGLRID-EQVDR--------------------VQLNDLQTA----PASVDWRE-K------GAVGPIENQGSCGSCWAFSVVGNIEGQWFLKTGYL------VSLSKQQLVDCD-----------TVDNGCYGGYPPYTYKEIKR--------------MGGLELQSDYPYTGWGHG------------------CRLDRS------------KLF--AKIDDSIVLEA-DEEKQAAWLAEHGPMSTCLNAKY--LQFYQSGILHPSKAMCSPEGLNHAVLTVGYD--------------------------TKH-------GIPYWIIKNSWGTSWGEDGYFRIYRGD-GTCGIDRLTTSAIIR Schistosoma_japonicum_AAW25775 PSIPRMPQNLEYLGFELPENVGEMYAQFKLTYRKQYHET-DNEKRFS-IFKSN---LLKAQLYQVLERGSAVYGVTPYSDLTTDEFSRTHLTAPWRASSKRNT------------------ISP--RREVGD--IPNNFDWRE-K------GAVTEVKNQGMCGSCWAFSTTGNIESQWFRKTGKL------LSLSEQQLVDCD-----------SLDDGCNGGLPSNAYESIIR--------------MGGLMLEDNYPYDAKNEK------------------CHLKVA------------NVA--AYINSSVNLTQ-DESELAIWLYHHSAISVGMNALL--LQFYRHGISHPWWIFCSKYLLDHAVLLVGYG--------------------------VSE------KNEPFWIVKNSWGVEWGEKGYFRMYRGD-GTCGINTDATSALIY Schistosoma_japonicum_CATB_P43157 ---------------------------------QRIEPLSDEMISFINEHPDAGWKADKSDRFHSLDDAR-----ILMGARKEDAEMKRNRRPT-------------------------------VDHHDLNVEIPSQFDSRKK--WPHCK-SISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGG----QSAELSALDLISCCK----------DCGDGCQGGFPGVAWDYWVKR------GIVTGGSKENHTGCQPYPFPKCEHHTKG-KYPACGTKIYKTPQCKQTCQK----GYKTP-YEQDKHYGDESYNVQ--NNEKVIQRDIMMYGPVEAAFDVYED-FLNYKSGIYRHVTGSIV---GGHAIRIIGWGV--------------------------EKRTP-------YWLIANSWNEDWGEKGLFRMVRG-RDECSIESDVVA---- Schistosoma_japonicum_CB_Q7Z1I6 --------------------------------KRMHQPLSKELIHFINYEANTTWKAGPTRRFKTVSDIR-----RMLGALPDPN--GEQLETLC------------------------------TGYELTLNELPKSFDARKE--WTHCP-SISEIRDQSSCGSCWAFGAVEAMSDRICIESKGK----YKPFLSAENLVSCCS----------SCGMGCNGGFPHSAWLYWKNQ------GIVTGDLYNTTNGCQPYEFPPCEHHTLG-PLPVCDGDV-ETPPCKRTCQA----GYNVS-YENDKWYGKVVYRVK--SNQEAIMKELMQHGPVEVDFEVYAD-FPNYKSGVYQHVSGALL---GGHAVRLLGWGE--------------------------ENNVP-------YWLIANSWNTDWGDNGYFKIIRG-KNECGIESDVNA---- Schistosoma_mansoni_CAA83538 ----------------QYDDI---WKQWKLKYNKTYS-DSNEIRRKA-IFMRYVEKIQQHNLRHDLGLEGYTMGLNQFCDMDWEEIKTIMLS-KVFG--NSPLWDD---------------KKEELELSNDP--LPSKWDWRDHG-------AVTPVKNQGLCGSCWAFSAAGAVEGQLVKKHKKL------ISLSEQQLVDCSY-------KYGND--GCQGGTMDQSFAYL-----------EKY----PIESEKDYKYIG---------------HDSS---CHFR--------------KSKGVVKVKKFVDLPARDEEKLQKALYHYGPISVAIDALDD-LILYKSGIYESKQCSS--FLLNHGVLAVGYG--------------------------RENR-KD------YWLIKNSWGTTWGMNGYFKLRRNKHNMCGIATNASFPLL- Schistosoma_mansoni_CB2_Q95PM1 --------------------------------KRMYQPLSMELINFINYEANTTWKAAPTTRFRTVSDIR-----RMLGALPDPN--GEQLETLC------------------------------TGYIS--DELPKSFDARVE--WPHCP-SISEIRDQSSCGSCWAFGAVEAMSDRICIKSKGK----HKPFLSAENLVSCCS----------SCGMGCNGGFPHSAWLYWKNQ------GIVTGDLYNTTNGCQPYEFPPCEHHVIG-PLPSCDGDV-ETPSCKTNCQP----GYNIP-YEKDKWYGEKVYRIH--SNPEAIMLELMRNGPVEVDFEVYAD-FPNYKSGVYQHVSGALL---GGHAVRLLGWGE--------------------------ENNVP-------YWLIANSWNSDWGDKGYFKIVRG-KNECGIESDVNA---- Schistosoma_mansoni_CYSP_P25792 ---------------------------------EKFEPLSDDIISYINEHPNAGWRAEKSNRFHSLDDAR-----IQMGARREEPDLRRKRRPT-------------------------------VDHNDWNVEIPSNFDSRKK--WPGCK-SIATIRDQSRCGSCWSFGAVEAMSDRSCIQSGGK----QNVELSAVDLLTCCE----------SCGLGCEGGILGPAWDYWVKE------GIVTASSKENHTGCEPYPFPKCEHHTKG-KYPPCGSKIYNTPRCKQTCQR----KYKTP-YTQDKHRGKSSYNVK--NDEKAIQKEIMKYGPVEASFTVYED-FLNYKSGIYKHITGEAL---GGHAIRIIGWGV--------------------------ENKTP-------YWLIANSWNEDWGENGYFRIVRG-RDECSIESEVIA---- Taenia_asiatica_CATL_B7XBA1 ----------------ELSRQ---WIGWKLQHGRVYSEK-EEAYRRG-IFARNLLYIKGQNRRFNAGLESYSTGLNQFADLESSEFSERFLG-TRP---ESRAAGKRG---------------RIWKALASAADLPDTVDWRDKN-------LVTEVKNQGNCGSCWAFSSTGALEGAFAKKTGKL------ISLSEQQLVDCS-------LKNGND--GCNGGYMSYAFKYL-----------EEH----SIEPESAYPYRA---------------TDGP---CRYN--------------ESLGVGTVTDIGDIPEGNETALMEAVATVGPISIAIDASSLGFMFYRHGIYKSHWCSS--KFLNHGVLAIGYG--------------------------KQEG-KP------YWLVKNSWGTRWGMKGYIMMAKDYHNMCGVASLADF---- Taenia_pisiformis_CESTL_F6MEN8 ----------------ELDQQ---WGGWKLQHGRTYSGK-EEAHRRS-VFARNLLYIKGQNRRFEAGLESYSTGLNQFADLELSEFTERFLG-TRP---ENRVAGKCG---------------RVWKALKSFADLPDTVDWRDKN-------LVTEVKNQGNCGSCWAFSSTGALEAALAKKTGKL------ISLSEQQLVDCS-------LKNGND--GCNGGYMSNAFKYL-----------EDH----SIEPESAYPYRA---------------TDGP---CRYN--------------ESLGVGTVTDIGEIPEGNETALMEAVATVGPISIAIDASSLGFMFYRHGIYKSHWCSS--KFLNHGVLAVGYG--------------------------KLDG-KP------YWLVKNSWGSGWGMKGYIMMAKDYHNMCGIASLADF---- Taenia_saginata_CATL_B7XBA0 ----------------ELSRQ---WIGWKLQHGRVYSEK-EEAYRRG-IFARNLLYIKGQNRRFNAGLESYSTGLNQFADLESSEFSERFLG-TRP---GSRAAGKRG---------------RIWKALASAADLPDTVDWRDKN-------LVTEVKNQGNCGSCWAFSSTGALEGAFAKKTGKL------ISLSEQQLVDCS-------LKNGND--GCNGGYMSYAFKYL-----------EEH----SIEPESAYPYRA---------------TDGP---CRYN--------------ESLGVGTVTDIGDIPEGNETALMEAVATVGPISIAIDASSLGFMFYRHGIYKSHWCSS--KFLNHGVLAIGYG--------------------------KQDG-KP------YWLVKNSWGTRWGMKGYIMMAKDYHNMCGVASLADF---- Taenia_solium_AAS00027 -----------LLTERELSRQ---WAGWKLQHGRVYSGK-EEAYRRG-VFARNLLYIKGQNRRFNAGLESYSTGLNQFADLESSEFSERFLG-TRP---ESRVAGR---------------RGRIWKALASAAGLPDTVDWRDKN-------LVTEVKNQGNCGSCWAFSSTGALEGAFAKKTGKL------ISLSEQQLVDCSL-------KNGND--GCNGGYMSYAFKYL-----------EEH----FIEPESAYPYRA---------------TDGP---CRYN--------------ESLGVGTVTDIGDIPEGNETALMEAVATVGPISIAIDASSLGFMFYRHGIYKSHWCSS--KFLNHGVLAIGYG--------------------------KQDG-KP------YWLVKNSWGTRWGMKGYIMMAKDYHNMCGVASLADFPYV- Taenia_solium_LongOrf.asmbl_1043 --------------------------------VALHEPLSSAIIDYVN-HINTTWRAEPSRRFTSPSQIR-----QQLGALPDPM--GRRLPLLY------------------------------SLSEENYKSLPASFDPRKK--WPNCK-TLFEIRDQGSCGSCWAFGAAEAMSDRLCIQQQTVNGRAEMVQLSADDLLSCCR----------DCGMGCNGGFPSQAWNFWKHE------GLVSGGLYGTKGVCRAYEIPPCEHHVNG-TRPPCEGDA-PTPKCKTVCQE----EYKIP-YKKDKHYASKVYSLH--SNEDAIKHELLTFGPVEADFEVYAD-FPTYKSGVYQHVSGALL---GGHAVKLMGWGE--------------------------EEGVP-------YWLCANSWNTDWGEGGFFKILRG-KNHCGIESDIVA---- Taenia_solium_LongOrf.asmbl_24242 ----------------------------------------------------------------------------------------------------------------------------------------------------------------------WAFSATGALEGQYKRKKGKL------ISLSEQQLVDCS-------RSEGNE--GCNGGWMDYAFQYW-----------MRN----GVESEKDYPYTA---------------RDGS---CKFN--------------PSKVITKVAKSVNVSEKSEEQLKISVAKVGPISVAIDASSQGFMFYKNGIFEDPSCSE--DDLDHGVLAVGYD--------------------------ADKARRN------YWIVKNSWGKQWGQEGYIWMARDKGNMCGIATMARY---- Taenia_solium_LongOrf.asmbl_24428 ----------------ELSRQ---WAGWKLQHGRVYSGK-EEAYRRG-IFARNLLYIKGQNRRFNAGLESYSTGLNQFADLESSEFSERFLG-TRP---ESRVAGRRG---------------RIWKALASAAGLPDTVDWRDKN-------LVTEVKNQGNCGSCWAFSSTGALEGAFAKKTGKL------ISLSEQQLVDCS-------LKNGND--GCNGGYMSYA---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Taenia_solium_LongOrf.asmbl_6319 ----------------NEAMT---LKHLWQMFLKEYNHT-PSFLSFR-NFESNVIKMWRHLMVS----SSFSVGINKFSALSPSQYR-KYLGYRPSV--EKLNLTIPR---------------HIYRKLARYAPLPKFVDWRLKG-------RVTGVKDQGHCGSCWAFSAIGAIEGQYQRASGNL------ISLSEQQLVDCS-------SSFGN--MGCNGGLMDNAFNYV-----------KKAG---GVNREEDYPYVSGVTE-----------QPDN---CSFK--------------ANISVAKVTGLVDITSGSDEELMEALAFNGPISVAINAGLTSFMMYHEGIYDDPECKGHLDDLNHGVLLAGYG--------------------------EQNGIP-------YWLIKNSWGTNWGEHGYVRIKRA-GNICGVATAASY---- Taenia_solium_Scaffold00002.gene342 -----------------------------SSKTPYQSDCFVDIIEYVNNKANTTWRAGENERFVDALTAK-----SQMGSLFNPI--GSTLPTKSF-----------------------------HLSSMQKAELPSEFDARIA--WPDCP-TIGEIRDQGTCGSCWAFGATEAMSDRICIHSEGK----EVVRISADDILSCCGF---------FCGFGCNGGLPESAWRYWARE------GIVSGGPYGSHVGCRPYEIPPCEHHTKG-ERPDCKGNS-RTPKCRRQCVE----SYDVE-YLTDKHFASNVYNVR--ASEEDIMKEIMVHGPVESDFIVYAD-FLTYKSGVYQHVKGGFL---GGHAVKILGWGE--------------------------ENGVP-------YWLCANSWNTDWGDGGFFKILRG-HNHCNIEADAQE---- Taenia_solium_Scaffold00009.gene1353 -------------------STDEEWLKWTREINISFESEAEAKYRHS-VWRKHYDMIQAHNGRKD---SLYTMGTNHFTHLEHWEFVEMYLRSKKIEIDNYGVNDYFEG----------NKTVRVNESIIEGDCYLENFDWTERF-------PNRKVRDQLSCGSCWAFASVSAVEWHWAIHYGKS------LSLSVQQLVDCV-----------QSNFGCNGGIIENALAYI-----------QRH----GIMLERDYPYRS------------------RVTQCAEN--------------PAKVTLKIKGYETLYGVSEAVLACIVQRVGPVVIGFDASGSGLQHYKSGIYDGTDCNG--DMLNHGLVLLGFG-------------------------TDEQGNR-------YWICQNSFSQRSATPIIPMV-------------------- Taenia_solium_Scaffold00115.gene6434 ----------------FLQSI---WRGWKITNNKAYPTLREERLRMR-IFISNYRFIRWHNQRYYLGLETYSTALNAFADLTLKEFAEKYLTLDQAP--IEVFWGDMS---------------TQYVEQPIHPHVPNYIDWRKKG-------LVTPIKDQGPCGSCWAFSATGALEGQYKKKKGEL------TSLSEQQLIDCS-------RSEGND--GCNGGYMDYAFQYW-----------MHN----GAESERDYPYTA---------------KDGT---CKFN--------------SSNVITNVAKFVKVPEKSEEQLKISVAKVGPISVGIDASSQGFMFYNDGIFQDPTCSE--DVLDHGVLVVGYN--------------------------ADKTRQK------YWIVKNSWGEQWGQEGYIWMARDKENMCGVATMASY---- Taenia_solium_Scaffold00212.gene8293 ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MSYAFKYL-----------EEH----FIEPESAYPYRA---------------TDGP---CRYN--------------ESLGVGTVTDIGDIPEGNETALMEAVATVGPISIAIDASSLGFMFYRHGIYKSHWCSS--KFLNHGVLAIGYG--------------------------KQDG-KP------YWLVKNSWGTRSTSEECINPLTWVAAVVLGGSVISL---- Trichobilharzia_regenti_CB2 --------------------------------KFMHQPLSSELIHFINHEANTTWKAAPSPRFKSVSDIR-----RMLGALPDPN--GGHLPTLC------------------------------TGYTPSLDELPKEFDARKY--WPHCP-SISEIRDQSSCGSCWAFGAVEAMSDRICIESKGL----HKPFLSAENLVACCS----------SCGMGCNGGFPHSAWSYWKRS------GIVTGDLYNPTDGCQPYEFPPCEHHVVG-PRPSCEGDV-ETPKCKTTCQP----GYNIP-YNKDKWYGKTVYRVH--SNQEAIMKEVKEHGPVEVDFEVYAD-FPNYKSGVYQHVSGGLL---GGHAVRLLGWGE--------------------------ENGVP-------YWLIANSWNSDWGDNGYFKIIRG-RNECGIESDVNA---- Trypanosoma_brucei_AAX80359 LAMAACLASVALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFR-AFEEN---MEQAK-IQAAANPYATFGVTPFSDMTREEFRARYRNGASYFAAAQKR--------------------LRKTVNVTTGRAPAAVDWRE-K------GAVTPVKDQGQCGSCWAFSTIGNIEGQWQVAGNPL------VSLSEQMLVSCD-----------TIDSGCNGGLMDNAFNWIVNS------------NGGNVFTEASYPYVSGNGEQ---------------PQCQMNGH------------EIG--AAITDHVDLPQ-DEDAIAAYLAENGPLAIAVDATS--FMDYNGGILTS----CTSEQLDHGVLLVGYN--------------------------DNS-------NPPYWIIKNSWSNMWGEDGYIRIEKGT-NQCLMNQAVSSAVVG ; END; [ The following blocks are output data for analysis step 13557 ] BEGIN TREES; TITLE Regulatory_Proteases_c1_v1; LINK TAXA = TaxaForAnalysisStep13557; TREE Fig._2 = [&R] (Homo_sapiens_CATO_P43234:0.103417,Mus_musculus_CATO_Q8BM88:0.181958,((((Homo_sapiens_CATH_P09668:0.109417,Mus_musculus_CATH_P49935:0.059967):0.447071,((((Taenia_solium_LongOrf.asmbl_24242:0.058023,(Taenia_solium_Scaffold00115.gene6434:0.10014,Echinococcus_multilocularis_CATL_Q0WYD8:0.156337):0.060882):0.354041,(((Taenia_saginata_CATL_B7XBA0:0.006763,Taenia_asiatica_CATL_B7XBA1:0.006713):0.013786,(Taenia_solium_LongOrf.asmbl_24428:0.006137,Taenia_solium_AAS00027:0.005667,Taenia_solium_Scaffold00212.gene8293:0.191025):0.012014):0.039369,Taenia_pisiformis_CESTL_F6MEN8:0.052702):0.377739):0.095845,(((((Homo_sapiens_CATK_P43235:0.057895,Mus_musculus_CATK_P55097:0.089565):0.273826,(Homo_sapiens_CATS_P25774:0.143359,'Mus musculus CATS NP_001254624':0.142824):0.206214):0.18613,((Homo_sapiens_CATL_P07711:0.116783,(Homo_sapiens_CATV_O60911:0.107798,Mus_musculus_CATL_P06797:0.168936):0.048544):0.100134,(((Mus_musculus_CATM_Q9JL96:0.077616,Mus_musculus_CTS3_Q9DAZ8:0.150496):0.175785,(Mus_musculus_CTS2_Q9JI81:0.23348,Mus_musculus_CTS1_Q9JI84:0.387774):0.063832):0.048303,((Mus_musculus_CATR_Q9JIA9:0.146193,Mus_musculus_CTS6_Q9ET52:0.19282):0.071087,Mus_musculus_CATP_Q9R014:0.284636):0.069816):0.193497):0.104807):0.084021,(Drosophila_melanogaster_CATL_Q95029:0.18739,Aedes_aegypti_EAT45919:0.149593):0.275928):0.114772,Taenia_solium_LongOrf.asmbl_6319:0.644476):0.093072,Taenia_solium_Scaffold00009.gene1353:1.024248):0.067402,(((Fasciola_hepatica_CATLL_Q24940:0.029422,Fasciola_gigantica_AAF44675:0.0398):0.07197,Fasciola_gigantica_AAF44676:0.056769):0.366284,Schistosoma_mansoni_CAA83538:0.434128):0.088711):0.089234):0.119312,((((((((Homo_sapiens_CATFF_Q9UBX1:0.003141,Homo_sapiens_AAD26616:0.00295):0.080679,Mus_musculus_CATF_Q9R013:0.098726):0.182436,'Danio rerio NP_001071036':0.220281):0.24396,(Schistosoma_japonicum_AAW25775:0.2586,Paragonimus_westermani_AAF21461:0.343056):0.158245):0.088606,(Caenorhabditis_elegans_AAB65956:0.294118,Brugia_malayi_AAT07059:0.426557):0.192601):0.087561,((Clonorchis_sinensis_AAP33049:0.243866,Clonorchis_sinensis_AAP33050:0.271345):0.200273,(Paragonimus_westermani_AAW28151:0.262881,Paragonimus_westermani_AAY81946:0.140507):0.129712):0.177555):0.115562,(Homo_sapiens_CATW_P56202:0.197139,Mus_musculus_CATW_P56203:0.168663):0.648888):0.091998,Trypanosoma_brucei_AAX80359:0.541003):0.202081):0.243098,((((Taenia_solium_LongOrf.asmbl_1043:0.031326,Echinococcus_multilocularis_CATL_E9RH13:0.024973):0.230475,(((Homo_sapiens_CATB_P07858:0.096769,Mus_musculus_CATB_P10605:0.130304):0.270247,'Drosophila melanogaster CATB NP_001259536':0.32392,((Schistosoma_mansoni_CYSP_P25792:0.122392,Schistosoma_japonicum_CATB_P43157:0.163317):0.269071,(Caenorhabditis_elegans_CPR6_P43510:0.361055,((Caenorhabditis_elegans_CPR3_P43507:0.336461,Caenorhabditis_elegans_CPR6_Q8MQC6:0.220475):0.164788,Caenorhabditis_elegans_CPR5_P43509:0.281565):0.280129):0.133838):0.190955):0.089376,Taenia_solium_Scaffold00002.gene342:0.34147):0.128222):0.208206,((Schistosoma_mansoni_CB2_Q95PM1:0.064632,Schistosoma_japonicum_CB_Q7Z1I6:0.065072):0.051955,Trichobilharzia_regenti_CB2:0.063299):0.117829):0.599209,((Homo_sapiens_CATZ_Q9UBR2:0.091617,Mus_musculus_CATZ_Q9WUU7:0.05986):0.337652,Caenorhabditis_elegans_CPZ1_G5EGP8:0.268162):0.850616):0.409527):0.622769); [! TreeBASE tree URI: http://purl.org/phylo/treebase/phylows/tree/TB2:Tr74413] END;