ClustalW Guide

 

Purpose: A multiple sequence alignment program for DNA or proteins. Can match up to 500 sequences.

 

Instructions:

 

1)      Go to http://www.ebi.ac.uk/clustalw/

2)      Put in your e-mail (if you want the results e-mailed to you – not necessary) and an alignment title of your choice.

3)      Paste sequences in the box below using the following format(s):

a.      >Name of Sequence #1

aaatatatatatattggatgagaccctgatc

 

>Name of Sequence #2

aattccctctctctcgggtc

                                                               i.      The sequences can have gaps and numbers but do not allow any spaces before inputting the first character.

                                                             ii.      Every sequence MUST have a different name given to it or the alignment will not work.

- OR -

b.      Upload a file that includes all your sequences (such as a .doc file) in an acceptable format (see above)

4)      Press run button to start alignment reading.

5)      When viewing your results, these are the consensus symbols used by ClustalW:

a.      "*" means that the residues or nucleotides in that column are identical in all sequences in the alignment.

b.      ":" means that conserved substitutions have been observed.

c.      "." means that semi-conserved substitutions are observed.

6)      If you would like to see your results in color, push the button that displays Show Colors. Click Hide Colors to get rid of color. A table for the color code is shown below.

7)      Click on the button named View Alignment File to see the alignment on a larger scale (ie bigger font).

 

AVFPMILW      RED             Small (small+ hydrophobic (incl.aromatic -Y))

DE                    BLUE           Acidic

RHK                  MAGENTA   Basic

STYHCNGQ          GREEN          Hydroxyl + Amine + Basic - Q

Others          Gray      

 

If you have any questions, the EBI Help site is immensely helpful. You can find that here: http://www.ebi.ac.uk/clustalw/clustalw_frame.html