![]() |
CCP4i: Graphical User Interface |
| MIR Tutorial Bath - Phased refinement |
![]() |
BACK TO INDEX | ![]() |
Each derivative should be carefully monitored for evidence of non-isomorphism, as this is a major source of difficulties during refinement; any changes in cell lengths relative to the native crystal should be at most » 0.5% (this will give » 15% mean intensity change at 3Å resolution). A plot (e.g. using SCALEIT) of mean isomorphous difference vs. resolution should decrease smoothly at the high resolution end. Note that if non-isomorphism is detected, it is not sufficient simply to apply a high resolution cutoff at the point of upturn in the plot; ideally the dataset should be discarded completely.
From the isomorphous difference Patterson, locate 1 or, even better, 2 major sites in your "best" derivative. Frequently what happens is that you collect data for many derivatives, but none of the Pattersons appear readily soluble. However, provided a derivative is not non-isomorphous it may still be usable for phasing. Then one day, the next Patterson is soluble: this is your best derivative! At this point you stop collecting data and start refining.
Do 5 cycles of phasing and refinement of the coordinates and site occupancies (initially set to 1) for this derivative. At this stage it is advisable to keep the overall scale and isotropic thermal parameter fixed (at the default values of 1 and 0 resp.), to fix the individual thermal parameters (e.g. 25), and to use a high resolution data cutoff (e.g. if the derivative data extends to 3Å, cutoff at 4.5Å). Also the anomalous data should not be included yet, and centric data only should be used in the refinement, provided the space group has more than one centric zone. Note that MLPHARE doesn't need initial estimates of RMS lack of closure errors.
#
mlphare HKLIN pbgd_fhscal HKLOUT ufmr <<EOD
CYCLES 5
LABIN FP=FNAT SIGFP=SIGFNAT -
FPH1=FUF SIGFPH1=SIGFUF
LABOUT ALLIN
PRINT AVF AVE
THRESH 2.5 .5
RESOL 20 4.5
CENTRI
DERIV UF
DCYCLE PHASE ALL REFCYC ALL
ATOM U 0.28 0.18 0.16 1 BFAC 25
ATREF X ALL Y ALL Z ALL OCC ALL
EOD
Using the reflection file output by MLPHARE, calculate a difference Fourier with the FFT program, and run a peak search program (e.g. PEAKMAX) on it, listing the highest 10 peaks. Examine the peak listing, and eliminate any symmetry-related peaks at the edges of the map. You will certainly see the sites that were input (even if they are wrong! - but at this stage you have to assume they are right). Take the remaining peaks in order one at a time, and check against the Patterson (e.g. with program RSPS or VECSUM), until a site is found or the list is exhausted.
# fft HKLIN ufmr MAPOUT ufmrdf <<EOD TITLE UF diff Fourier phased on UF derivative refined by MLPHARE. LABIN F1=FUF SIG1=SIGFUF F2=FNAT SIG2=SIGFNAT PHI=PHIB W=FOM EOD if ($status) exit peakmax MAPIN ufmrdf <<EOD NUMPEA 10 OUTPUT NONE EOD rm ufmrdf.map
Add any new site found to the input for MLPHARE, and repeat steps b and c, until no new sites are found. The statistics for each derivative output by MLPHARE on the final phasing cycle should always be checked. A reduction in RCullis (average P-weighted lack of closure divided by average isomorphous difference) is supposed to be a good validator of a new site.
#
mlphare HKLIN pbgd_fhscal HKLOUT ufmr <<EOD
CYCLES 5
LABIN FP=FNAT SIGFP=SIGFNAT -
FPH1=FUF SIGFPH1=SIGFUF
LABOUT ALLIN
PRINT AVF AVE
THRESH 2.5 .5
RESOL 20 4.5
CENTRI
DERIV UF
DCYCLE PHASE ALL REFCYC ALL
ATOM U 0.282 0.173 0.158 0.889 BFAC 25
ATREF X ALL Y ALL Z ALL OCC ALL
ATOM U 0.18731 0.17452 0.11287 .5 BFAC 25
ATREF X ALL Y ALL Z ALL OCC ALL
EOD
if ($status) exit
fft HKLIN ufmr MAPOUT ufmrdf <<EOD
TITLE UF diff Fourier phased on UF derivative refined by MLPHARE.
LABIN F1=FUF SIG1=SIGFUF F2=FNAT SIG2=SIGFNAT PHI=PHIB W=FOM
EOD
if ($status) exit
peakmax MAPIN ufmrdf <<EOD
NUMPEA 10
OUTPUT NONE
EOD
rm ufmr.mtz ufmrdf.map
Repeat step c, but this time calculate a cross-difference Fourier on another derivative.
# fft HKLIN ufmr MAPOUT uamrdf <<EOD TITLE UAc diff Fourier phased on UF derivative refined by MLPHARE. LABIN F1=FUAC SIG1=SIGFUAC F2=FNAT SIG2=SIGFNAT PHI=PHIB W=FOM EOD if ($status) exit peakmax MAPIN uamrdf <<EOD NUMPEA 10 OUTPUT NONE EOD rm ufmr.mtz uamrdf.map
Add the new derivative and its site to the input for MLPHARE and repeat steps b, c and d on the new derivative.
#
mlphare HKLIN pbgd_fhscal HKLOUT uamr <<EOD
CYCLES 5
LABIN FP=FNAT SIGFP=SIGFNAT -
FPH1=FUF SIGFPH1=SIGFUF -
FPH2=FUAC SIGFPH2=SIGFUAC
LABOUT ALLIN
PRINT AVF AVE
THRESH 2.5 .5
RESOL 20 4.5
CENTRI
DERIV UF
DCYCLE PHASE ALL REFCYC ALL
ATOM U 0.284 0.166 0.162 0.926 BFAC 25.000
ATREF X ALL Y ALL Z ALL OCC ALL
ATOM U 0.185 0.176 0.106 0.603 BFAC 25.000
ATREF X ALL Y ALL Z ALL OCC ALL
ATOM U 0.499 0.243 0.381 0.281 BFAC 25.000
ATREF X ALL Y ALL Z ALL OCC ALL
DERIV UAc
DCYCLE PHASE ALL REFCYC ALL
ATOM U 0.284 0.170 0.160 1 BFAC 25.000
ATREF X ALL Y ALL Z ALL OCC ALL
EOD
if ($status) exit
fft HKLIN uamr MAPOUT uamrdf <<EOD
TITLE UAc diff Fourier phased on UF & UAc derivatives refined by MLPHARE.
LABIN F1=FUAC SIG1=SIGFUAC F2=FNAT SIG2=SIGFNAT PHI=PHIB W=FOM
EOD
if ($status) exit
peakmax MAPIN uamrdf <<EOD
NUMPEA 10
OUTPUT NONE
EOD
rm uamr.mtz uamrdf.map
Repeat steps e and f for all remaining derivatives (at this stage no sites consistent with the Patterson could be found for the PCMBS derivative, so it was not included).
At this point any anomalous data should be brought into play, and it is first necessary to establish the "hand" of the heavy atoms. The easiest way to do this is to set all "anomalous occupancies" to zero, and using both centric and acentric data, do 10 cycles of phasing and refinement. The high resolution cutoff should be removed.
The anomalous occupancies should all refine positive if the hand is already correct, or negative if it is wrong, in which case the coordinates of all sites must be inverted (i.e. x,y,z becomes -x,-y,-z , which is equivalent to x,y,-z in point group 222).
If there is no consistent change in the anomalous occupancies, the anomalous data should be left out and brought in later; alternatively it may be possible to include anomalous data for only the higher occupancy derivatives.
#
mlphare HKLIN pbgd_fhscal HKLOUT allmr1 <<EOD
TITLE Refine 5 derivatives with anomalous occupancy = 0.
CYCLES 10
LABIN FP=FNAT SIGFP=SIGFNAT -
FPH1=FUF SIGFPH1=SIGFUF DPH1=DANUF SIGDPH1=SIGDANUF -
FPH2=FUAC SIGFPH2=SIGFUAC DPH2=DANUAC SIGDPH2=SIGDANUAC -
FPH3=FUS SIGFPH3=SIGFUS -
FPH4=FPTCL SIGFPH4=SIGFPTCL DPH4=DANPTCL SIGDPH4=SIGDANPTCL -
FPH5=FYBCL SIGFPH5=SIGFYBCL
LABOUT ALLIN
PRINT AVF AVE
THRESH 2.5 .5
DERIV UF
DCYCLE PHASE ALL REFCYC ALL
ATOM U 0.287 0.171 0.163 0.968 0 BFAC 25
ATREF X ALL Y ALL Z ALL OCC ALL AOCC ALL
ATOM U 0.495 0.238 0.380 0.547 0 BFAC 25
ATREF X ALL Y ALL Z ALL OCC ALL AOCC ALL
ATOM U 0.186 0.187 0.109 0.615 0 BFAC 25
ATREF X ALL Y ALL Z ALL OCC ALL AOCC ALL
ATOM U 0.986 0.051 0.510 0.314 0 BFAC 25
ATREF X ALL Y ALL Z ALL OCC ALL AOCC ALL
DERIV UAc
DCYCLE PHASE ALL REFCYC ALL
ATOM U 0.188 0.186 0.110 0.974 0 BFAC 25
ATREF X ALL Y ALL Z ALL OCC ALL AOCC ALL
ATOM U 0.292 0.170 0.155 0.837 0 BFAC 25
ATREF X ALL Y ALL Z ALL OCC ALL AOCC ALL
ATOM U 0.486 0.235 0.370 0.829 0 BFAC 25
ATREF X ALL Y ALL Z ALL OCC ALL AOCC ALL
DERIV US
DCYCLE PHASE ALL REFCYC ALL
ATOM U 0.180 0.195 0.111 0.625 BFAC 25
ATREF X ALL Y ALL Z ALL OCC ALL
DERIV Pt
DCYCLE PHASE ALL REFCYC ALL
ATOM PT 0.254 0.042 0.589 0.910 0 BFAC 25
ATREF X ALL Y ALL Z ALL OCC ALL AOCC ALL
DERIV Yb
DCYCLE PHASE ALL REFCYC ALL
ATOM YB 0.493 0.234 0.381 0.413 BFAC 25
ATREF X ALL Y ALL Z ALL OCC ALL
Using the newly determined phases, repeat the difference Fourier and peak search for each derivative, adding any new sites found. Repeat the phasing and refinement, including the overall scale and thermal parameter, and possibly the individual thermal parameters, though it is often found that these tend to be unstable.
#
mlphare HKLIN pbgd_fhscal HKLOUT allmr2 <<EOD
TITLE Refine all derivatives (hand inverted with z changed to 1-z).
CYCLES 10
LABIN FP=FNAT SIGFP=SIGFNAT -
FPH1=FUF SIGFPH1=SIGFUF DPH1=DANUF SIGDPH1=SIGDANUF -
FPH2=FUAC SIGFPH2=SIGFUAC DPH2=DANUAC SIGDPH2=SIGDANUAC -
FPH3=FUS SIGFPH3=SIGFUS -
FPH4=FPTCL SIGFPH4=SIGFPTCL DPH4=DANPTCL SIGDPH4=SIGDANPTCL -
FPH5=FYBCL SIGFPH5=SIGFYBCL -
FPH6=FPCMBS SIGFPH6=SIGFPCMBS DPH6=DANPCMBS SIGDPH6=SIGDANPCMBS
LABOUT ALLIN
PRINT AVF AVE
THRESH 2.5 .5
DERIV UF
DCYCLE PHASE ALL REFCYC ALL KBOV ALL
ATOM U 0.287 0.172 0.837 0.811 0.638 BFAC 13.881
ATREF X ALL Y ALL Z ALL OCC ALL AOCC ALL B ALL
ATOM U 0.494 0.240 0.619 0.564 0.425 BFAC 12.593
ATREF X ALL Y ALL Z ALL OCC ALL AOCC ALL B ALL
ATOM U 0.185 0.187 0.892 0.607 0.476 BFAC 15.953
ATREF X ALL Y ALL Z ALL OCC ALL AOCC ALL B ALL
ATOM U 0.988 0.052 0.489 0.437 0.460 BFAC 65.797
ATREF X ALL Y ALL Z ALL OCC ALL AOCC ALL B ALL
DERIV UAc
DCYCLE PHASE ALL REFCYC ALL KBOV ALL
ATOM U 0.186 0.186 0.890 0.912 0.759 BFAC 19.199
ATREF X ALL Y ALL Z ALL OCC ALL AOCC ALL B ALL
ATOM U 0.289 0.172 0.845 0.732 0.537 BFAC 16.749
ATREF X ALL Y ALL Z ALL OCC ALL AOCC ALL B ALL
ATOM U 0.488 0.239 0.627 0.755 0.593 BFAC 17.873
ATREF X ALL Y ALL Z ALL OCC ALL AOCC ALL B ALL
DERIV US
DCYCLE PHASE ALL REFCYC ALL KBOV ALL
ATOM U 0.181 0.195 0.889 0.571 BFAC 34.773
ATREF X ALL Y ALL Z ALL OCC ALL B ALL
DERIV Pt
DCYCLE PHASE ALL REFCYC ALL KBOV ALL
ATOM PT 0.252 0.044 0.409 0.878 0.871 BFAC 33.104
ATREF X ALL Y ALL Z ALL OCC ALL AOCC ALL B ALL
DERIV Yb
DCYCLE PHASE ALL REFCYC ALL KBOV ALL
ATOM YB 0.493 0.237 0.619 0.458 BFAC 19.003
ATREF X ALL Y ALL Z ALL OCC ALL B ALL
DERIV PCMBS
DCYCLE PHASE ALL REFCYC ALL KBOV ALL
ATOM HG 0.245 0.060 0.145 0.271 0.252 BFAC 52.205
ATREF X ALL Y ALL Z ALL OCC ALL AOCC ALL B ALL
ATOM HG 0.074 0.064 0.154 0.326 0.311 BFAC 47.182
ATREF X ALL Y ALL Z ALL OCC ALL AOCC ALL B ALL
EOD
Repeat step i until there is no further change in the list of sites. Note that once a good derivative is well-refined and there are obviously no new sites to be found, its refinement flags can be switched off, and refinement performed on only the weaker derivatives. The printed "refinement parameters" indicate the progress of convergence of refinement for each derivative.
Note that in the above procedure, only the Patterson for the first (and best) derivative needs to be solved; the other derivatives are solved from the difference Fouriers, and the Pattersons, which are often difficult to solve ab initio, are then only used to cross-check the new sites. This also obviates the problem of ensuring that all derivatives are solved relative to the same origin and on the same hand. Of course, if more than one Patterson can be solved independently, so much the better, but then difference Fouriers must still be used to correlate the origins and the hand.
![]() |
BACK TO INDEX | ![]() |