Practical aspects of SAD/MAD. Judit É Debreczeni

Practical aspects of SAD/MAD Judit É Debreczeni

anomalous scattering Hg sinθ/λ CuKα 0 Å - 0.4 Å - 0.6 Å - Å.5Å 0.83Å f 80 53 4 f total (θ, λ) f(θ) + f (λ) + if (λ) f f -5 8-5 8-5 8 f total increasing with increasing resolution if diffraction strong enough Anomalous Δ/ for vitamin B: (overall /sigma: 39.0, highest res. shell:.9) Anomalous signal/noise ratios (.0 is random). The first line is based on input sigmas, the second on variances of + and - (if not already averaged): nf - 8.0-6.0-5.0-4.0-3.5-3.0 -.5 -.0 -.5 -.3 -. - 0.9 A 3.67 7.03 3.46 0.89 9.70 4.54 7.43 6.94 0.8 5.8 0.7 5.06 3.5 0..44 4.0.5 8.55.90 9.50.76 4.73 9.35 5.07

SAD/MAD phasing ± T + f + f f f A + T A cosα ± T A sinα f f f unknowns: T, A and α MAD: overdetermined SAD: approximations. α 90 if + > - 70 if + < -. T / ( + + - ) assuming A << T and f', f"<< f Δ f f " ' A sinα

SAD phasing - approximations approximations increased noise in Patterson function importance of E-value selection (example: insulin S-SAD) error caused by approximations ideal A ideal Δ measurement error measured Δ

predicting/assessing anomalous signal where is the limit? are our limits theoretical or practical? predicting success estimate from the number and type of anomalous scatterers Δ/ (>.0 in XPREP, or 0.798) CC between independent datasets (>30%) overall / scala, truncate DANO + / - in XDS etc.

predicting/assessing anomalous signal f" @ CuK α (e - ) Ratio of anomalous differences (%) O 0.03 0.75 P 0.433.9 S 0.556.4 Zn 0.678.6 Cl 0.70.68 Ca.85.85 e 3.97 6.83 U 3.409 3.76 Δ i i n f n f i i " i i anomalous scatterers in a 50-membered polyala the resulting anomalous differences from S are of the order of -% for normal proteins

Δ / vs. CC independent measurements error variables: - have a zero mean and the same s.u. - are uncorrelated with each other and with the signal M M S + E S + E the correlation coefficient between two independent measurements: CC Ε( M M D( M ) Ε( M )D( M ) Ε( M ) ) if M and M are equivalent and (equal mean and s.u.) CC Ε( S D( S) ) Ε( S) + D( E )

Δ / vs. CC SAD signal (Δ ) follows the distribution: ( ) Δ P Δ exp Σπ Σ measurement errors: normal distribution (zero mean and st. dev.) Σ N j if N anomalous scatterers are randomly distributed in the asymmetric unit. ( A follows an acentric Wilson distribution.) f j CC Σ Σ + + Σ Σ Δ Ε Σ + π π + Σ

Δ / vs. CC CC Δ Ε π Δ + Ε π CC can be predicted from Δ / no upper limit for Δ / 0.9 0.8 0.7 Δ /.0 when CC 30% (data truncation point) CC CC 0.6 0.5 0.4 0.3 0. 0. CC calc CC cins CC otrp CC ttrp CC iod-thau CC rins CC elas Δ / 0.79 when Σ/ 0 (no signal) 0 0 3 4 5 6 Δ/ Δ /

Δ / vs. / 4 Δ + Δ Ε π π and if Δ<< and + - Gauss error propagation ( ) ( ) + + + + Δ Δ Δ Δ Δ Δ Δ Δ Σ π π π Σ + Σ + Δ Ε the anomalous difference is calculated as previously shown that

Δ / vs. / Δ Ε + Δ π 4 π all f are measurable given / sufficiently large estimated Δ / required redundancy can be estimated before collecting a complete dataset (/ ~ redundancy) number of HA sites can be estimated (e.g. HA soaks)

Δ / vs. / GGPS Se-SAD.45 Δ / Δ /.35.5.5.05 0.95 Δ Ε + Δ π 4 π Observed Prediction linear at high / slope: experimental Δ/ (occupancy, B- factor, anisotropy of heavy atoms) 0.85 0.75 0 5 0 5 0 5 30 35 40 / y-intercept: ~0.8 / square root type function if lower /

limits? λ0.7å (MoKα) f' 0. f" 0. Δ/0.7% cubic (Zn-free) insulin Mo sealed tube overall redundancy: 3 R int : 5.4%, R sigma :.7% Anomalous Δ/ (for the whole dataset): Anomalous signal/noise ratios (.0 is random). The first line is based on input sigmas, the second on variances of + and - (if not already averaged): nf - 8.0-6.0-5.0-4.0-3.5-3.0 -.7 -.5 -.3 -. -.9 -.7 A 3.8 3.8.77.6.66.67.4.48.9.36..0 4.3 3.57.9.3.48.54.36.40.9.33.3.6

why SAD? absorption edge experimentally demanding air absorption (He purged beam path) strong absorption by the crystal, loop,... (scaling) radiation damage limited resolution

pseudo-mad λ.54å (CuKα) f' 0.3 f" 0.56 λ0.7å (MoKα) f' 0. f" 0. cubic insulin at two wavelengths Δf'0. dispersive signal sensitive to radiation damage measurement technique might be useful for eg. Cd Xe Correlation between Mo and Cu: Anomalous correlation coefficients (%) against previous datasets nf - 8.0-6.0-5.0-4.0-3.5-3.0 -.7 -.5 -.3 -. -.9 -.7 A 95.0 93.5 88.6 88. 56.7 55.7 47.7 35. 36.3 0.0.0-0.4

initial phases 70 60 50 40 30 WMPE MapCC SHELXE, heavy atoms included, no density modification 0 0 0 Cu-quarter Cu-half Cu-full λ f" redundancy CC CCweak.54Å 0.56 3. 39.46 0.5.54Å 0.56 6.5 54.38 9.53.54Å 0.56 3 56.50 8.8 insulin S-SAD

initial phases 70 60 50 40 30 WMPE MapCC SHELXE, heavy atoms included, no density modification 0 0 0 Mo Anka Cu-quarter WMPE and MapCC do not depend on f λ f" redundancy CC CCweak 0.7Å 0. 0 36.6 0.05.00Å 0.7 6 4.87 3.05.54Å 0.56 3. 39.46 0.5 insulin S-SAD

initial phases substructure initial phases density improvement autotracing what is the signal good enough for? SHELXD/E evolution: viscotoxin A3 00 - SHELXD: supersulfurs SHELXE: initial phases no interpretable map MR based upon super-s positions 003 - SHELXD: DSUL SHELXE: phases good enough for autotracing (arp) after NCS averaging in DM 006 - SHELXE: autotracing density modification recycled with autotracing /3 of the molecule traced???? - NCS...?

use of anomalous signal SAD/MAD etc equipment testing inverse ourier - anomalous map - confirm biology molecular replacement: -phased molecular replacement - locate molecules with heavy atom sites phased refinement - iffy initial model

acknowledgements Göttingen George M. Sheldrick Gábor Bunkóczi many members of the group Delft Leo Straver Anita Coetzee Bram Schierbeek