EMAN2/TomoSymmetry

Symmetric Particles in Subtomogram Averaging (new pipeline)

Symmetric biomolecules really aren't (symmetric). At some limiting resolution, the symmetry will always be broken. Consider 3 examples:

small rigid enzyme - symmetry breaking might be at the level of independent individual sidechain motion, may only impact at 3-5 Å resolution
C4 ion channel - Each of the 4 subunits will somewhat independently bind modulators or be otherwise conformationally variable, while maintaining overall symmetry, may impact at the level of 6-12 Å resolution
C8 nuclear pore complex - While ostensibly 8-fold (some parts D8) symmetric, in reality, the variations in separation of the inner and outer nuclear membranes, interactions with substrates, etc, require significant accommodations by the complex, resulting in large 20-40 Å resolution symmetry breaking

In canonical single particle reconstruction, you would simply impose the specified symmetry and hope for the best. Clearly, however, the structure resulting from a symmetrized average of non-identical or mispositioned components will not result in the best possible subunit resolution. We instead need a strategy to look at a single subunit at a time, and refine that, characterizing/classifying not entire symmetric particles, but on an individual subunit basis.

EMAN2 has 2 primary methods to permit this in the new subtomogram averaging pipeline (using e2spt_refine_new and related programs):

Approach 1 - re-extract particles centered on each oriented subunit

In this approach you begin with a normal symmetry-imposed refinement. Then, when you've done the best you can with the normal refinement, you use the resulting per-particle orientations to extract a new set of particles for each of the N subunits. ie - if you have N C4 particles, you will produce 4N smaller particles with no symmetry, and known initial orientations. These orientations could then be further refined or the 4N particles could be classified to produce self-similar subsets.

Run a normal refinement following the tutorial. Be sure to use e2spt_refine_new when presented with the option of the older programs.
The refinement will produce aliptcls3d_XX.lst and aliptcls2d_XX.lst files containing the particles with per-particle alignment data. The 3-D alignments are required for extraction of particles from the original tilt series.
Re-extract the particles using one of these files, but impose symmetry and optionally recenter on a subunit. For example, if we wish to re-extract the raw tilt series with each particle copied 8 times (C8 symmetry) with the appropriate transformation on each particle:

e2spt_extract.py --boxsz_unbin=320 --label=particles_00 --newlabel=reextract --threads=24 --padtwod=2.0 --alltomograms --tltkeep=1.0 --jsonali spt_01/aliptcls3d_04.lst --postxf=c8,0,0,0

However, the above command will keep the particles at their original size (320x320 pixels), including the density for the full particle in each subtilt. Normally you might prefer to re-extract a smaller box size with each of the 8 asymmetric units centered in the box. To accomplish this, we reduce boxsz_unbin and include a 3D translation in the --postxf option. The translation should be from the center of the box to where you wish the center of one subunit to be in the re-extracted particles. Each of the (in this case) 8 copies will be rotated to the position of the target subunit, then the smaller box will be centered on that point. The translation is specified as tx,ty,tz after the symmetry in --postxf, and --boxsz_unbin has been reduced:

e2spt_extract.py --boxsz_unbin=160 --label=particles_00 --newlabel=post40-mono2 --threads=24 --padtwod=2.0 --alltomograms --shrink=2.0 --tltkeep=1.0 --jsonali spt_40/aliptcls3d_01.lst --compressbits=8 --postxf=c8,72,0,0

When you run a refinement on the extracted smaller box-size, make sure to provide an appropriate initial model with the same translation and box size reduction as used above. Note that no "subtraction" is done to eliminate the adjacent subunits in the extracted particles. So using a --maskalign in your subsequent e2spt_refine_new runs is a good idea. The mask.cylinder processor can be quite useful in generating suitable alignment masks for C and D symmetric objects. It is a good idea to give the mask a soft edge either by low pass filtering it (Gaussian blur) or using the tri parameters in mask.cylinder. e2filtertool.py is a very useful tool when fine tuning the parameters of such masks.

Approach 2 - replicate each particle N times and align with a mask

note about "subtraction" methods - In the mid 2000s, EMAN2 offered an experimental approach for single particle analysis (which still exists) for not just masking out the individual subunits from symmetric particles, but subtracting away the density for the other subunits. That is, if you had a C4 particle, when extracting a subunit, you would subtract away the density for 3 of the 4 subunits, ostensibly leaving behind only the density for the subunit you wish to consider. This idea later propagated into Relion where it is considered a standard method there. However, there is a logical flaw of sorts in this scheme. If the macromolecule was perfectly C4 symmetric, it would, indeed, be possible to subtract away the other subunits. However, in that case, there would be no reason to do so, since a C4 refinement would yield optimal results. If the symmetry is broken then any subtraction we do will leave behind artifacts wherever the symmetry is broken, and, indeed, these artifacts are the only thing which should interfere with the correct alignment of the particle in the first place, so in the end, subtraction doesn't really accomplish anything very useful, with some minor caveats.