We use cookies to improve your experience. By continuing to browse this site, you accept our cookie policy.×
BenchmarkOpen Accesscc iconby iconnc iconnd icon

Specific N-terminal amino acids potentiate the periplasmic expression of single-chain variable fragments in Escherichia coli

    Yoshiro Hanyu

    *Author for correspondence:

    E-mail Address: y.hanyu@aist.go.jp

    Biomaterials Research Group, Health & Medical Research Institute, National Institute of Advanced Industrial Science & Technology (AIST), 1-1-1 Higashi, Tsukuba, 305-8566, Japan

    &
    Mieko Kato

    Department of Biochemistry, Bio-Peak Co., Ltd, 584-70 Shimonojo, Takasaki, 370-0854, Japan

    Published Online:https://doi.org/10.2144/btn-2022-0107

    Abstract

    Single-chain variable fragments (ScFvs) are important in therapy, diagnosis and research because of their elevated antigen affinity and low immunogenicity. At present, high-yield scFv expression in Escherichia coli is limited by insoluble aggregation in the reducing environment of the cytoplasm or low yields in the periplasm. Here we achieved increased expression of scFvs in the periplasm by inserting optimal amino acids between the signal peptide and scFv. We constructed an expression library with three random amino acids at the scFv N-terminus, screened this library with a single-step colony assay and identified the specific sequences that boosted periplasmic expression of scFvs.

    METHOD SUMMARY

    A library of single-chain variable fragments with three random amino acids at the N-terminus was constructed. The library was screened using a single-step colony assay. This simple and rapid screening method enabled the identification of specific sequences responsible for improved periplasmic expression, without false-positive clones.

    Antibodies are indispensable for therapy, diagnosis and research. Recombinant antibody fragments can be easily engineered and small, single-chain variable fragments (scFvs) have proven particularly advantageous [1,2]. Escherichia coli is the standard scFv expression host; however, the yield is low due to formation of inclusion bodies and cell lysis [3,4]. Various approaches have been attempted to improve yields, including fed-batch culture [5], a cold-shock promoter [6], a solubility-enhancing tag [7–9], host selection [10] and specific N-terminal sequences [11], yet none of these methods is sufficiently versatile in E. coli.

    Here we describe using a single-step colony assay [12,13] to efficiently screen scFvs whose N-terminal sequence enhanced periplasmic expression in E. coli. Three specific amino acid sequences capable of boosting periplasmic expression of scFvs were identified.

    A random library (Figure 1A) was constructed by inserting the degenerate codons (NNK)3, corresponding to three N-terminal amino acids, between the pelB signal peptide and anti-rabbit IgG scFv (A10B) [14]. The pET-22b expression vector containing the random library was then transformed into E. coli BL21 (DE3) cells.

    Figure 1. Schematic diagram of the procedure used in the single-step colony assay.

    (A) Schematic illustration of scFvs containing three additional N-terminal random amino acids used for cloning in pET-22b. White square: pelB leader; colored square: (NNK)3; white rectangles: scFv (A10B); gray square: His-tag; black circle: stop codon. (B) Outline of the single-step colony assay.

    HRP: Horseradish peroxidase; scFv: Single-chain variable fragment.

    The procedure used in the single-step colony assay to screen for clones with higher expression is depicted schematically in Figure 1B. A hydrophilic polyvinylidene difluoride filter and an antigen-coated nitrocellulose membrane were placed on agar plates. E. coli transformed with the library were grown to exponential phase and suspended in autoinduction solution containing glucose and lactose. The cells were then spread on the filter and cultivated until colonies appeared on the filter surface. ScFvs were automatically expressed by allolactose once glucose was depleted. The expressed scFvs diffused into the antigen-coated membrane, where those with affinity bound to the antigen beneath the colonies. The filter containing the colonies was removed, placed on a fresh plate and stored for later recovery of the clones. Antigen-bound scFvs on the membrane were detected by chemiluminescence using a horseradish peroxidase-conjugated anti-His antibody. The filter harboring the colonies was superimposed on the image with chemiluminescence results. Clones with stronger signals were identified and their reactivity against the antigen was measured by ELISA. Plasmids encoding scFvs were purified and their scFv and N-terminal sequences were read.

    Figure 2 shows a nitrocellulose membrane resulting from the single-step colony screening. E. coli were transformed with the scFv library, and 2 × 104 cells were spread on ten 15-cm plates. After incubation for 16 h at 30°C, 1.16 × 104 colonies were detected, corresponding to approximately 1000 colonies per plate. A10B scFvs with the three N-terminal amino acids obtained from each colony bound to the antigen on the nitrocellulose membrane and were detected by chemiluminescence. The library generated from (NNK)3 degenerate codons was estimated to contain 20 × 20 × 20 = 8 × 103 clones. We screened 1.16 × 104 clones, which was sufficient to cover the library size. Signal intensity, corresponding to scFv expression, varied between colonies. The 25 colonies with the highest intensities (indicated by arrow in Figure 2) were selected for further experiments.

    Figure 2. Nitrocellulose membrane image showing clones of single-chain variable fragments against rabbit IgG.

    Antigen binding of single-chain variable fragments to the membrane was detected by chemiluminescence. Scale bar = 5 mm.

    The 25 clones were picked from the upper filter and cultured overnight at 37°C in Luria–Bertani medium (Sigma-Aldrich, MO, USA) containing 50 μg/ml ampicillin. The resulting culture was inoculated in a 200-ml flask containing 20 ml Terrific Broth medium (Sigma-Aldrich) supplemented with 50 μg/ml ampicillin, and incubated at 37°C and 200 r.p.m. Once the optical density at 600 nm reached 0.5, expression was initiated by addition of isopropyl β-d-1-thiogalactopyranoside to a final concentration of 0.5 mM. The cells were incubated overnight at 26°C, harvested by centrifugation at 6000 × g and 4°C, washed with phosphate-buffered saline, and the periplasmic fraction from the pellet was extracted by cold osmotic shock.

    ScFv activity was measured by ELISA. Rabbit IgG was used to coat a 96-well ELISA plate, and the periplasmic fraction was applied. Antigen binding was detected using a horseradish peroxidase-conjugated anti-His-tag antibody. As shown in Figure 3, the signal from clones 7, 8 and 12 was more than 1.7-times higher than that of the control scFv without the N-terminal sequences.

    Figure 3. Screening of positive clones from the single-chain variable fragment library obtained by the single-step colony assay.

    ELISA results showing the reactivity of periplasmic extracts from 25 clones identified by the colony assay and A10B (control), labeled with C. Data represent the mean of three replicates; error bars represent the standard deviation.

    OD450: Optical density at 450 nm; scFv: Single-chain variable fragment.

    The DNA of each scFv from the 25 clones was sequenced. Every clone contained a complete scFv structure consisting of heavy and light chain variable domains plus a linker. The N-terminal amino acid sequences of the clones are listed in Table 1.

    Table 1. N-terminal sequences with the highest signal intensities based on ELISA results.
    Clone numberFirst amino acidSecond amino acidThird amino acidELISA signal (OD450)
    8ADN1.91
    12AEY1.82
    7GDN1.70
    1ADY1.63
    21GEK1.60
    23AQQ1.60
    20TDS1.55
    2GDK1.54
    5ADY1.33
    15AES1.31
    16GTK1.26
    17AQD1.26
    13DTN1.23
    4ADD1.23
    18GKY1.23
    24DNN1.15
    11TSD1.13
    3EDY1.13
    10TDS1.13
    9SKY1.13
    25GSY1.10
    14AKY1.09
    19NDD0.91
    6AES0.89
    22GSS0.89
    Control (A10B)0.79

    OD450: Optical density at 450 nm.

    Amino acids at each position were classified on the basis of their characteristics (Table 2). In the first position, most amino acids (68%) were either alanine (n = 10) or glycine (n = 7), indicating a bias toward smaller residues. In the second position, most amino acids (56%) were either glutamic acid or aspartic acid, suggesting a preference for residues with a negative charge. In the third position, all amino acids were either neutral or polar, with tyrosine being the predominant species (32%). No preference for charge or polarity was found at this position. The frequencies of amino acids at each position are listed in Table 3. Alanine in the first, glutamic acid in the second and tyrosine in the third position were the most frequent amino acids, with probabilities of 40, 40 and 32%, respectively.

    Table 2. Amino acid composition at each codon position of clones identified by the colony assay.
     Charge (+)Charge (−)HydrophilicHydrophobic
     DEKRSTYCNQHWGAVLIFp-valueM
    First amino acid21131710
    Second amino acid10433212
    Third amino acid435841

    –: None.

    Table 3. Frequency of amino acids at each codon position.
    Frequency (number)First amino acidSecond amino acidThird amino acid
    10AD
    9
    8
    7GY
    6
    5S
    4EN, D
    3TK, SK
    2DQ, T
    1E, N, SNQ

    –: None.

    For a quantitative comparison of scFv expression, the top three sequences (AND, AEY and GDN) and a synthetic sequence (ADY), comprising the most frequent amino acid at each position, were selected. The cells were cultured as described previously, and the scFvs in the periplasmic extract were purified using a Ni column. Cell growth, volumetric yield and relative yield were also analyzed (Table 4). Although the numbers of cells were almost identical in all cases, the volumetric yield was two-times larger and the relative yield (i.e., scFvs expressed per cell) was 2.2-times larger with ADY at the N-terminus. The other three sequences also presented expression levels two-times higher than that of the control. These results showed that specific amino acid sequences enhanced periplasmic scFv expression. Given that ADY achieved the highest yield, our screening was successful.

    Table 4. Comparison of cell growth and volumetric/relative yield for control A10B and A10B containing three additional amino acids at its N-terminus.
     A10B (control)ADYAND (clone 8)AEY (clone 12)GDN (clone 7)
    OD600 at harvest6.8 ± 0.86.5 ± 0.86.3 ± 0.77.0 ± 0.47.3 ± 0.07
    Volumetric yield (mg/l)0.45 ± 0.040.92 ± 0.110.80 ± 0.090.78 ± 0.080.83 ± 0.11
    Relative yield (mg/l/OD)0.0660.1420.1270.1110.114

    Estimates are based on OD600. Standard deviations were calculated from three independent experiments.

    OD600: Optical density at 600 nm.

    A single-step colony assay for antibody fragments was developed previously by our group to quickly and easily identify scFv clones with high affinity against antigens [12]. Here, the assay was used successfully to identify clones in a library of 1 × 104 clones. We used the degenerate codon NNK to generate a library with random sequences. This method can be easily applied in various contexts because it necessitates only primers with degenerate codons to introduce diversity [15]. In this study the probability of a stop codon being introduced by NNK was 9.0%, which was within the acceptable range. Screening with a longer random sequence could help identify a more effective sequence for periplasmic scFv expression. However, NNK is not suitable for longer sequences because the probability of a stop codon increases substantially with length, thereby interfering with the screening. To avoid this scenario, trimer oligos can be used to construct a highly diverse library [16].

    Here we identified specific N-terminal sequences that increased periplasmic expression. In the future, it will be important to confirm this effect on other scFvs and proteins, as well as to determine whether the identified sequence is valid for other signal sequences. This would help elucidate the mechanism underlying the observed potentiation.

    The second amino acid, after methionine, is important for the cytosolic expression of recombinant proteins. Specifically, alanine, cysteine, proline, serine, threonine or lysine at the +2 position may result in significantly higher (up to tenfold) expression of recombinant Ig-α, while methionine, histidine or glutamic acid may have the opposite effect [17]. Ojima-Kato et al. reported that N-terminal SKIK augmented the cytosolic expression of scFvs [11]. Kim et al. reported that N-terminal DDDDD improved the secretion of recombinant proteins [18,19]. The fusion of green fluorescent protein, which contains 15 negatively charged amino acids, to scFvs also improves secretion [20]. The effect of N-terminal sequences on secretion should be addressed in the future.

    In summary, we constructed a library with three random amino acids at the N-terminus of scFv, and screened this library using a simple and rapid single-step colony assay. Three specific amino acids capable of potentiating the periplasmic expression of scFvs were identified.

    Author contributions

    Y Hanyu designed the study and wrote the paper. M Kato performed the experiments and analyzed the data.

    Acknowledgments

    The authors would like to thank Editage (www.editage.com) for English-language editing, which was funded by the authors’ institution.

    Financial & competing interests disclosure

    The authors have no relevant affiliations or financial involvement with any organization or entity with a financial interest in or financial conflict with the subject matter or materials discussed in the manuscript. This includes employment, consultancies, honoraria, stock ownership or options, expert testimony, grants or patents received or pending, or royalties.

    Medical writing support was provided by Editage and was funded by the National Institute of Advances Industrial Science and Technology, Japan.

    Open access

    This work is licensed under the Attribution-NonCommercial-NoDerivatives 4.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-nd/4.0/

    References

    • 1. Ahmad ZA, Yeap SK, Ali AM, Ho WY, Alitheen NBM, Hamid M. ScFv antibody: principles and clinical application. Clin. Dev. Immunol. 2012, 980250 (2012).
    • 2. de Aguiar RB, da Silva T de A, Costa BA et al. Generation and functional characterization of a single-chain variable fragment (scFv) of the anti-FGF2 3F12E7 monoclonal antibody. Sci. Rep. 11(1), 1432 (2021).
    • 3. Sandomenico A, Sivaccumar JP, Ruvo M. Evolution of Escherichia coli expression system in producing antibody recombinant fragments. Int. J. Mol. Sci. 21(17), 6324 (2020).
    • 4. Schirrmann T, Al-Halabi L, Dübel S, Hust M. Production systems for recombinant antibodies. Front. Biosci. 13, 4576–4594 (2008).
    • 5. Nghia NH, Kumada Y, Kishimoto M, Horiuchi J. Effective production of single-chain variable fragment (scFv) antibody using recombinant Escherichia coli by DO-stat fed-batch culture. J. Biosci. Bioeng. 132(1), 56–63 (2021).
    • 6. Subedi GP, Satoh T, Hanashima S et al. Overproduction of anti-Tn antibody MLS128 single-chain Fv fragment in Escherichia coli cytoplasm using a novel pCold-PDI vector. Protein Expr. Purif. 82, 197–204 (2012).
    • 7. Ki MR, Pack SP. Fusion tags to enhance heterologous protein expression. Appl. Microbiol. Biotechnol. 104(6), 2411–2425 (2020).
    • 8. Paraskevopoulou V, Falcone FH. Polyionic tags as enhancers of protein solubility in recombinant protein expression. Microorganisms 6(2), 47 (2018).
    • 9. Kato M, Hanyu Y. Fusion of Zif268 to the C-terminus of Scfvs promotes expression of the active form in the cytoplasm of Escherichia coli. Biochem. Mol. Biol. J. 2(1), 1–7 (2016).
    • 10. Ahmadzadeh M, Farshdari F, Nematollahi L, Behdani M, Mohit E. Anti-HER2 scFv expression in Escherichia coli SHuffle®T7 Express cells: effects on solubility and biological activity. Mol. Biotechnol. 62(1), 18–30 (2020).
    • 11. Ojima-Kato T, Nagai S, Nakano H. N-terminal SKIK peptide tag markedly improves expression of difficult-to-express proteins in Escherichia coli and Saccharomyces cerevisiae. J. Biosci. Bioeng. 123(5), 540–546 (2017).
    • 12. Kato M, Hanyu Y. Single-step colony assay with autoinduction of scFv expression for the screening of antibody libraries. BioTechniques 66(4), 194–197 (2019).
    • 13. Kato M, Hanyu Y. Single-step colony assay for screening antibody libraries. J. Biotechnol. 255, 1–8 (2017).
    • 14. Shen Z, Stryker GA, Mernaugh RL, Yu L, Yan H, Zeng X. Single-chain fragment variable antibody piezoimmunosensors. Anal. Chem. 77(3), 797–805 (2005).
    • 15. Nie J, Ma X, Hu F et al. Designing and constructing a phage display synthesized single domain antibodies library based on camel VHHs frame for screening and identifying humanized TNF-α-specific nanobody. Biomed. Pharmacother. 137, 111328 (2021).
    • 16. Moutel S, Bery N, Bernard V et al. NaLi-H1: a universal synthetic library of humanized nanobodies providing highly functional antibodies and intrabodies. Elife 5, e16228 (2016).
    • 17. Bivona L, Zou Z, Stutzman N, Sun PD. Influence of the second amino acid on recombinant protein expression. Protein Expr. Purif. 74(2), 248–256 (2010).
    • 18. Kim SK, Park YC, Lee HH, Jeon ST, Min WK, Seo JH. Simple amino acid tags improve both expression and secretion of Candida antarctica lipase B in recombinant Escherichia coli. Biotechnol. Bioeng. 112(2), 346–355 (2015).
    • 19. Kim SK, Min WK, Park YC, Seo JH. Application of repeated aspartate tags to improving extracellular production of Escherichia coli L-asparaginase isozyme II. Enzyme Microb. Technol. 79–80, 49–54 (2015).
    • 20. Liu M, Wang B, Wang F et al. Soluble expression of single-chain variable fragment (scFv) in Escherichia coli using superfolder green fluorescent protein as fusion partner. Appl. Microbiol. Biotechnol. 103(15), 6071–6079 (2019).