TY - JOUR
T1 - Uncovering position-specific patterns in codon and codon-pair usage in candidate genes associated with blood coagulation diseases
AU - Clement, Nathan J.
AU - Hamasaki-Katagiri, Nobuko
AU - Lin, Brian
AU - Komar, Anton A
AU - DiCuccio, Michael
AU - Bar, Haim
AU - Kimchi-Sarfaty, Chava
PY - 2025/12/1
Y1 - 2025/12/1
N2 - Current strategies for optimizing gene therapeutics and recombinant protein production typically rely on universal host codon usage indices. However, there is a growing shift toward incorporating gene-specific traits to enhance therapeutic characteristics. In this study, we investigate position-specific variations in codon and adjacent codon-pair usage biases (CPUBs), offering potential for more tailored gene engineering approaches. We focus our analysis on the coding sequences of four coagulation factors: ADAMTS13, von Willebrand factor, factor VIII, and factor IX, which have been used in therapeutic applications. By aligning transcript homologs with human sequences for each gene using Discontiguous Megablast and MACSE, we assess “sequence-position-specific” codon and CPUBs; 157 homologous sequences for ADAMTS13, 148 for F8, 96 for F9, and 202 for VWF. Species with homologs ranged from Primates and Artiodactyla (Even-toed Ungulates) to Testudines. Statistically significant, position-specific positive CPUBs were observed that contrasted with conventional, alignment-specific negative CPUBs. Moreover, we observed that codon and codon-pair usages are highly associated at sequence positions despite little or no association in conventional-position-agnostic analyses. The distinct biases observed at different positions/functionally critical domains in coding sequences highlight the importance of considering position-specific effects in codon optimization strategies.
AB - Current strategies for optimizing gene therapeutics and recombinant protein production typically rely on universal host codon usage indices. However, there is a growing shift toward incorporating gene-specific traits to enhance therapeutic characteristics. In this study, we investigate position-specific variations in codon and adjacent codon-pair usage biases (CPUBs), offering potential for more tailored gene engineering approaches. We focus our analysis on the coding sequences of four coagulation factors: ADAMTS13, von Willebrand factor, factor VIII, and factor IX, which have been used in therapeutic applications. By aligning transcript homologs with human sequences for each gene using Discontiguous Megablast and MACSE, we assess “sequence-position-specific” codon and CPUBs; 157 homologous sequences for ADAMTS13, 148 for F8, 96 for F9, and 202 for VWF. Species with homologs ranged from Primates and Artiodactyla (Even-toed Ungulates) to Testudines. Statistically significant, position-specific positive CPUBs were observed that contrasted with conventional, alignment-specific negative CPUBs. Moreover, we observed that codon and codon-pair usages are highly associated at sequence positions despite little or no association in conventional-position-agnostic analyses. The distinct biases observed at different positions/functionally critical domains in coding sequences highlight the importance of considering position-specific effects in codon optimization strategies.
UR - https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=105023880303&origin=inward
UR - https://www.scopus.com/inward/citedby.uri?partnerID=HzOxMe3b&scp=105023880303&origin=inward
U2 - 10.1093/nargab/lqaf169
DO - 10.1093/nargab/lqaf169
M3 - Article
SN - 2631-9268
VL - 7
JO - NAR Genomics and Bioinformatics
JF - NAR Genomics and Bioinformatics
IS - 4
M1 - lqaf169
ER -