Ultra High Diversity Factorizable Libraries for Efficient Therapeutic Discovery

Dai, Zheng; Saksena, Sachit D.; Horny, Geraldine; Banholzer, Christine; Ewert, Stefan; Gifford, David K.

doi:10.1007/978-3-031-04749-7_40

Part of the book series: Lecture Notes in Computer Science (LNBI, volume 13278)

Included in the following conference series:

International Conference on Research in Computational Molecular Biology

Abstract

Z. Dai, S. D. Saksena and D. K. Gifford: Equal contribution.

The successful discovery of novel biological therapeutics by selection requires highly diverse libraries of candidate sequences that contain a high proportion of desirable candidates. Here we propose the use of computationally designed factorizable libraries, whose sequences are made of concatenated segments from smaller segment libraries, as a method of creating large libraries that meet an objective function at low cost.
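To make the construction concrete, here is a minimal sketch of how a factorizable library is assembled from segment libraries. The segment sequences and function names are hypothetical toy values chosen for illustration, not taken from the paper. The point it shows is the cost asymmetry: the number of full-length sequences grows as the product of the segment library sizes, while the number of segments that must be synthesized grows only as their sum.

```python
from itertools import product
from math import prod

# Hypothetical toy segment libraries (illustrative fragments, not from the paper).
segment_libraries = [
    ["AR", "AK", "AS"],
    ["DY", "GY", "EG", "DG"],
    ["FDY", "MDV"],
]

def factorizable_library(segments):
    """Yield every full-length sequence made by concatenating one segment per position."""
    for combo in product(*segments):
        yield "".join(combo)

def library_size(segments):
    """Number of concatenations: the product of the segment library sizes."""
    return prod(len(lib) for lib in segments)

def segments_to_synthesize(segments):
    """Number of distinct segments needed: the sum of the segment library sizes."""
    return sum(len(lib) for lib in segments)

full_library = list(factorizable_library(segment_libraries))
print(library_size(segment_libraries), segments_to_synthesize(segment_libraries))
```

In this toy case the concatenations are all distinct because every segment at a given position has the same length. With variable-length segments, different combinations can collide, so the product is an upper bound on the number of unique sequences.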

Designing segment libraries that result in a factorizable library that meets an objective function is a computationally difficult task. We present a computational method we call Stochastically Annealed Product Spaces (SAPS), which optimizes segment libraries through iterative improvements with respect to an objective function to design a full-length factorizable library. Key to our method is the reverse kernel trick, which allows us to efficiently evaluate an objective over the full factorizable library by casting the objective function as an inner product of feature vectors (see Fig. 1).
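The following sketch illustrates why an inner-product form can make evaluation over the full library cheap. It is not the objective used in the paper: it assumes a hypothetical product kernel, in which the similarity of two full-length sequences is the product of per-position segment similarities, and each segment similarity is a dot product of amino-acid count vectors. Under that assumption, the sum of the kernel over all pairs of library members factorizes into a product of per-position terms, so it can be computed from the segment libraries alone without enumerating the full library.

```python
from itertools import product

ALPHABET = "ADEFGKMRSVY"  # letters used by the toy segments below

# Hypothetical toy segment libraries (illustrative, not from the paper).
segment_libraries = [
    ["AR", "AK", "AS"],
    ["DY", "GY", "EG", "DG"],
    ["FDY", "MDV"],
]

def features(segment):
    """Assumed feature map: amino-acid counts of one segment."""
    return [segment.count(ch) for ch in ALPHABET]

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def pairwise_sum_factored(libraries):
    """Sum of the product kernel over all pairs, using only segment-level sums.

    For each position, sum the feature vectors of its segments and take the
    squared norm; the total is the product of these per-position values.
    """
    total = 1
    for lib in libraries:
        summed = [sum(col) for col in zip(*(features(s) for s in lib))]
        total *= dot(summed, summed)
    return total

def pairwise_sum_bruteforce(libraries):
    """Same quantity computed by enumerating every pair of full-length sequences."""
    combos = list(product(*libraries))
    total = 0
    for x in combos:
        for y in combos:
            k = 1
            for a, b in zip(x, y):
                k *= dot(features(a), features(b))
            total += k
    return total

assert pairwise_sum_factored(segment_libraries) == pairwise_sum_bruteforce(segment_libraries)
```

The brute-force version touches every pair of full-length sequences, which is infeasible for libraries with around \(10^9\) members, whereas the factored version only visits each segment once. An iterative optimizer such as the one described above could re-evaluate an objective of this form after each proposed change to a segment library.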

We show that SAPS outperforms five different benchmark sampling approaches on simulated datasets. We next apply SAPS to design factorizable libraries of the third complementarity determining region of antibody heavy chains (CDR-H3s). We show that this framework can generate factorized CDR-H3 segment libraries that, when joined combinatorially, contain \(\sim 10^9\) unique sequences with highly specific and flexible design parameters. We compare these libraries to a randomized library and show that SAPS-designed libraries are more diverse and more enriched in desirable sequences.

Applications of factorizable libraries include the discovery of biologics such as monoclonal antibody therapeutics [5], discovery of adeno-associated virus (AAV) vectors for gene therapy [1, 8], T-cell receptor (TCR) discovery [2, 4, 7], and aptamer libraries [3, 6].

Full Text Preprint: https://www.biorxiv.org/content/10.1101/2022.01.17.476670v1.

Data Availability: https://github.com/gifford-lab/FactorizableLibrary.

References

1. Bryant, D.H., et al.: Deep diversification of an AAV capsid protein by machine learning. Nat. Biotechnol. 39, 691–696 (2021)
2. Holler, P.D., Holman, P.O., Shusta, E.V., O’Herrin, S., Wittrup, K.D., Kranz, D.M.: In vitro evolution of a T cell receptor with high affinity for peptide/MHC. Proc. Nat. Acad. Sci. 97(10), 5387–5392 (2000). https://doi.org/10.1073/pnas.080078297, https://www.pnas.org/content/97/10/5387
3. Keefe, A.D., Pai, S., Ellington, A.: Aptamers as therapeutics. Nat. Rev. Drug Discov. 9(7), 537–550 (2010)
4. Li, Y., et al.: Directed evolution of human T-cell receptors with picomolar affinities by phage display. Nat. Biotechnol. 23(3), 349–354 (2005)
5. Liu, G., et al.: Antibody complementarity determining region design using high-capacity machine learning. Bioinformatics 36(7), 2126–2133 (2020)
6. Maier, K.E., Levy, M.: From selection hits to clinical leads: progress in aptamer discovery. Mol. Ther. Methods Clin. Dev. 5, 16014 (2016)
7. Smith, S.N., Harris, D.T., Kranz, D.M.: T cell receptor engineering and analysis using the yeast display platform. Methods Mol. Biol. 1319, 95–141 (2015)
8. Wang, D., Tai, P.W.L., Gao, G.: Adeno-associated virus vector as a platform for gene therapy delivery. Nat. Rev. Drug Discov. 18(5), 358–378 (2019)

Acknowledgements

This work was funded by NIH Grant R01 CA218094, and a gift from Schmidt Futures to D.K.G. The experimental work was funded by Novartis.

Author information

Authors and Affiliations

Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA
Zheng Dai, Sachit D. Saksena & David K. Gifford
Novartis Institutes for BioMedical Research (NIBR), Basel, Switzerland
Geraldine Horny, Christine Banholzer & Stefan Ewert

Corresponding authors

Correspondence to Zheng Dai, Sachit D. Saksena or David K. Gifford.

Editor information

Editors and Affiliations

Columbia University, New York, NY, USA
Itsik Pe'er

About this paper

Cite this paper

Dai, Z., Saksena, S.D., Horny, G., Banholzer, C., Ewert, S., Gifford, D.K. (2022). Ultra High Diversity Factorizable Libraries for Efficient Therapeutic Discovery. In: Pe'er, I. (eds) Research in Computational Molecular Biology. RECOMB 2022. Lecture Notes in Computer Science, vol 13278. Springer, Cham. https://doi.org/10.1007/978-3-031-04749-7_40

DOI: https://doi.org/10.1007/978-3-031-04749-7_40
Published: 29 April 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-04748-0
Online ISBN: 978-3-031-04749-7
eBook Packages: Computer Science, Computer Science (R0)
