Next-generation DNA sequencing (NGS) of human antibody repertoires has been widely used to discover novel antibody drugs, to analyze B-cell developmental features, and to investigate antibody responses to infectious diseases and vaccination. Because the antibody repertoire encoded by human B cells is highly diverse, NGS analyses of antibody genes have provided a new window into antibody responses for basic immunology, biopharmaceutical drug discovery, and immunotherapy. However, because most NGS technologies produce short reads, many antibody discovery protocols analyze the heavy and light chains separately, whereas paired heavy and light chain data are required for complete antibody characterization. Here, we describe a computational workflow to process millions of paired antibody heavy and light chain DNA sequence reads generated on the Illumina MiSeq 2x300 NGS platform. The workflow covers raw NGS read processing and initial quality filtering, the annotation and assembly of antibody clonotypes relating to paired heavy and light chain antibody lineages, and the generation of complete heavy+light consensus sequences for the downstream cloning and expression of human antibody proteins.
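As a rough illustration of two of the steps named above (initial quality filtering and consensus generation for paired heavy and light chains), the following Python sketch may be helpful. It is not the pipeline described in this chapter. The function names, the mean-quality threshold of Q20, the Phred+33 encoding, and the use of a (CDR-H3, CDR-L3) pair as the clonotype key are all assumptions made for illustration, and the sequences within a clonotype are assumed to be already aligned to equal length.

```python
from collections import Counter, defaultdict


def mean_phred(quality: str, offset: int = 33) -> float:
    """Mean Phred score of a FASTQ quality string (Phred+33 assumed)."""
    return sum(ord(c) - offset for c in quality) / len(quality)


def passes_filter(quality: str, min_mean_q: float = 20.0) -> bool:
    """Keep a read only if it is non-empty and its mean quality is high enough.

    The Q20 default is an illustrative threshold, not a value from the source.
    """
    return len(quality) > 0 and mean_phred(quality) >= min_mean_q


def consensus(seqs):
    """Per-position majority vote over equal-length (pre-aligned) sequences."""
    if not seqs:
        raise ValueError("no sequences given")
    if len({len(s) for s in seqs}) != 1:
        raise ValueError("sequences must be aligned to the same length")
    # zip(*seqs) yields one column of bases per position.
    return "".join(Counter(col).most_common(1)[0][0] for col in zip(*seqs))


def paired_consensus(records):
    """Group paired reads by a hypothetical clonotype key and build consensuses.

    records: iterable of (cdrh3, cdrl3, heavy_seq, light_seq) tuples.
    Returns {(cdrh3, cdrl3): (heavy_consensus, light_consensus)}.
    """
    groups = defaultdict(list)
    for cdrh3, cdrl3, heavy, light in records:
        groups[(cdrh3, cdrl3)].append((heavy, light))
    return {
        key: (
            consensus([h for h, _ in pairs]),
            consensus([l for _, l in pairs]),
        )
        for key, pairs in groups.items()
    }
```

In practice, annotation of V(D)J genes and CDR3 regions would be done with a dedicated annotation tool before a grouping step of this kind; the sketch only shows the general shape of filtering reads and collapsing each paired clonotype to one heavy and one light consensus.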
Keywords: Antibody discovery; B cells; Bioinformatics; Next-generation sequencing.
© 2023. Springer Science+Business Media, LLC, part of Springer Nature.