Abstract
In the process of clone-based genome sequencing, initial assemblies frequently contain cloning gaps that can be resolved using cloning-independent methods, but the reason for their occurrence is largely unknown. By analyzing 9,328,693 sequencing clones from 393 microbial genomes, we systematically mapped more than 15,000 genes residing in cloning gaps and experimentally showed that their expression products are toxic to the Escherichia coli host. A subset of these toxic sequences was further evaluated through a series of functional assays exploring the mechanisms of their toxicity. Among these genes, our assays revealed novel toxins and restriction enzymes, and new classes of small, non-coding toxic RNAs that reproducibly inhibit E. coli growth. Further analyses also revealed abundant, short, toxic DNA fragments that were predicted to suppress E. coli growth by interacting with the replication initiator DnaA. Our results show that cloning gaps, once considered the result of technical problems, actually serve as a rich source for the discovery of biotechnologically valuable functions, and suggest new modes of antimicrobial interventions.
Publication types
-
Research Support, N.I.H., Extramural
-
Research Support, Non-U.S. Gov't
MeSH terms
-
Anti-Bacterial Agents / metabolism
-
Anti-Bacterial Agents / pharmacology
-
Bacterial Proteins / genetics
-
Bacterial Proteins / metabolism
-
Base Sequence
-
Binding Sites / genetics
-
Cloning, Molecular
-
DNA, Bacterial / genetics*
-
DNA, Bacterial / metabolism
-
DNA, Bacterial / pharmacology
-
DNA-Binding Proteins / genetics
-
DNA-Binding Proteins / metabolism
-
Escherichia coli / genetics*
-
Escherichia coli / metabolism
-
Gene Expression Regulation, Bacterial
-
Genes, Bacterial / genetics*
-
Genome, Bacterial / genetics
-
Microbial Viability / drug effects
-
Microbial Viability / genetics
-
Molecular Sequence Data
-
Protein Binding
-
RNA, Bacterial / genetics*
-
RNA, Bacterial / metabolism
-
RNA, Bacterial / pharmacology
-
RNA, Transfer / genetics
-
RNA, Transfer / metabolism
-
RNA, Transfer / pharmacology
-
Sequence Homology, Nucleic Acid
-
Transcription, Genetic
Substances
-
Anti-Bacterial Agents
-
Bacterial Proteins
-
DNA, Bacterial
-
DNA-Binding Proteins
-
DnaA protein, Bacteria
-
RNA, Bacterial
-
RNA, Transfer
Associated data
-
GENBANK/JQ317270
-
GENBANK/JQ317271
-
GENBANK/JQ317272
-
GENBANK/JQ317273
-
GENBANK/JQ317274
-
GENBANK/JQ323150
-
GENBANK/JQ323151
-
GENBANK/JQ323152