Abstract:Genome sequences of 4 644 representative strains from human gut microbiota were analyzed to mine gene clusters for biosynthesis of novel secondary metabolites, as well as genes encoding antibiotic resistance and virulence factors. AntiSMASH analysis showed that more than 60% of the representative strains encoded at least one secondary metabolite gene cluster, and 8 potential novel secondary metabolite gene clusters were identified from 8 unculturable bacteria. The secondary metabolite gene clusters in human intestine are mainly composed of nonribosomal peptide synthetase (NRPS), bacteriocin, arylpolyene, terpene, betalactone and NRPS like gene clusters distributed in Clostridia, Bacilli, Gammaproteobacteria, Bacteroidia, Actinobacteria and Negativicutes. PathoFact analysis showed that genes encoding antibiotic resistance and virulence factors are widely distributed in representative strains, but the frequency encoded by potential pathogens is significantly higher than that of non-potential pathogens. The frequency of genes encoding secretory toxins such as outer membrane protein, PapC N-terminal domain, PapC C-terminal domain, peptidase M16 inactive domain, and non-secretory toxins such as nitroreductase family, AcrB/AcrD/AcrF family, PLD-like domain, Cupin domain, putative hemolysin, S24-like peptidase, phosphotransferase enzyme family, endonuclease/exonuclease/phosphatase family, glyoxalase/bleomycin resistance was high in potential pathogens. This study may facilitate mining new microbial natural products from the intestinal microbiome, understanding the colonization and infection mechanism of intestinal microorganisms, and providing targeted prevention and treatment of intestinal microbial related diseases.