The human immunodeficiency viruses are two species of Lentivirus that infect humans. Over time, they cause acquired immunodeficiency syndrome, a condition in which progressive failure of the immune system allows life-threatening opportunistic infections and cancers to thrive.
Method:
We use Metamap to find biological semantics in the literature. TF and TF-IDF are used to evaluate the importance of entities respectively. TF represents word frequency, and IDF in TF-IDF represents inverse document frequency, which can be used to filter high-frequency words that cannot represent a certain field. Here, the TF-IDF calculation is to construct the semantics of a carcinogenic viruses and eight carcinogenic viruses into text respectively for calculation.